Martin Courtney

Don't be duped by dedupe

It is impossible to know how many terabytes of data you can fit into a Tardis unless you actually try it

Written by Martin Courtney

Firms should never promise anything unless they are certain they can deliver, but you have to question whether some of the data de-duplication vendors can back up their claims.

Sepaton’s DeltaStor is the latest in a growing line of dedupe solutions that claim to shrink large volumes of information held on enterprise servers to a tiny percentage of their original size, before shoehorning them onto far fewer hard disks than would otherwise be required for backup purposes.

DeltaStor’s marketing department reckons you can squeeze 50 petabytes (PB) onto a 1PB virtual tape library (VTL), a compression ratio of 50:1. This is feasible because every organisation has carbon copies of file, block and byte sized data needlessly stored in multiple locations all over the place, which can be greatly reduced using an efficient method of finding it, comparing it and discarding the bits that are not already safely tucked away somewhere else.

Performance and headroom issues aside (the indexes of pointers that dedupe solutions create to let us know where on the network the master copy is stored can be fairly large and difficult to process themselves), dedupe is a sensible and desperately needed approach to the growing burden of compliance and data retention rules.

Few would question the wisdom of being more selective about the stuff being copied to all those expensive disk arrays, but it is important to remember that, because not every data type is the same, the compression rates that are actually achievable in uncharted storage environments vary.

Some files (or bits of files), like those email messages we all send by clicking the “reply to all with history” button, contain lots of repeated information that does not have to be stored on the hard drive of every individual in the company that received it – a single copy of the original is all you need to save in case the next Enron breaks at your company. But could the same level of compression be achieved with a large relational database?

To prove their worth, many dedupe vendors are happy to provide a “try before you buy” sample of their technology, which prospective customers can install and run over time to see the results that can be achieved using their own unique file sets and everyday company transactions.

Without being put to such a test, those compression claims are a bit like saying an unhealthy, middle-aged journalist with a permanent hangover can run 10km in under 40 minutes – which I can, given a downhill course, a strong tailwind and a chasing pit bull. But, unfortunately, ideal conditions do not always prevail.

Tags:

reader comments

related articles

De-duplication advances puts 50PB on single VTL

Virtual tape specialist Sepaton announces 50 petabyte disk array 12 Feb 2007

 

A lot of life in the old tape yet

Businesses still reliant on traditional backup method as part of ‘belt and braces’ disaster recovery 15 Aug 2008

A lot of life in the old tape yet

Businesses still reliant on traditional backup method as part of ‘belt and braces’ disaster recovery 14 Aug 2008

HP punts de-duplication systems

HP is aiming its new de-dupe solutions at a range of firms 23 Jun 2008

related whitepapers

today's top stories

CIOs must embrace collaboration tools

Author Don Tapscott gives Angelica Mari his reasons for promoting social networking tools and says transparency is the key to security 04 Dec 2008

On a quest to build a connected society

BT Design’s JP Rangaswami talks to Gareth Morgan about his pivotal role in the telecoms giant’s efforts to deliver universal broadband and his plans to tap into the creativity of the open source community 04 Dec 2008

IT leaders must stand by India

A sense of perspective is the most important response from IT leaders to the attacks in Mumbai 04 Dec 2008

Case study: Clifford Chance

Law firm implements Sun platform and reduces datacentres to gain efficiency and cost synergies 03 Dec 2008

Should CRM be more sociable?

As vendors rush to add more social networking bells and whistles to their CRM products, some experts warn that users must tread carefully when venturing into online communities 03 Dec 2008

Advertisement

Newsletter signup

Sign up for our range of FREE newsletters:

Existing User

Newsletter user login:

Advertisement

Jobs

Related jobs

Job of the week

Job alerts

Sign up here

Find your next job

IT Salary Checker

Check salary here

Advertisement

White papers

Search white papers

Top categories

VPN, Extranet and Intranet Solutions

WAN/ LAN Solutions

Network Security

Interoperability-Connectivity

Grid/ Utility Computing

Latest poll

Will the terrorist attacks in Mumbai affect your offshoring plans?

Will the terrorist attacks in Mumbai affect your offshoring plans?

Is India becoming a risky destination?

Previous poll results

Latest audio and video articles

Padlocked CDVideo

Technology and privacy

Watch the final video in a two-part Computing roundtable debate on the importance of putting data privacy issues at the heart of your IT plans 02 Dec 2008

Podcast imageAudio

Computing podcast - Standard Life's offshoring plans; and the prospects for government IT

The insurance giant outlines its new outsourcing strategy; and we ask if the government's economic bailout will affect its IT plans 28 Nov 2008

Latest in-depth articles

Doctors looking at a computerAnalysis

Watchdog wants IT to cure privacy woes

Information Commissioner Richard Thomas is urging organisations to put privacy protection at the top of their procurement and development criteria 04 Dec 2008

Colin McDonaldComment

Web 2.0 has potential to transform staff training

Employees can sharpen their IT skills through using the latest interactive training tools, writes Colin McDonald 04 Dec 2008

Advertisement

Primary Navigation