I recently came across a fascinating research study which cited some incredible statistics relating to the amount of data companies generate.
According to ESG, “nearly 50% of mid-market companies (100-999 employees) report more than 10 TB of total data volume and nearly 50% of enterprise organizations (1,000 employees or more) cite more than 100 TB.” The study went on to say that data volumes are growing by 20% or more per year, meaning that data volume is doubling every two to five years.
Armed with this information, I dusted off the calculator and started producing our own set of intriguing numbers.
According to the 2007 Census, there are 639,288 US companies with >20 employees, broken down as follows:
Businesses 20-99 Employees (2007 US Census) 83.3% or 532,391
Businesses 100-499 Employees (2007 US Census) 13.8% or 88,586
Businesses 499+ Employees (2007 US Census) 2.9% or 18,311
Total Market 100% or 639,288
Based on ESG’s study, we assumed an average of 5TB of data resided in the 20-99 segment, 10TB in the middle segment, and 50TB in the 499+ plus market. Here’s the result – which shows a staggering 4,463,365 TB of data volume.
Average TB of Data 20-99 Employees @5TB, 2,661,955TB
Average TB of Data 100-499 Employees @10TB, 885,860TB
Average TB of Data 499+ Employees @50TB, 915,550TB
Total Market - TB of Data Stored 4,463,365TB
With massive amounts of data on hand, companies everywhere are seeking new, better and more economical ways to protect and store their data, and many are finding that the Cloud presents an excellent option. It’s accessible, inexpensive, secure and reliable, and its scales effortlessly. Still, as appealing as the Cloud is today for those seeking data storage solutions, I believe that there is one important element missing in the Cloud as we know it today - Data Storage Assurance.
Today’s challenge is less about how and where companies store their burgeoning amounts of data, and more about how they maintain the integrity of that data and how easily that data can be found and retrieved. Companies need to ensure clean, trusted audit trails when embracing the Cloud for data storage, and once they’ve assured the integrity of their stored data, they need to ensure they can easily and rapidly navigate the massive amount of files that comprise their data – their “digital haystack.” Only then will they be able to find the data they need, when they need it, and with the identifying tags that prove to any and all that their stored files are authentic.
The Digital Haystack isn’t going away, and neither is the Cloud. And if you do put the two together in the right way with strong data audit trails that deliver Cloud storage assurance, the relationship will work now and for years to come.