Archive for the ‘I/O performance’ Category

Never send HDD to do the job fit for EFD…

Monday, April 14th, 2008

Who could ask for more than seeing a new storage industry product announcement to highlight the points you’ve been trying to make?
 
I found myself in that position, and was quite surprised (well not really surprised…more like incredulous) to see a recent announcement of what had been frequently referred to as the Seagate “brick” project (not related to MiniScribe), but minimally disguised within a Seagate-funded private company.  The product that was announced is another version of a sealed unit consisting of multiple hard drives “purpose-built to maximize performance and reliability.”  The announcement makes it clear that many new techniques must have been employed to achieve “self-healing,” and to enable the product to essentially repair itself in place “to the equivalent of a fresh, factory-manufactured drive.”  Wow!  I will leave it up to people smarter than me to respond to this.

What I’d like to discuss is the price performance aspect of this announcement.  The systems tested were fully mirrored, making comparisons never quite “apples to apples.”  However, one needs to keep in mind that the MTBF of the drives employed require mirroring to reach any reasonable reliability level.  While I could not find any real price or performance data on the company’s web site, the reference to their SPC benchmarks provided considerable data.
 
From a pricing standpoint, the 1.03TB configuration sells for more than $36 per gigabyte (after a 40% discount from $60/GB)…and, flash-based SSD at $30/GB is considered expensive?
 
This benchmark is also said to be record-breaking with the lowest cost per SPC-1 IOPs.  I’m not suggesting that $36/GB is unreasonable, only that it illustrates the true cost of hard drives in high-performance environments.  A closer look at the benchmark is even more telling.  This “record-breaking” performance correlates to a response time of nearly 30 milliseconds.  In fact, response time increases dramatically starting at about 50% of the max IOPs, which is certainly troublesome for high transaction-rate systems.

This project was started a few years ago, apparently to address the growing price, performance and reliability gap in enterprise applications, as we have been talking about, and to hold off the encroachment of solid state storage devices.  However, with today’s technology, well designed Enterprise Flash Drives will not only be lower in cost per GB, less than 1/4th the cost per IOP, and more reliable.  And, did I mention power:  EFD’s will be well less than 1/100th the watts per IOPs.  I cannot help but be reminded of the Anderson Cooper segment on CNN:  “What were they thinking!”

Amyl Ahola

Hard disk is free…hardly!

Wednesday, March 26th, 2008

The dramatic reductions in HDD cost per GB have resulted in many system/storage architects (and application/operating system programmers) treating primary storage as though it is free.

Some of the results are:

  • Exponential increases in the size of operating systems and applications
  • Mass deployment of low-end and midrange servers with multiple copies of data (and applications)
  • Over-provisioning of storage to satisfy future needs projections (which also likely adopt the concept of free storage)
  • Adoption of power-hungry DRAM cache appliances to mask HDD performance shortfalls
  • Over-provisioning of HDDs to mask HDD performance shortfalls

These all result in inefficient use of storage that has many costs, not the least of which is the increasing cost of energy consumption.  Some of the energy data becoming available paints a sobering picture:

  • Data centers account for 1.5% of ALL U.S. electrical consumption, and this is expected to double in a few years
  • Power consumption per $1,000 of server spending has increased by a factor of 4 since 2000
  • Power failure and availability is expected to halt data center operations at more than 90% of all companies over the next few years
  • Fifty percent of current data centers will have insufficient power and cooling capacity this year

HDDs are clearly not the only contributor to the rapid acceleration of data center power consumption, but their inefficient use is likely one of the largest contributors.  Data that suggests more than one third of data center power consumption is storage related.

Trends and techniques such as consolidation, virtualization and thin provisioning should all contribute to improved efficiencies.  But while doing so, these approaches will put increased performance demands on the HDDs.  The result:  an increased need for higher performance (i.e., higher RPM……read that as ‘power consuming’) drives and even further over-provisioning for performance – and therefore once again increased energy consumption.

It’s time for new metrics to be considered in the data centers, which take into account energy usage to aid the system designers as they optimize their systems.  Several metrics are identified at the www.greendatastorage.com website; examples cited include activity per watt, such as transactions/Watt, IOPs/Watt, and bandwidth/Watt.

I believe that Enterprise Flash Drives (EFDs) will play a major role in reversing these trends. EFDs can provide over 1000x improvement in IOPs/Watt, and an order of magnitude or more improvement in bandwidth/Watt over the highest performing HDD’s.

Amyl Ahola

Addressing the enterprise performance gap

Monday, March 17th, 2008

Ok, I want to get back to the issue I was discussing a couple of posts ago.

The question I’m exploring is what can be done about the growing gap between disk drive and enterprise network performance, as well as the escalating inefficiencies?   One only has to look at the root cause:  the mechanical nature of disk drives.  The solution is obvious; eliminate the mechanics. 

Easier said than done! 

The Holy Grail for primary storage has always been directly addressable low latency, non-volatile random access memory.  This remains a long way off, but it is time to begin the next evolutionary step.  Solid state technology (particularly Flash) cost and performance continues to improve geometrically, and new and even more competitive semiconductor storage technologies are around the corner.  Meanwhile, disk drive performance (seek, latency) is stagnating, with only limited foreseeable improvements and with cost per I/O leveling off or even beginning to increase with time. 

Last year Greg Schulz of the StorageIO Group predicted the increasing use of solid state technology in enterprise storage applications, saying (paraphrased) that 2008 will be the year of awareness and early adoption by vendors and early deployment by customers, while 2009 will be the broader adoption phase.   Supporting that projection, EMC has recently announced their commitment to Flash and is the first major enterprise storage company to do so (http://www.emc.com/about/news/press/us/2008/011408-1.htm).  Although it was a limited announcement with what I consider an ‘entry level’ SSD technology, it is the first step towards validation of Flash technology as an enterprise primary storage device.

Solid state storage has the potential to be transformational, relegating disk drives to applications that better match their strengths, low cost per GB and large block sequential applications (for the old timers amongst us, it should be noted this is similar to the role disk played years ago with respect to magnetic tape drives).

But Flash comes with its own set of problems…(Stay tuned)

Amyl Ahola

A different “Green IT” point of view

Monday, March 10th, 2008

I was intrigued by a blog post by Chuck Hollis of EMC last week (“Chuck’s Blog”) offering an interesting perspective on the whole “green IT” issue:  http://chucksblog.typepad.com/chucks_blog/2008/03/green-it—-are.html

Chuck suggests that the IT industry may be missing the point on “green IT.”  Specifically, while it is important to pursue green IT goals from an energy efficiency perspective, the real goal should be “efficient IT,” which can, as a result, generate a number of green benefits, including improved power consumption and footprint reductions.  He further suggests that just because something is green, it doesn’t necessarily mean it is efficient.

I couldn’t agree more.  While I believe that green IT is a critical objective for virtually all enterprise IT environments, there are a number of IT efficiency issues that must be addressed now for their own sake.  Take the use of enterprise HDDs for example.  Many of today’s IT managers are using 3 to 4 times more HDDs than they need from a capacity perspective just to meet growing I/O performance requirements.   This “over provisioning” is at best a band-aid approach to improving I/O, and is probably one of the most inefficient uses of IT technology I’ve seen.

I’ll have more to say on this later, but for now I just wanted to note the importance of pursuing efficient IT for its own sake.  The benefits can be many, not the least of which is a greener IT environment.

Amyl Ahola