How Seagate Tests Its Hard Drives

Tom's Hardware gets a rare and in-depth look at how Seagate designs and tests its hard drives. Join us for a tour through the company's Longmont, Colorado R&D center.

Imagine spending vast resources on your profession, whether that's providing IT support services, making movies, raising kids, preparing the city’s finest food or steam cleaning carpets. People know what you do, and you do it every day. However, almost no one understands how you do what you do.

Why does this matter? If you provide IT services, by understanding your processes, potential customers will have a better sense of what fuels your expertise and dependability. After all, Google reviews only lend so much credibility, right? Anyone can claim to provide the best carpet cleaning. But you won’t believe those claims unless you understand the chemicals and equipment used.

The same holds true for hard drive manufacturers. Seagate spends $2 billion annually on R&D, not that you probably gave much thought to its investments when you bought your last multi-terabyte hard drive. The upside of this annual outlay is Seagate’s 1.2 percent annual failure rate. This is a company-wide average. Business drives will inevitably feature a lower AFR than consumer-class models. But saying, “Two billion USD buys better AFR numbers” doesn’t tell you much. We’re big believers in showing rather than telling.

To this end, we hired local photographer Noah Katz and packed our bags for beautiful early summer Boulder, Colorado. Seagate’s reputation as a close-to-the-vest, conservative company is well-founded. Very rarely does the drive behemoth open its doors to the press, much less at one of its four R&D centers (one is in Minnesota and another two are in Asia, although the Longmont, CO facility is the largest). So when senior staff in Longmont agreed to let us peek behind the glass curtain, we jumped.

A “sea gate” is a route or channel that gives access to the ocean, or, alternatively, that offers protection against the sea. In 1978, when the company (then called Shugart Technology) was founded, the exabyte-era metaphor of “the digital universe” had yet to become common. The “sea of information” prevailed. Hard drives, naturally, were set to become the conduit to this growing, seemingly limitless body of information. The nautical theme persists even now in the architecture of the Longmont R&D center, sporting a front edifice much like a ship’s rudder (or prow, depending on your perspective) and an interior bolstered with exposed beams.

Taking the metaphor to its drowning point, one could argue that Seagate’s Longmont center serves a similar function as the ocean faring explorers of centuries past. To tame those infinite storage seas, one must first chart a course. You don’t hear about most of the explorers who found their way to the ocean floor. Neither do you learn about all of the technology paths and drive designs that ended up in recycling bins. You simply enjoy getting more drive capacity for less money than ever before and chalk it up to the natural course of things—only there’s nothing natural about it. Breakthroughs require endless years of mind-bending, exhaustive work, not to mention one or two billion dollars in annual R&D.

After 35 years of design launches and a dizzying chain of storage manufacturer acquisition and consolidation, Seagate sticks to its Product Development Process (PDP) like a holy play book. It works. PDP drive development breaks down into eight stages, as shown here:

The focus of our infographic is on the Design phase, and this is where most of the Longmont center’s efforts lie. However, Longmont’s work actually begins earlier, in the Concept phase. During Concept, Seagate's marketing team works with customers to define product requirements, assess the market opportunity, and develop early proof of concept designs leveraging the company’s existing technology portfolio. After spending up to five years on road mapping and assessing technological and marketing feasibility, technology staging teams pass the baton to a “Core Team” to productize a drive design. The Core Team generates a contract agreement containing all of the key metrics the design must hit, covering specs such as power and performance, as well as a testing methodology that proves all metrics have been met or exceeded. This becomes the Core Team’s beacon to reach.

The Design phase delivers a working proof of concept drive. It’s almost a Bob the Builder question. “Can we build it?” Yes, we can. The resulting drive might be analogous to the Wright Brothers’ original Flyer, which never achieved flight for more than 59 seconds — but it worked.

During Design, the Core Team will either redesign a drive or leverage an existing drive design platform. Note that a drive design is not the same thing as a specific drive. A drive design is a base HDD architecture that can be leveraged to meet various market requirements. For instance, utilizing the drive design concept, a five-platter 4TB HDD design in the 3.5” form factor became the predecessor to a six-platter 6TB Enterprise Capacity 3.5 HDD. Engineers upped the disk count from five to six per drive, made minor improvements to the media design, changed the write head design, updated the nearline firmware code and so on. These were all iterative updates rather than a massive overhaul, but they were necessary to allow Seagate’s 7200 RPM drive line to evolve. Designs typically yield multiple drives and sometimes span multiple drive families, depending on how specific combinations of factors performed throughout development.

The core team’s journey begins in a room much like this one, where conferences and head-down concentration and collaboration can go on for over a year. The Core Team becomes like a second family, enabling tight idea flow and greater efficiency. It might seem strange that this Design phase in Longmont could stretch on so long, but the number of details that must be addressed when making even the smallest of design changes can be staggering. As a case in point, that transition from five disks to six ultimately required a complete redesign of the drive’s printed circuit board. Screw counts changed, chassis mounting hole locations moved, air recirculation filters and disk lubricants both had to be modified, and so on and so on, over painstaking months of analysis and simulation.

At last, after figuring out how a design could be achieved, then feasibility analysis determines whether the design should be achieved. For example, all other factors being equal, no one is going to buy a slower drive for the same price. The feasibility assessment must confirm that a design will meet the company’s profit requirements while still delivering products at an attractive price. If a new drive doesn’t offer some sort of compelling value for the company and customers alike, there’s no point in building it.

MORE: Best SSDs For The Money
MORE: How We Test HDDs And SSDs

MORE: All Storage Content

Create a new thread in the US Reviews comments forum about this subject
This thread is closed for comments
93 comments
    Your comment
    Top Comments
  • Mike-TH
    So if their testing is so good, why are their drives among the worst for reliability - to the point where most IT people I know actually refuse to use them, or if forced to use them will keep (and use) more spares than for other makers.
    11
  • Tom20160027
    The article explains the different types of drive/MTBF and why the backblaze test is useless information. Marketing plot to have folks talking about it and re-posting its link. It seems to work as we keep seeing the link over and over... They are not getting my data. They put drives designed for desktop into servers and run them to the ground and call it a "reliability test". Let's test my kids bicycle with training wheels at the Tour de France and complain about its quality....

    I know IT folks that refuse to use other brands of drives as well. I know IT folks that refuse to use servers from this brand or that brand. We can find anecdotal information about anything. It does not make it true.
    10
  • Other Comments
  • tom10167
    Awesome photos. I don't know what the last picture is but I know I need one of those in my house.
    0
  • Rookie_MIB
    Quote:
    Awesome photos. I don't know what the last picture is but I know I need one of those in my house.


    That is an enterprise storage rack full of 2u hotswap chassis. 18 chassis, 12 drives per chassis = 216 drives @ 6tb (?) per drive = 1,296 terabytes or 1.3 Petabytes.

    You could store a lot of TV shows or movies on that thing. Imagine how many of those are used for YouTube? Yikes. They get 300 hours of footage uploaded every minute.
    2
  • Mike-TH
    So if their testing is so good, why are their drives among the worst for reliability - to the point where most IT people I know actually refuse to use them, or if forced to use them will keep (and use) more spares than for other makers.
    11
  • Tom20160027
    The article explains the different types of drive/MTBF and why the backblaze test is useless information. Marketing plot to have folks talking about it and re-posting its link. It seems to work as we keep seeing the link over and over... They are not getting my data. They put drives designed for desktop into servers and run them to the ground and call it a "reliability test". Let's test my kids bicycle with training wheels at the Tour de France and complain about its quality....

    I know IT folks that refuse to use other brands of drives as well. I know IT folks that refuse to use servers from this brand or that brand. We can find anecdotal information about anything. It does not make it true.
    10
  • Glock24
    Seagate tests their drives? I thought they didn't!

    I've had more Seagate drives die without warning than any other brand. The only ones that have survived are some old 250GB Barracuda ES. All other models I've owned had lots of bad sectors or just stopped working before the first year, but SMART almost always says the drive is fine!
    5
  • zodiacfml
    Yawn. All I think of right now is that HDDs will become the tape drives of the past.
    -8
  • Garrek99
    The only drives I've ever had go bad on me were Seagate drives.
    Every other drive I've ever purchased simply became obsolete due to size and thus replaced.
    They should be reading about how the other drive makers do their testing and learn from that. Hahaha
    3
  • rosen380
    Maybe things changed... but all of my old SGI machines always had Seagate drives in them and the 20+ year old drives all still work. Hell look at what these drives *sell* for on eBay:
    http://www.ebay.com/sch/i.html?_sc=1&_udlo=0&_fln=1&_udhi=200&LH_Complete=1&_ssov=1&_mPrRngCbx=1&LH_Sold=1&_from=R40&_sacat=0&_nkw=%28st31200N%2C+st32171N%2C+st32272N%2C+ST34371N%2C+st34520N%2C+st34573n%2C+st39173N%2C+st318417N%2C+st52160N%29&_sop=16


    4.5 GB drives *selling* for $150+ I see a 2Gb for $120.

    They must have been pretty decent at some point if SGI was putting them in their $5000-20000 workstations and people are spending $40+ per GB to get these now...
    0
  • rosen380
    Link was too long... http://tinyurl.com/gntz4p2
    -2
  • Bossyfins
    How does Segate test their hard drives?


    They don't LOL


    It is nice to see this, but failure rates to damn high.
    -1
  • Colin_10
    This article is reminiscent of a "How it's made episode," with the exception that it actually covers the details that make a manufacturing process interesting. Well written article.
    3
  • Bossyfins
    ^ I agree, but If they test their HDDs, I should be able to see results, aka; less failure rates.
    2
  • kittle
    I never understood everyone griping about HDD failures for a specific brand. I have several seagate drives that are 10yrs old and STILL WORK FINE. I have several WD drives that still work and they are also 10yrs old. they are slow compared to today's standards, but they work.

    Of all the drives i have used, only 3 have failed in 20+ years of using PCs. 1st one was already well abused and it fried a chip on the pcb. 2nd one cooked itself because I had no clue 10k rpm drives needed active cooling and the 3rd one failed because I repeatedly dropped it on the floor.

    take care of your drives and they will return the favor
    3
  • none12345
    "Yawn. All I think of right now is that HDDs will become the tape drives of the past. "

    Perhapps one day....however you do realize that tape drives are still used right?

    Tapes are still cheaper then hard drives, are still cheaper then ssds, on a cost/TB. They are still the cheapest way to archive lots of data.
    4
  • iknowhowtofixit
    "How Seagate Tests Its Hard Drives"

    Lol...
    -4
  • jimmysmitty
    Anonymous said:


    How did I know that this was going to come up.

    Anonymous said:
    So if their testing is so good, why are their drives among the worst for reliability - to the point where most IT people I know actually refuse to use them, or if forced to use them will keep (and use) more spares than for other makers.


    Most IT people buy their servers, SANs etc prebuilt and typically have no choice in what brand of drives are used as that is normally decided by the OEM such as Dell, Nimble etc. We have servers with all brands. In fact so far in the year I have worked here we have had more WDs fail than Seagates or Toshibas but that does not mean the WDs are worse.

    Anonymous said:
    The article explains the different types of drive/MTBF and why the backblaze test is useless information. Marketing plot to have folks talking about it and re-posting its link. It seems to work as we keep seeing the link over and over... They are not getting my data. They put drives designed for desktop into servers and run them to the ground and call it a "reliability test". Let's test my kids bicycle with training wheels at the Tour de France and complain about its quality....

    I know IT folks that refuse to use other brands of drives as well. I know IT folks that refuse to use servers from this brand or that brand. We can find anecdotal information about anything. It does not make it true.


    The problem is getting people to grasp the concept of using a product in an unintended environment AND, to top it off, not even mounting them properly in some cases thus making this data unusable for consumers as no consumer OEM PC or self built PC will have a improperly mounted drive in a torture environment. It is much like using Prime95 these days. Sure it is great if you want absolute maximum temps but if you run the Asus real world stress test you will have a better picture of how your system will perform.

    The best thing about that is that in their Q3 report, Seagates dropped to under 6% failure rates yet WD jumped to 8%.

    I also found another interesting fact, the 3TB Seagate everyone fears is a HDD that is rated for 2400 power on hours per year, it is not meant to be on 24x7. Move to the Enterprise class 3TB and guess what? It is designed and rated for 24x7 use.

    It is just another flawed study of a component that is hard to put real world failure rate numbers out since there are a ton of different reasons for HDD failures due to so many different configurations and environments.

    Other than that the article was very interesting.
    2
  • teahsr
    "How Seagate works with Toms to recover some credibility"

    is what this article should have been entitled. Clear attempt to co-opt Tom's to recover a reputation tarnished by awful reliability. I wonder how much advertising Tom's sold Seagate in conjunction with this article?

    Quote:
    The article explains the different types of drive/MTBF and why the backblaze test is useless information. Marketing plot to have folks talking about it and re-posting its link. It seems to work as we keep seeing the link over and over... They are not getting my data. They put drives designed for desktop into servers and run them to the ground and call it a "reliability test". Let's test my kids bicycle with training wheels at the Tour de France and complain about its quality....


    Backblaze's use of hard drives is neither pure server or consumer. Typically usage patterns are write the data, leave it there, rarely if ever reading it. That is the nature of cloud backup. Not withstanding backblaze's data, Seagate has form in unreliability - anyone remember the 7200.11 1tb drives - they died like flies and Seagate covered it up for ages.

    the reason I'll never buy Seagate again, is how they deal with dodgy drives. I had 7200.11 drives die just outside the warranty period - Seagate refused to replace. Seagate has continued to sell drives with know failure rates of 40% - not ethical and not acceptable.
    2
  • jimmysmitty
    Anonymous said:
    "How Seagate works with Toms to recover some credibility"

    is what this article should have been entitled. Clear attempt to co-opt Tom's to recover a reputation tarnished by awful reliability. I wonder how much advertising Tom's sold Seagate in conjunction with this article?

    Quote:
    The article explains the different types of drive/MTBF and why the backblaze test is useless information. Marketing plot to have folks talking about it and re-posting its link. It seems to work as we keep seeing the link over and over... They are not getting my data. They put drives designed for desktop into servers and run them to the ground and call it a "reliability test". Let's test my kids bicycle with training wheels at the Tour de France and complain about its quality....


    Backblaze's use of hard drives is neither pure server or consumer. Typically usage patterns are write the data, leave it there, rarely if ever reading it. That is the nature of cloud backup. Not withstanding backblaze's data, Seagate has form in unreliability - anyone remember the 7200.11 1tb drives - they died like flies and Seagate covered it up for ages.

    the reason I'll never buy Seagate again, is how they deal with dodgy drives. I had 7200.11 drives die just outside the warranty period - Seagate refused to replace. Seagate has continued to sell drives with know failure rates of 40% - not ethical and not acceptable.


    Known failure rate of 40%? Where do you get these numbers? How are these numbers obtained? What scenarios are these numbers obtained in?

    BTW, did you know consumer HDDs from most brands are not rated for 24x7 operation hence why they should not be used in any server environment? It is a server environment in that it is a server with multiple HDDs running 24x7. They may not be read from all the time but they are spinning all the time and most 7200RPM HDDs will spin at 7200RPM unless they have a power saving feature.

    Running a consumer drive rated for normal use, i.e. that it will be powered on and off, will result in much different failure rates.

    BTW, the majority of companies will not replace a product outside of the warranty unless it is an issue affecting a massive amount of people such as a recall.

    This is not the first time Toms has gone into a company to see how they do things. They did not get paid anything nor did they sell advertising to Seagate.
    0
  • firefoxx04
    Here we go again. Seagate drives died when other brands didn't. I don't care if they were misused. I want drives that can be abused and still work. Not fragile drives.
    1