Best offers
|
My Passport Essential 500GB Portable... | $129.99 STAPLES More info |
|
My Book Essential Edition External... | $109.99 Newegg.com More info |
|
Simplesave External 1.5TB 3.5" Hard... | $143.99 HP Direct More info |
|
Caviar Black 1TB Hard Drive (Serial... | $91.99 Dell Small Business More info |
|
My Passport Essential Portable 320GB... | $134.00 ServerSupply.com More info |
Picking A Hard Drive For Your NAS: New Green Beats Old Speed
When it comes to mechanical hard drive performance, 7,200 RPM drives are considered the fastest. But does that convention apply in a NAS environment as well? We compare network storage performance with a handful of Samsung's "green" drives to see. Read More
-
Thecus N8800 NAS/SAN: Quick-Look
After recently running out of room on my Thecus N5200, I again turned to the company for something larger. What I found won't work for most folks (unless you have a rack in your garage, that is). But it's reasonably-priced and plenty fast for most SMBs. Read More
-
Promise's NS4600: Intel's Tolapai Enables Better Network Performance
Promise has revamped its NS4300N network storage device. Among the changes, its new NS4600 comes with a new engine--namely, Intel’s new x86-based EP80579 (code named Tolapai). Read on for a feature comparison and a thorough benchmark evaluation. Read More
- raid 5 failure what to do
- increase raid 5 size
- raid 5 one drive fails
- how to recover raid 1
- hard drive for storing data 2009
- recover raid 5 array
- raid 5 increase size
- raid 5 recovery time
- raid 5 disk loss 4
- setting up raid 2009
- what hard drive failed raid 5
- raid 5 2009
- recover raid 5
- recover raid
- raid 5 two drive failure
Partners
The Games selection
crazy :
PC Breakdown
What is worst than a Fatal Error occuring during a game you did not save? Unleash your rage at your PC in this game. Blow it to pieces, it feels so...
|
adventure :
Scoobydoo: Episode 2
The sequel of Scooby and Sammy's adventures. Same principle as in the previous episode (available on this website). Click on "Instructions" to see...
|
Sponsored links
RAID 5 May Be Doomed in 2009
Next news- Email |
- Print |
- Comments (34) |
- Share
A story appearing online is forecasting the doom of RAID 5 in 2009. Apparently with storage capacities of modern SATA hard drives now reaching 2-terabytes in size, the odds of a read error during a RAID 5 disk reconstruction is becoming unavoidable.
According to Zdnet, SATA drives often have unrecoverable read rates (URE) of 10^14, which implies that disk drives will not be able to read a sector once every 100,000,000,000,000 bits read. With hard drive capacities expected to reach two-terrabytes in 2009, the odds of a read error become practically unavoidable when recovering from a 7-drive RAID 5 disk failure. Upon encountering such a read error during a reconstruction process, it is claimed that the array volume will be declared unreadable and the recovery processes will be halted. Apparently all 12-terrabytes of data stored on the drives will be lost... or at least will require some extra effort and knowledge to recover.
RAID 5 is described as a striped set with distributed parity, which protects against a single disk failure. When a drive fails in a RAID 5 set, the failed drive can be replaced, the data can be rebuilt from the distributed parity and the array can eventually be restored. If more than one drive fails however, the array will have data loss. For some, this can make the reconstruction process after a single drive failure a stressful event, as the array during that time will be vulnerable to more drive failures.
While using RAID 6 instead may seem like a solution, where RAID 6 is two drive failures are allowable instead of just one, the increased redundancy may not be cost effective. Also, as hard drive capacities continue to increase exponentially, year after year, even RAID 6 may soon become prone to the same problems. When single disk drives become 12-terrabytes in size, even a direct drive-to-drive copy may commonly encounter these read errors. The use of disk drives that have smaller capacities and improved unrecoverable read rates could be a solution to avoid these potential headaches.
The problem comes from the increasingly tight data density packed onto drive platters. Using traditional means, bit magnetic poles can often leak their polarity onto other adjacent bits, causing a switch in an otherwise normal bit. Manufacturers have switched to perpendicular recording methods to avoid such problems and increase density, but even this method has its physical limits. Manufacturers will have to find more creative solutions down the road if drives are going to exceed 2TB in size.
Source : Tom's Hardware







"When single disk drives become 12-terrabytes in size"
Surely we are nowhere near that right?
News about an article posted 15 months ago? You must have quite a backlog!
At the rate drive capacity is growing... it'll be sooner than you think.
At the rate drive capacity is growing... it'll be sooner than you think.
i agree. i remember when i got my dell p4 system in early 2004. it came with a 120GB hard drive which was fairly impressive for its time, and now we're already up to 1.5TB, more than ten times the capacity in only 4 years. getting up to 12TB may happen within the next couple of years.
RAID 5 seeing this issue, I can see. RAID 6 maybe but by the time we have 12TB drives, they are not longer going to be spindles, SSDs are far from high capacity now but given time that will happen.
I still don't understand how RAID with parity can be less reliable than 1 drive with no parity. I never in my life had more than 1 drive fail at the same time (with big & small RAID5 arrays).
i.e. If one 12TB drive fails in RAID5, it's POSSIBLE (or likely) a reconstruction can fail. If one 12TB drive fails with no RAID, all data is 100% gone for sure.
Where can I find more info on this?
Thanks!
@enewmen
The point of this article was that the increase in drive size will increase the chance of read errors, thus increasing the chance of an error happening while you are rebuilding your array. According to this article, if there is a read error during the reconstruction then the whole array will be lost. If you'll remember from earlier, the read error will be more likely due to the size hard drives will be in the near future. So, in conclusion, your experience with hard drives in the present has no relevance to the issues of larger hard drives of the future, which is the subject of this article.
I still don't understand how RAID with parity can be less reliable than 1 drive with no parity. I never in my life had more than 1 drive fail at the same time (with big & small RAID5 arrays).i.e. If one 12TB drive fails in RAID5, it's POSSIBLE (or likely) a reconstruction can fail. If one 12TB drive fails with no RAID, all data is 100% gone for sure.Where can I find more info on this?Thanks!
I couldn't give you info that is 100% technically sound on this topic (or a terribly elegant one), at least compared to some hardware junkies...but I'll take a stab at it.
Consider, that with one drive you are dealing with the probability of one drive failing. That is, the probability that one drive is defective, or has an off 'moment' inside, or is just too old, or whatever. This is the chance that the one drive will 'become another statistic', as I'll call it.
The simple explanation here provides that the more drives you have, the better chance you have for a single drive failure. So instead of having 1 chance for one drive to fail...you have 5 chances for one drive to fail. Then, the more drives you add, even better the probability that one of them will 'become another statistic.'
The problem enters when you consider that (after a drive failure) rebuilding an array of a certain size takes a quantifiable amount of time and number of disk operations. I have not personally done the math, but the idea is that each drive may (for argument's sake) have a 1 in 10,000 operations error rate, and rebuilding an array of the specified size or disk count may require 10,001 disk operations from EACH disk; and this is assuming that each drive operates within its specified tolerances. Another drive really could fail altogether.
The basic point is that the numbers can catch up to you
ALL is possible on Linux!!!!!
!go Linux!
thanks for explaining..
It still seems like RAID6 is the better choice over no RAID. (the title of the article can be more clear)
Lets hope the rebuilding time also decreases with capacity increases.
What I hate about such types of articles is that the guy basically says: "RAID 5 sucks, RAID 6 sucks, you're screwed. HA! Told ya!"
It's a bit like announcing the end of the world because X is going to happen, yet not giving any practical solution for how X can be solved. The negative tone of the whole article is just like saying to us that we're doomed to lose our whole data every once in a while and that, oh, it's life... I'm sure there are already researches being conducted to circumvent such an issue, I'm sure technologies will show up in due time, and I'm pretty darn sure the drives will get larger and larger without stopping. There have always been seemingly unsurmountable issues, yet here we are today, through all those impossible obstacles and still living to talk about it!
Keyword: unrecoverable.
Error correction routines must become more robust otherwise all data will be corrupted, by their logic.
CRC errors happen constantly with I/O, you never notice them, or maybe you do..that weird hiccup or glitch. RAM has ECC as well, so does your CPU.
Our favorite hobby is awash in bit flipping and transients, if it wasn't for error correction algorithms, absolutely nothing electrical would work...hell all MP3's are just one big approximation of what you might be lisenting to.
you get a boo for ugly hair.
I dont see why this is all so melodramatic. Just increase the parity information with each strip. Big deal, as if we cant afford it.
Makes me wonder why in hell you want a single 12TB drive volume? Ever heard of the expression, "Don't put all your eggs in one basket?"
Smaller drives are better.
Ok guys they are talking about Corporations with NAS or SANs devices that hold 12 drives and 6TB of data. This is in RAID 5. Just 3 weeks ago we had 2 drives fail. 1 was part of the parity and the other was a spare. Heck one of our servers holds 6 drives 2 used in raid 1 and 4 in raid 5.
The more drives you have the higher the likely hood that you have multiple failures at the same time.
RAID 6 is expensive and requires hardware that is expensive. If you've ever bought a Server you will understand. One of our PowerEdge servers cost over 4 grand and that just had sata drives. Throw in 15k SAS drives and you're looking at 7-8grand. Now you need windows server...
This is old news anyway. Companies have and use redundant systems now. Companies don't rely on RAID all that much. A lot of companies also invest in imaging software like Acronis Echo or Symantec Ghost as well. As soon as your company experiences multiple drive failures in a RAID 5 they will switch to redundant systems faster than you can ship the server. You'll be buying the server that same day.
@Darkk
Exactly. Keep each RAID-5 volume under the size where read errors are statistically likely, and it's a non-issue. Assuming the scenario being suggested does occur, then it will simply become an industry standard that all RAID-5 arrays are built under the safe size limit. Either the drive designers will figure out how to resolve the problem, or another technology will replace hard drives, but no corporation or responsible individual is going to risk data corruption in exchange for capacity. Especially when media distribution is becoming increasingly an online enterprise...does anyone think that you'll be able to re-download your multi-terabyte movie collection for free (legitimately, I mean) when your array craps out?
Also, let's not forget that RAID-5 is not a backup solution. It's a high-availability solution.
What they're talking about is a 12TB RAID Array. Where all the drives in your system, SANs, or NAS box equals 12TB. Then they slice it up for different purposes.
It's a little bit more complicated as it requires planning of how much space this particular section will need. If you use roaming profiles will you need 1TB or 2?
Makes me wonder why in hell you want a single 12TB drive volume? Ever heard of the expression, "Don't put all your eggs in one basket?"Smaller drives are better.
I had case of déjà vu when reading this article. I think I saw it before on the same PC and time in the day.
RAID 5 is a nonsense solution: it is slower for writes than RAID 0 and slower for recoveries than RAID. you can lose an entery business day to recover a RAID 5, imagine the time consumed when 2TB+ disk are on the market, and as stated in the article, RAID 5 is not a failproof solution.
The whole idea behind RAId is that it it an inexpensive/independent array of disks that is safer or faster than JBOD, so with 3 drives in a RAID 5 i have safety and speed, but the safety implies hours of recovery time, during which not a single bit on the entire array can go wrong, and the speed is just for reads, writes on RAID 5 are longer. with just one more disk i can make a RAID 10 solution, for the best of both worlds, or i can go cheap and use the very same 3 disks for a RAID 0 + external dayly backup-an elegant way to have lighting fast performance and fast system recovery.
On business solution the costs of delaying to much to recover the array far exceed the price gap between RAID 5 and RAID 10/RAID 6, and with the increased likehood that the array rebuild will be unsucessful this cost gap will be reversed.
Lets all say no to RAID 5!!
RAID5 saved my butt before. Even with the long recovery time.
I then quickly got the external drive to be double sure the data is safe.
This works well for me for home-use.
2 cents worth..
Seriously how many servers out there are running 2 TB drives (or even 1TB) in Raid 5??
As for no on Raid 5??? Pull your head out of your @ss and get with the industry. Yes Raid 0 is faster but try explaining to a user why they can't get to their files until you do a system restore from backups because you wanted to have faster read\write times... idiot....
jmtc
If they can't find a solution some time soon then it will be just another excuse to go to SSD. Not that SSD's wont fail but they obviously are boasting a very long shelf life for them.
Server wise sure 12 TB isn't that much. Consumer wise it's a butt load and will take a few years. I'm not saying it won't happen any time soon it's just most people don't need that much space yet. Once HDTV picks up and 25GB Blu ray rips becomes the norm then I can see 12TB filling up a lot faster.
Anyone who has 12TB of movies or music on their PC right now might just have the RIAA or Obama/Biden internet police knocking at your door making sure your legit in the near future.
Sure, you could use high capacity 3.5" platter SATA drives for your enterprise RAID 5, but that may not be the most intelligent decision you ever made. SCSI / SAS and the WD Raptor, have been using more reliable (lower capacity) 2.5" platters for over 5 years (I love the reviews talking about 2.5" drives being new, pop open an old 3.5" SCSI). That is why SCSI/SAS drives tend to be smaller in size. This greatly reduces platter vibration increasing reliability greatly.
..and a 7 disk RAID 5?! This article should be entitled "stupid people get upset when they do stupid things". There is nothing worse than being frustrated with a bad decision that you made for yourself, but there is one catch: YOU MADE THE DECISION!!!! Don't get angry with WD or Seagate when you 12 TB SATA RAID 5 double-faults. If you read the specs and ran the numbers, these companies practically say in bold print: DON'T DO THIS!
Whew! Felt good to get that off my chest. I get tired of storage articles that just don't get it. The only reasonable source I ever found was storagereview, but they don't update like they used to. If you love your data, back it up.
Rarely such bullshit can be read...
The error rate referred in the article is that of a undetectable one (by internal drive ECC algorithms), not a uncorrectable one, which will just slip by undetected. The practical result is corrupted read data (1 sector corrupted in 10^14 sector reads). The figure for enterprise level drives is much lower (1 in 10^15 - 10^16).
This is not a catastrophic failure, as it is presented. A reread will reproduce with utmost probability the correct data.
In normal read RAID operation, the parity bit is usually not verified. Higher class controllers will do a array consistency check at predetermined intervals. Such a undetected HDD read error will generate a consistency failure, and the corresponding warning in the logs - it will not mark the array as failed - and the controller will try to correct it, by writing a correct (in its vision) parity bit. The result is corrupted data, corresponding to that particular bit. In a reconstruction operation, such a error will slip undetected, the result being also corrupted data.
If the corrupted bit is used in some critical file system area, the damage could be (much) higher than if it was just part of some file.
Statistically the probability of causing irreversible damage is very low - the alarming conclusion of doom is far fetched.
disk kapacity doesn't grow exponentially !
I have never read a more flawed article in my life. What a load of Bullsh*t.
The 10^14 figure is the chance of an unrecoverable read error by the drive. If you send the same read command for the same sector to the drive again, in 10^14 - 1 cases, the error won't happen the next time. During the purported RAID 5 rebuild, the controller will just have to read one sector from one drive again, and the rebuild completes perfectly, just like normal.
The author of this article obviously has no clue whatsoever how RAID arrays and controllers work, and is unqualified to write an article in this area. If he worked for me, this article would have never gone to print, and would have been rejected instantaneously.
seriously one day they say "well have 10 kajillion bit hard drives" the next they say "ohh no, the world will stop at 10TB" RAID 5 is freaking awsome! 12 TB of space is normal. Maybe those of you who work from a garage don't see it that often, but to enterprise users 12TB is normal. If drives get so wobbly that RAID 5 is not practical, then you're also going to be pissing off non raid users 10 fold.
Also, you can rebuild drives in the background, I've done it several times. Consistancy checks scheduled in the background too. If the mentioned problem becomes serious, then manufacturers can release firmware to just mark the sector bad or inconsistant instead of just saying "rebuild failed" I'm not trying to be coarse, but some of you really need to read about RAID 5 before you comment. Especially the author/editor.
Won't SSDs will solve this issue?
Im still wondering when magnetic disk hard drives will hit the wall just like CPU's did. how much smaller can they make sectors on those magnetic disks before they become too delicate for errors?
The article is complete nonsense based a fundamental mis-understanding of the what the unrecoverable read error rate represents.
SomeJoe7777 is on the right track, but failed to explain it completely.
First consider the soft error rate, the rate at which a bit may be read incorrectly one time, but read correctly most of the time. That number isn't really important other than as a context for what uncorrected read error rate is. The URE rate is the rate at which raw bits passing under the read can't be consistently read AND match what was written. Note that we're talking about the raw bit stream from the media. All HDs for at least 15 years have had advanced ECC encoding on each sector which is capable of recovering from at least one bit (typically several bits) error in the bit stream for that sector. This means that the ECC which is built in to the drive will correct that 10 ^ -14 URE.
The URE applies to the raw bit stream, not to the ECC corrected data coming out of the drive, and the ECC can recover from at least 1 bit error in the bit stream for each (and every) sector on the disk.
Because of that, the size of the drive is essentially irrelevant, it is the sector size and the amount of ECC per sector that is relevant. Because of the sector level ECC, the size of the drive has only a trivial effect on the likelihood of a sector becoming unreadable.
RAID (other than the improperly named RAID 0) offers an additional level of recoverability allowing for a failure of any part of a drive (e.g. one sector, one head, etc.) or a whole drive (e.g. motor, actuator, controller, etc.) to fail and still be able to recover the data.
Likewise, the size of the RAID volume is only a trivial factor in the chances that a RAID volume will be unrecoverable. The number of drives in an RAID 5 or RAID 6 array is many orders of magnitude more significant than the size of the drive or of the volume/array. The more drives, the greater the chance of a failure of one or more drives. The same applies to the seldom/never used RAID 2, 3, or 4 and to a few implementations of RAID 0 + 1 (where the data is striped before it's mirrored)
Because RAID 1 (and RAID 10, aka 1 + 0) mirrors pairs of drives, the number of drives and the size of the array are almost irrelevant. In order to be uncoverable, a RAID 10 array must have a failure the identical portion of the two drives that are mirrored within a short enough timeframe that one failed drive is not repaired/rebuilt before the second failure. That probability does not change significantly by adding more drive pairs to the array.
RAID5 is in no way doomed. If HDDs ever become so unreliable that they are unfit for RAID5, then they will most certainly be unfit for use as standalone drives (i.e. the most common use). HDDs simply must store enough parity information (per sector) that data loss is a VERY rare event. As we pack information more and more densely onto the platters it may be necessary to add more parity to compensate.
IOW ... move along, there's nothing to see here.