Omega Owners Forum
Chat Area => General Discussion Area => Topic started by: Andy H on 22 July 2017, 23:21:12
-
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.
I have been running a little home server using an HP microserver with 3 hard disks set up in a RAID array with two disks mirroring each other and one hot spare. I cannot remember how long ago I put it together :-[ it is looking very dusty so probably several years ::)
I normally switch it on in the morning and walk away but this morning I watched it start up and noticed an error at start up on one of the disks.
It appears that the software RAID has done it's job and switched from using the failing disk and is now using the spare.
my problem now is that I want to replace the failing disk with something equivalent and I haven't been paying attention to who owns which HDD manufacturers and which manufacturers to steer clear of.
Model Family: HP 250GB SATA disk VB0250EAVER
I don't even know if I can still buy a 250GB SATA disk. :-\
-
Ebay ?
-
TB will be along shortly to advise :y
-
Ebay ?
l had a quick look. The drives on offer appeared to have done as many hours as my dead/dying one :(
-
Ideally with RAID, you want same disks... ...this is one reason the likes of HPE and Dell dump their own firmwares on, so they can pick different OEMs, but end up with identically function/performance disks, and hotplug support.
BUt given you are (presumably) running off the onboard B1x0i software RAID device, you'll find it actually has little bearing. The B1x0i isn't hotplug compatible IIRC, and the microserver's drive "backplane" (just cables IIRC) certainly won't be.
250Gb drives are quite hard to source now, especially Enterprise ones. Bro replaced an entire G6 server due to cost of drives (and the likelihood more would fail, and it was getting well past its sell by date - and the day before we decomed it, the array battery failed, so good call).
I may have a NHP HP/HPE 250Gb SATA drive somewhere, as some of OOF's servers came with them over the years, and they wouldn't have been used. Post up the HP/HPE Part number (Bxxxxxx-xxx) and I'll check. Yours for postage if I have one.
-
Just checked one of my servers here, B140i definitely does support hotplug (with the right backplane), so you may have hotplug support if your microserver has a Hotplug backplane.
-
That would be fantastic if you are able to fix me up with a replacement. I may be mistaken but I really don't think that my little Proliant microserver supports hot-plug. I don't know whether it would hurt to use a hot-plug disc in a non-hotplug server :-\
I need to boot SWMBO off the internet so I can switch the the thing off and pull the disc out to have a look at the labels (& remind myself of which Microserver it is). SMART reports the model no as VB0250EAVER.
smartctl 5.41 2011-06-09 r3365 [i686-linux-3.2.0-4-686-pae] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net
=== START OF INFORMATION SECTION ===
Model Family: HP 250GB SATA disk VB0250EAVER
Device Model: VB0250EAVER
Serial Number: 6VMY8Q9Z
LU WWN Device Id: 5 000c50 03e060876
Firmware Version: HPG0
User Capacity: 250,059,350,016 bytes [250 GB]
Sector Size: 512 bytes logical/physical
Device is: In smartctl database [for details use: -P show]
ATA Version is: 8
ATA Standard is: ATA-8-ACS revision 6
Local Time is: Sun Jul 23 14:12:14 2017 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.
General SMART Values:
Offline data collection status: (0x82) Offline data collection activity
was completed without error.
Auto Offline Data Collection: Enabled.
Self-test execution status: ( 0) The previous self-test routine completed
without error or no self-test has ever
been run.
Total time to complete Offline
data collection: ( 625) seconds.
Offline data collection
capabilities: (0x5b) SMART execute Offline immediate.
Auto Offline data collection on/off support.
Suspend Offline collection upon new
command.
Offline surface scan supported.
Self-test supported.
No Conveyance Self-test supported.
Selective Self-test supported.
SMART capabilities: (0x0003) Saves SMART data before entering
power-saving mode.
Supports SMART auto save timer.
Error logging capability: (0x01) Error logging supported.
General Purpose Logging supported.
Short self-test routine
recommended polling time: ( 2) minutes.
Extended self-test routine
recommended polling time: ( 46) minutes.
SCT capabilities: (0x1039) SCT Status supported.
SCT Error Recovery Control supported.
SCT Feature Control supported.
SCT Data Table supported.
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x002f 117 099 006 Pre-fail Always - 161167139
3 Spin_Up_Time 0x0023 097 097 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 095 095 020 Old_age Always - 5944
5 Reallocated_Sector_Ct 0x0033 002 002 036 Pre-fail Always FAILING_NOW 4053
7 Seek_Error_Rate 0x002f 054 054 030 Pre-fail Always - 253422034555
9 Power_On_Hours 0x0032 072 072 000 Old_age Always - 25163
10 Spin_Retry_Count 0x0033 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 098 098 020 Old_age Always - 2972
180 Unused_Rsvd_Blk_Cnt_Tot 0x002b 100 100 000 Pre-fail Always - 29188
183 Runtime_Bad_Block 0x0032 098 098 000 Old_age Always - 2
184 End-to-End_Error 0x0032 100 100 097 Old_age Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
188 Command_Timeout 0x0032 100 096 000 Old_age Always - 8590066793
189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
190 Airflow_Temperature_Cel 0x0022 069 063 045 Old_age Always - 31 (Min/Max 17/31)
194 Temperature_Celsius 0x0022 031 040 000 Old_age Always - 31 (0 8 0 0)
195 Hardware_ECC_Recovered 0x003a 021 021 000 Old_age Always - 161167139
196 Reallocated_Event_Count 0x0032 002 002 036 Old_age Always FAILING_NOW 4053
197 Current_Pending_Sector 0x0032 099 099 000 Old_age Always - 42
198 Offline_Uncorrectable 0x0030 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x0032 200 200 000 Old_age Always - 0
SMART Error Log Version: 1
No Errors Logged
SMART Self-test log structure revision number 1
No self-tests have been logged. [To run self-tests, use: smartctl -t]
SMART Selective self-test log data structure revision number 1
SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
1 0 0 Not_testing
2 0 0 Not_testing
3 0 0 Not_testing
4 0 0 Not_testing
5 0 0 Not_testing
Selective self-test flags (0x0):
After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
-
Yup, that's failing ;D
Grab the 2 numbers (Part No and "Replace with") off the sticker next time its powered down, and I'll see what I can find.
-
Finally managed to get the two year old off to sleep :) - going to try and read the numbers from photographs I took last night (if the one year old will allow........ ::) )
-
1st small label
Replace with Spare
250G SATA NHP
[571517-001]
2nd small labelSecurity ID
458DHL1
...barcode...
CT: 2AVERD2331B0FM PN: 571227-002
REPLACE WITH SPARE PART#: 571517-001
-
Big label
Seaget Barracuda 7200.12
250 GB 7200 RPM SATA
S/N 6VMY8Q9Z
ST3250318AS
P/N: 9SL131-780
Firmware: HPG0
Date Code: 12026 Site Code: SU
HP MODEL: VB0250EAVER
HPN: 571227-002
CT:2AVER01330X367
HP: GPN: 397377-028
-
OK, should be enough details there, let me have a look round :y
-
Don't have an exact match in my selection of HP drives, the two nearest are a Seagate OEM one (based on underlying model) and a WD OEM one (GPN match).
Image of both with labels at:
http://theboy.omegaowners.com/oofpics/odds/IMG_0475.JPG
Let me know which you prefer, and I'll just do a quick check on it. If it spins, I have no doubt it will be fine, as these normally come with new ProLiant servers from that era, so would have been immediately removed, as they have always been far too small for my needs.
-
LOL, looking properly, both match on the GPN ;D
-
I am interested in either (or both) of them - and happy to pay actual money (not just postage) :)
-
Couple of things....
1. Why would you run a mirror with a hot swappable spare? 2 copies of the data and a spare disk in case one of those fails doesn't seem like that good of an idea.
2. Rather than replacing the duff disk with another old secondhand obsolete disk, why wouldn't you buy a bigger disk? Or even better a pair of disks as I just don't see the point in having a spare for a mirror.
I don't know what you use the server for, but unless it's something special - not just file sharing. Then why wouldn't you use a NAS or an external USB drive?
-
Couple of things....
1. Why would you run a mirror with a hot swappable spare? 2 copies of the data and a spare disk in case one of those fails doesn't seem like that good of an idea.
2. Rather than replacing the duff disk with another old secondhand obsolete disk, why wouldn't you buy a bigger disk? Or even better a pair of disks as I just don't see the point in having a spare for a mirror.
I don't know what you use the server for, but unless it's something special - not just file sharing. Then why wouldn't you use a NAS or an external USB drive?
1) Why not? Sounds like a good idea to me.
2) For any raid set, the disks ideally need to be the same. And that not just in capacity. Some controllers even like to spindle lock all disks in the set.
Why would you EVER piss around with stupid NAS devices. Even if it was just file sharing, This setup would be cheaper, more flexible, miles faster and more robust than a similar 4 bay toy NAS ;)