Omega Owners Forum

Please login or register.

Login with username, password and session length
Advanced search  

News:

Please check the Forum Guidelines at the top of the Newbie section

Pages: [1] 2  All   Go Down

Author Topic: Ooops - server hard drive failing  (Read 2039 times)

0 Members and 1 Guest are viewing this topic.

Andy H

  • Omega Lord
  • *****
  • Offline Offline
  • Gender: Male
  • Auckland
  • Posts: 5499
    • Mazda MPV
    • View Profile
Ooops - server hard drive failing
« on: 22 July 2017, 23:21:12 »

Quote
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.

I have been running a little home server using an HP microserver with 3 hard disks set up in a RAID array with two disks mirroring each other and one hot spare. I cannot remember how long ago I put it together  :-[ it is looking very dusty so probably several years  ::)

I normally switch it on in the morning and walk away but this morning I watched it start up and noticed an error at start up on one of the disks.

It appears that the software RAID has done it's job and switched from using the failing disk and is now using the spare.

my problem now is that I want to replace the failing disk with something equivalent and I haven't been paying attention to who owns which HDD manufacturers and which manufacturers to steer clear of.

Quote
Model Family:     HP 250GB SATA disk VB0250EAVER

I don't even know if I can still buy a 250GB SATA disk.  :-\
Logged
"Deja Moo - The feeling that you've heard this bull somewhere before."

Varche

  • Omega Queen
  • *****
  • Offline Offline
  • Gender: Male
  • middle of Andalucia
  • Posts: 13635
  • What is going to break next?
    • Golf Estate
    • View Profile
Re: Ooops - server hard drive failing
« Reply #1 on: 23 July 2017, 00:02:07 »

Ebay ?
Logged
The biggest joke on mankind is that computers have started asking humans to prove that they aren’t a robot.

Mr Gav

  • Omega Knight
  • *****
  • Offline Offline
  • Leeds
  • Posts: 1924
    • Nissan 370z GT Edition
    • View Profile
Re: Ooops - server hard drive failing
« Reply #2 on: 23 July 2017, 00:10:14 »

TB will be along shortly to advise  :y
Logged

Andy H

  • Omega Lord
  • *****
  • Offline Offline
  • Gender: Male
  • Auckland
  • Posts: 5499
    • Mazda MPV
    • View Profile
Re: Ooops - server hard drive failing
« Reply #3 on: 23 July 2017, 07:37:35 »

Ebay ?
l had a quick look. The drives on offer appeared to have done as many hours as my dead/dying one  :(
Logged
"Deja Moo - The feeling that you've heard this bull somewhere before."

TheBoy

  • Administrator
  • *****
  • Offline Offline
  • Gender: Male
  • Brackley, Northants
  • Posts: 105924
  • I Like Lockdown
    • Whatever Starts
    • View Profile
Re: Ooops - server hard drive failing
« Reply #4 on: 23 July 2017, 11:16:54 »

Ideally with RAID, you want same disks...  ...this is one reason the likes of HPE and Dell dump their own firmwares on, so they can pick different OEMs, but end up with identically function/performance disks, and hotplug support.

BUt given you are (presumably) running off the onboard B1x0i software RAID device, you'll find it actually has little bearing. The B1x0i isn't hotplug compatible IIRC, and the microserver's drive "backplane" (just cables IIRC) certainly won't be.

250Gb drives are quite hard to source now, especially Enterprise ones.  Bro replaced an entire G6 server due to cost of drives (and the likelihood more would fail, and it was getting well past its sell by date - and the day before we decomed it, the array battery failed, so good call).

I may have a NHP HP/HPE 250Gb SATA drive somewhere, as some of OOF's servers came with them over the years, and they wouldn't have been used. Post up the HP/HPE Part number (Bxxxxxx-xxx) and I'll check. Yours for postage if I have one.
Logged
Grumpy old man

TheBoy

  • Administrator
  • *****
  • Offline Offline
  • Gender: Male
  • Brackley, Northants
  • Posts: 105924
  • I Like Lockdown
    • Whatever Starts
    • View Profile
Re: Ooops - server hard drive failing
« Reply #5 on: 23 July 2017, 11:19:31 »

Just checked one of my servers here, B140i definitely does support hotplug (with the right backplane), so you may have hotplug support if your microserver has a Hotplug backplane.
Logged
Grumpy old man

Andy H

  • Omega Lord
  • *****
  • Offline Offline
  • Gender: Male
  • Auckland
  • Posts: 5499
    • Mazda MPV
    • View Profile
Re: Ooops - server hard drive failing
« Reply #6 on: 23 July 2017, 14:33:30 »

That would be fantastic if you are able to fix me up with a replacement. I may be mistaken but I really don't think that my little Proliant microserver supports hot-plug. I don't know whether it would hurt to use a hot-plug disc in a non-hotplug server :-\

I need to boot SWMBO off the internet so I can switch the the thing off and pull the disc out to have a look at the labels (& remind myself of which Microserver it is). SMART reports the model no as VB0250EAVER.

Quote
smartctl 5.41 2011-06-09 r3365 [i686-linux-3.2.0-4-686-pae] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     HP 250GB SATA disk VB0250EAVER
Device Model:     VB0250EAVER
Serial Number:    6VMY8Q9Z
LU WWN Device Id: 5 000c50 03e060876
Firmware Version: HPG0
User Capacity:    250,059,350,016 bytes [250 GB]
Sector Size:      512 bytes logical/physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 6
Local Time is:    Sun Jul 23 14:12:14 2017 BST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: FAILED!
Drive failure expected in less than 24 hours. SAVE ALL DATA.
See vendor-specific Attribute list for failed Attributes.

General SMART Values:
Offline data collection status:  (0x82)   Offline data collection activity
               was completed without error.
               Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)   The previous self-test routine completed
               without error or no self-test has ever
               been run.
Total time to complete Offline
data collection:       (  625) seconds.
Offline data collection
capabilities:           (0x5b) SMART execute Offline immediate.
               Auto Offline data collection on/off support.
               Suspend Offline collection upon new
               command.
               Offline surface scan supported.
               Self-test supported.
               No Conveyance Self-test supported.
               Selective Self-test supported.
SMART capabilities:            (0x0003)   Saves SMART data before entering
               power-saving mode.
               Supports SMART auto save timer.
Error logging capability:        (0x01)   Error logging supported.
               General Purpose Logging supported.
Short self-test routine
recommended polling time:     (   2) minutes.
Extended self-test routine
recommended polling time:     (  46) minutes.
SCT capabilities:           (0x1039)   SCT Status supported.
               SCT Error Recovery Control supported.
               SCT Feature Control supported.
               SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   117   099   006    Pre-fail  Always       -       161167139
  3 Spin_Up_Time            0x0023   097   097   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   095   095   020    Old_age   Always       -       5944
  5 Reallocated_Sector_Ct   0x0033   002   002   036    Pre-fail  Always   FAILING_NOW 4053
  7 Seek_Error_Rate         0x002f   054   054   030    Pre-fail  Always       -       253422034555
  9 Power_On_Hours          0x0032   072   072   000    Old_age   Always       -       25163
 10 Spin_Retry_Count        0x0033   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   098   098   020    Old_age   Always       -       2972
180 Unused_Rsvd_Blk_Cnt_Tot 0x002b   100   100   000    Pre-fail  Always       -       29188
183 Runtime_Bad_Block       0x0032   098   098   000    Old_age   Always       -       2
184 End-to-End_Error        0x0032   100   100   097    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   096   000    Old_age   Always       -       8590066793
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   069   063   045    Old_age   Always       -       31 (Min/Max 17/31)
194 Temperature_Celsius     0x0022   031   040   000    Old_age   Always       -       31 (0 8 0 0)
195 Hardware_ECC_Recovered  0x003a   021   021   000    Old_age   Always       -       161167139
196 Reallocated_Event_Count 0x0032   002   002   036    Old_age   Always   FAILING_NOW 4053
197 Current_Pending_Sector  0x0032   099   099   000    Old_age   Always       -       42
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   200   200   000    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
Logged
"Deja Moo - The feeling that you've heard this bull somewhere before."

TheBoy

  • Administrator
  • *****
  • Offline Offline
  • Gender: Male
  • Brackley, Northants
  • Posts: 105924
  • I Like Lockdown
    • Whatever Starts
    • View Profile
Re: Ooops - server hard drive failing
« Reply #7 on: 23 July 2017, 20:18:33 »

Yup, that's failing ;D

Grab the 2 numbers (Part No and "Replace with") off the sticker next time its powered down, and I'll see what I can find.
Logged
Grumpy old man

Andy H

  • Omega Lord
  • *****
  • Offline Offline
  • Gender: Male
  • Auckland
  • Posts: 5499
    • Mazda MPV
    • View Profile
Re: Ooops - server hard drive failing
« Reply #8 on: 24 July 2017, 19:17:05 »

Finally managed to get the two year old off to sleep  :) - going to try and read the numbers from photographs I took last night (if the one year old will allow........ ::) )
Logged
"Deja Moo - The feeling that you've heard this bull somewhere before."

Andy H

  • Omega Lord
  • *****
  • Offline Offline
  • Gender: Male
  • Auckland
  • Posts: 5499
    • Mazda MPV
    • View Profile
Re: Ooops - server hard drive failing
« Reply #9 on: 24 July 2017, 19:26:12 »

1st small label
Quote
Replace with Spare
250G SATA NHP
[571517-001]

2nd small label
Quote
Security ID
458DHL1
...barcode...
CT: 2AVERD2331B0FM     PN: 571227-002
REPLACE WITH SPARE PART#:   571517-001
Logged
"Deja Moo - The feeling that you've heard this bull somewhere before."

Andy H

  • Omega Lord
  • *****
  • Offline Offline
  • Gender: Male
  • Auckland
  • Posts: 5499
    • Mazda MPV
    • View Profile
Re: Ooops - server hard drive failing
« Reply #10 on: 24 July 2017, 19:35:09 »

Big label
Quote
Seaget Barracuda 7200.12
250 GB 7200 RPM SATA
S/N 6VMY8Q9Z
ST3250318AS
P/N: 9SL131-780
Firmware: HPG0
Date Code: 12026  Site Code: SU

HP MODEL: VB0250EAVER
HPN: 571227-002
CT:2AVER01330X367
HP: GPN: 397377-028
Logged
"Deja Moo - The feeling that you've heard this bull somewhere before."

TheBoy

  • Administrator
  • *****
  • Offline Offline
  • Gender: Male
  • Brackley, Northants
  • Posts: 105924
  • I Like Lockdown
    • Whatever Starts
    • View Profile
Re: Ooops - server hard drive failing
« Reply #11 on: 25 July 2017, 19:00:46 »

OK, should be enough details there, let me have a look round :y
Logged
Grumpy old man

TheBoy

  • Administrator
  • *****
  • Offline Offline
  • Gender: Male
  • Brackley, Northants
  • Posts: 105924
  • I Like Lockdown
    • Whatever Starts
    • View Profile
Re: Ooops - server hard drive failing
« Reply #12 on: 25 July 2017, 19:49:46 »

Don't have an exact match in my selection of HP drives, the two nearest are a Seagate OEM one (based on underlying model) and a WD OEM one (GPN match).

Image of both with labels at:
http://theboy.omegaowners.com/oofpics/odds/IMG_0475.JPG


Let me know which you prefer, and I'll just do a quick check on it.  If it spins, I have no doubt it will be fine, as these normally come with new ProLiant servers from that era, so would have been immediately removed, as they have always been far too small for my needs.
Logged
Grumpy old man

TheBoy

  • Administrator
  • *****
  • Offline Offline
  • Gender: Male
  • Brackley, Northants
  • Posts: 105924
  • I Like Lockdown
    • Whatever Starts
    • View Profile
Re: Ooops - server hard drive failing
« Reply #13 on: 25 July 2017, 19:51:38 »

LOL, looking properly, both match on the GPN ;D
Logged
Grumpy old man

Andy H

  • Omega Lord
  • *****
  • Offline Offline
  • Gender: Male
  • Auckland
  • Posts: 5499
    • Mazda MPV
    • View Profile
Re: Ooops - server hard drive failing
« Reply #14 on: 25 July 2017, 23:00:54 »

I am interested in either (or both) of them - and happy to pay actual money (not just postage) :)

Logged
"Deja Moo - The feeling that you've heard this bull somewhere before."
Pages: [1] 2  All   Go Up
 

Page created in 0.022 seconds with 21 queries.