Unfortunately we suffered a bizarre incident last night that caused everything here to fail... including the alerting system, which meant it went undetected until early this morning.
We are now back up and running, but we will have to go down again for an hour or so later to resync the databases. I'm not yet sure when, as I still have a lot to do getting all the infrastructure back up and running.
It looks like we had a near-simultaneous failure of 2 of our 3 storage solutions: one a hyperconverged, high-availability setup, the other a standalone SAN. I'm working back through the logs to see whether it genuinely was 3 simultaneous storage server failures (unlikely), 2 simultaneous hypervisor failures (unlikely), or some common cause I haven't yet found (likely).
Sorry, as always, for the outage, and for the fact you've probably had to talk to your wives/partners or go down the pub!