Well, the latest issue is that HAL7 decided to die...
We had been noticing that it was randomly rebooting or going into a reboot cycle.
I thought it was a software issue and was digging into that, but then it decided to turn off and not come back on... Meep!
HAL7 is older hardware: an HP ProLiant ML350 G6.
Come to find out, my RAID array was not being backed up either, and the array itself can't just be plugged into something else and recovered that way.
Online research suggested it was the power supply backplane distribution board. I couldn't find anything visually out of place, like a leaky capacitor, and we were about to give up hope.
Found ONE article online that shows soldering on a switch to manually force the power supplies to turn on. Currently it's back up and running, and we have completed a manual backup of the data.
I may pull the board back out and replace the caps. We will see.
Usually older server hardware parts can be found online for pretty cheap, but not this one. We can find it, but it's in the $200 range.
Lesson learned is to verify your backup solution.
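
For what it's worth, here is a rough sketch of the kind of check that would have caught this sooner. It is not what we actually run on HAL7. It just looks at a backup folder and says whether the newest file in it is recent enough. The function names, the 24 hour threshold, and the example path are all made up for illustration.

```python
import time
from pathlib import Path


def newest_mtime(backup_dir):
    """Return the newest modification time (epoch seconds) of any file
    under backup_dir, or None if there are no files at all."""
    mtimes = [p.stat().st_mtime for p in Path(backup_dir).rglob("*") if p.is_file()]
    return max(mtimes) if mtimes else None


def backup_is_fresh(backup_dir, max_age_hours=24, now=None):
    """True if the newest file under backup_dir is at most max_age_hours old.

    An empty backup folder counts as stale, since that is exactly the
    "backup was never running" case.
    """
    now = time.time() if now is None else now
    newest = newest_mtime(backup_dir)
    if newest is None:
        return False
    return (now - newest) <= max_age_hours * 3600


if __name__ == "__main__":
    # Hypothetical path; point this at wherever the backups are supposed to land.
    target = "/mnt/backups/hal7"
    print("fresh" if backup_is_fresh(target) else "STALE OR EMPTY")
```

A fresh timestamp only proves the job ran, not that the data can be restored, so an occasional test restore is still worth doing.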