MSI X99a Mobo Suddenly Won't Boot After 6 Months Flawless Operation

Zydecci

Commendable
Sep 14, 2016
12
0
1,510
Last night I was doing some homework and decided to go to bed. I put my computer into hibernation and went to sleep. I came back this morning and couldn't get it to wake up. I hard reset the machine but still got no picture. I opened up the case and the debugger lights are more or less stuck at 04: the display sits at 04 for a second, flashes very quickly through 00, AF, and b0, goes back to 04 for a few more seconds, and repeats. Since 04 is usually part of the CPU POST, I would normally say it's my CPU and RMA it, but I find it hard to believe that my CPU just burnt out of nowhere while doing literally nothing.

It might also help to note that I tried fiddling with the RAM sticks (changing slots and retesting the POST). In some configurations it hangs at 00, and in others it gets as far as the behavior I described above.

Specs:
MSI X99a XPower Motherboard
Corsair RM1000i PSU
Corsair Vengeance LPX 4GB RAM Sticks (x4)
MSI GeForce GTX980
Intel i7-5820K Hooked into the LGA2011-v3 motherboard slot

Any ideas would be appreciated. Can provide any other details.

UPDATE: Did some more fiddling with RAM. Turns out it will get to the 04 post code with RAM only installed in slot 7. But if you install on slot 7 and 1, it will hang at 00. Doubt that is helpful, but it's all I got.

UPDATE 2: More RAM fiddling. Turns out all of the RAM slots on the left side of the CPU will cause it to hang at 00. I now currently have all 4 sticks on the right side of the CPU and it is again hanging at 04.
 
Hello... Try removing your battery and clearing the CMOS data with the jumper/switch... put your RAM sticks where you want to use them... remove the jumper/switch... replace the MB battery and try a reboot... New memory settings/communication and other CMOS data will be created. The MB will be at default settings too... so the MB CPU/IGT will be your video output at boot.

Do this every time while troubleshooting the memory sticks with the different MB slots used, for consistent pass/fail results in your tests.
 

Zydecci



Hey thanks for your suggestion. I have tried clearing the CMOS data using the jumper and the button that comes on the back panel of the mobo. If I try to put my RAM sticks where I want them it will hang at 00. I have 8 slots, 4 on the left of the CPU socket and 4 on the right. The recommended config for 4 sticks is in slots 1, 3, 5, and 7. But the slots on the left are messed up and will not work if there are any DIMMs in them, so I have all my sticks in 5, 6, 7, and 8 right now.
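For keeping track of slot tests like these, here is a minimal Python sketch. It assumes slots 1-4 sit on the left of the socket and slots 5-8 on the right, which is only a reading of the layout described here and not taken from the manual. The three entries are the results reported so far in this thread, and the function name `suspect_slots` is made up for illustration.

```python
# Each entry: (set of populated slots, POST code where the board stopped).
# Slot numbering (1-4 left, 5-8 right) is an assumption, not from the manual.
observations = [
    ({7}, "04"),
    ({1, 7}, "00"),
    ({5, 6, 7, 8}, "04"),
]


def suspect_slots(observations, bad_code="00"):
    """Return slots seen only in configurations that stopped at bad_code."""
    seen_bad = set()
    seen_further = set()
    for slots, code in observations:
        if code == bad_code:
            seen_bad |= slots
        else:
            seen_further |= slots
    # A slot is suspect if it never appeared in a run that got past bad_code.
    return sorted(seen_bad - seen_further)


print(suspect_slots(observations))
```

With only these three results it flags slot 1, because slot 7 also appears in configurations that got as far as 04. Logging every single-stick test the same way would narrow things down further.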
 
Hello... 1) Do a clear every time while troubleshooting the memory sticks with the different MB slots used, for consistent pass/fail results in your tests... Try a new battery too.
2) Do you have any auto-update apps enabled for the MB software/BIOS?
3) Can you locate an error code chart for your MB? PDF/manual/website?
4) Do you have a spare PS on hand to try, with just enough power to boot to the BIOS? (Remove the GPU.)
 

Zydecci



To make sure that I am doing it right: I just short the 2 contacts for the CMOS clear while the system is off, right? And no, I do not have any auto-update for the BIOS configured.

 

bailojustin

Distinguished
Sounds like you had a motherboard short. I would check your house's ground first to help rule it out as the cause. Also, is the case sitting on carpet? Is it open? The possibility of shorts is high, and so is the possibility of telecommunication surges coming in through the Ethernet. These can fry vital components, including most USB ports.
 

Zydecci

It may also be worth noting that when the debugger hangs at 00, the GPU fans spin at maximum rpm and the case fans do not spin at all. But when it is at 04, the case fans spin at regular speed and the GPU fans do not spin at all.
 

bailojustin

I would also test your PSU by metering the wires. Look up a guide beforehand, please. Do not meter them if you don't know what you are doing.

The first thing I would meter is yellow to black on the PSU. Yellow is the positive 12V and black is ground. Meter between those two points and it should read 12V +/-10%.

If it does not read this, that explains your CPU not powering and your GPU not even getting to queue in POST.
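As a rough illustration of that check, the Python sketch below compares multimeter readings against a nominal rail voltage. The readings are made-up placeholders, `within_tolerance` is a hypothetical helper, and the 10% default simply mirrors the figure given above. The ATX specification is usually quoted as +/-5% for the +12V rail, so a stricter value can be passed in.

```python
def within_tolerance(measured, nominal, tolerance=0.10):
    """Return True if a measured voltage is within +/- tolerance of nominal."""
    return abs(measured - nominal) <= abs(nominal) * tolerance


# Placeholder readings in volts; replace with your own measurements.
readings = [
    ("+12V yellow to black", 12.0, 11.8),
    ("+12V yellow to black (sagging)", 12.0, 10.5),
]

for label, nominal, measured in readings:
    status = "OK" if within_tolerance(measured, nominal) else "OUT OF RANGE"
    print(f"{label}: {measured:.2f} V ({status})")
```

Calling `within_tolerance(11.2, 12.0, tolerance=0.05)` applies the tighter 5% window instead.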
 

Zydecci



I haven't ever had a problem with any of the plugs here. The case is off the ground, sitting on my desk, and has been since the beginning of its now very short life.
 

Zydecci



Thanks for another reply. I have fully metered the PSU with my multimeter (I work with circuits from time to time) and everything meets the manufacturer's specifications.
 
Hello... Good question... and information.

"Proper CMOS Clear/reset"
1) Unplug the PS from the wall.
2) Remove the MB battery.
3) Place the jumper on the pins (or use the switch) on the MB to clear the capacitors... for ~1 min.
4) Remove the jumper/switch.
5) Re-install the battery.
7) Plug the monitor into the MB video output.
8) Plug the PS into the wall.
9) Boot the computer.

If you jumpered/shorted the clear without taking the battery out... you may need a new battery after that.

I would suggest removing your video card and other hardware devices (HDDs? DVD drives? card readers?) from the PS and MB for further testing... you could have a weak PS problem, or a failed device connected to your MB... and we just need a CPU, MB, RAM, and monitor... and enough power to get a boot screen at this time. B /
 

bailojustin

Ok, Step by step.

Ground yourself, then turn off the PSU switch and move the PSU to a location where you can look inside.
Now pull out all RAM but 1 stick. Turn off any motherboard overclock switches.
Pull out your CMOS battery. After this, locate your CMOS jumper pins.
Cap these 2 pins to reset the CMOS. Remove the PSU power from the wall.
Hold the power button for 10 seconds.
Now put the cap back on the CMOS jumper correctly, replace the CMOS battery, and plug the PSU power back in.
Flip on the switch and try to boot. If it does not boot:

Optional: You should lift up your CPU, reseat it, and check carefully for damaged or bent pins. One of your error codes suggests that, for some reason, your Northbridge is trying to start before your CPU, which is a sign of a bent pin or a short, or possibly of errors in the memory/mobo connections.

Optional: You can go buy a new set of RAM and go for gold. This is the last resort if nothing else works. If this does not work, get new components.

Time for breadboard.

Make sure you have an insulated surface to do this on.
Pull out your motherboard and all other components, connect everything outside the case on the insulated surface, power it, and see if it powers on correctly. This will tell you if a component has a short or a failure. Then you know to take a look at the motherboard or CPU. There is even a chance of the RAM being the culprit; I have heard of RAM sticks going bad on that board, 3 at a time.
 

Zydecci



I just did that with just about every memory config I could think of. Still nothing. All HDDs and SSDs were removed beforehand because I was afraid something had happened. I doubt it's a weak PSU because it's 1000W and there's hardly anything pulling from it.
 

bailojustin



Breadboard it and visually inspect it.
 

Zydecci



Thanks again for another detailed reply. I tried that set of steps for clearing CMOS and it's still hanging at 00. Earlier I took my CPU out and inspected the pins in the socket (LGA2011-v3 has pins in the socket as opposed to on the CPU itself) and they all looked uniform. I figured it was my RAM at first, but I tried every single stick in every single slot and it's just not working (unless the mobo managed to fry 4 sticks all at the same time). At some point I will have to "breadboard" the machine to see if that changes anything, but that would be a multiple-hour endeavor, as a lot of my case is jerry-rigged to work (nothing electric, just cosmetic).
 
Hello... Well... No spare PS to try? That would be my next quick action... Otherwise we are just guessing here B /
And you are talking RMA of the CPU, MB, RAM, PS?... My 2nd suspect would be the MB... Intel CPUs and Corsair RAM have been rock solid for me through the years... but you are running quad channel?

Can you try a boot with single or paired sticks... for further memory tests with that chipset?

I remember some early problems with the construction of X99 MBs... better email their tech support to see if they suggest sending the MB in, due to a known problem B /
 

Zydecci



I just got rid of a spare PSU I had lying around last month. Not my best idea. If I had other known good parts I would use them but unfortunately this is the only desktop in the house. I have been running quad channel for all of these tests except for when I was trying to boot with just 1 stick. I'm really trying not to RMA anything as I have tons of programming work to do over the next few days, but it's starting to look like that might have to be the case.
 
Hello... YES, bad timing on the PS... Test your battery voltage... If you're in a time/work crunch here... I would suggest calling a local repair center and using their spare parts... But this could cost you some $$$ ? B / ...You could know what is wrong in a couple of hours from them.
 

Zydecci



Alright I'll try to find a repair place that might have some spare parts. Thanks for all of your help and effort
 

bailojustin



Just maybe, does your card have a UEFI button?
 

Zydecci



No it doesn't. Why do you ask?
 

Zydecci

Update in case anyone was wondering. Now basically nothing works. I haven't changed anything, but now I cannot get the debugger to move past 00. No more 04. Seems like it's breaking more and more by the minute
 

bailojustin



A short. You are most likely killing your other components, if they are not done already. There may be permanent damage from your CPU overheating or from a hotspot/short.