CPU: 3700X

Motherboard: Aorus B550 Elite

RAM: 8GBx4 Corsair Vengence LPX 3200

GPU: PowerColor 5700XT

PSU: Cooler Master MWE 1050 V2

Built in 2020.

Since last month, my PC started having random reboots and giving ‘Machine Check Exception’ error, similar to these:

https://old.reddit.com/r/AMDHelp/comments/190mkn0/5950x_whea_error_18_machine_check_exception/

https://old.reddit.com/r/AMDHelp/comments/qia2e7/whea_18_critical_error_computer_goes_black_restart/

https://old.reddit.com/r/buildapc/comments/150m14n/pc_randomly_restarts_whealogger_id_18/

And now from the last 3 days the system doesn’t boot. When I power on the computer, all fans start spinning but keyboard and mouse LEDs don’t light up. Pressing CTRL+ALT+DEL doesn’t reboot system neither does pressing the power button for few seconds.

I suspect that motherboard has gone kaput and isn’t completing or even starting the boot process, which is why keyboard and mouse aren’t getting any signal or power from motherboard or why restart or power down functionality is working.

Before the system stopped booting, I was trying to solve the machine check exception error by updating BIOS, updating chipset drivers, changing BIOS settings etc. But now I’m thinking none of it could’ve helped because the board itself was deteriorating.

Also during that time, I would randomly get display glitches (pic below) which could only be solved by restarting the machine so I was suspecting it might be GPU that was causing the problems.

Sometimes it would show chessboard like pattern. I guess this was also because of some issue with mobo-GPU connection?

Anyway before changing the board is there anything else I can try? Changing it is a pain so I’m trying to avoid that. 😂

  • comador @lemmy.world
    link
    fedilink
    arrow-up
    0
    ·
    2 months ago

    While I call it init, many will assume I am referring to the boot init, but I am actually referring to the bios initialization (init).

    That said, most BIOS inits go in this order:

    • Power Detection check. If enough power, proceed.

    • CPU program link, CPU calls the BIOS to basically wake up and run the bios program.

    • Ram Detection check. If RAM is present, the BIOS will use about 64k to load from ROM to RAM (called the bios reserve area) that then does the next steps.

    • Hardware Detection check. Identifiers of the hardware are detected, enumerated, configured and initialized.

    • Boot Sequence is initialized whereby the BIOS does a handoff to the bootloader.

    It’s during the hardware detection phase when the display is initialized by the gpu and you often see it displaying the bios version, then counting RAM. If the gpu is working BUT the display out isn’t, it’ll actually continue to boot 100% of the time (it doesn’t care). If the gpu hardware itself doesn’t respond correctly to the BIOS request however, it sends the hardware detection of the BIOS into a loop or shuts the system down, never getting to the final step: boot sequence.

    Depending on the bios type, it may or may not show numlock. I’ve also seen it act differently on UEFI enabled systems than when it’s set to classic bios. So, it just depends.

    Regardless, see if you can source another pcie gpu for testing this. It only takes a minute and tbh, it doesn’t hurt to have a cheap used pcie vid card in your pc tools for such things.

    Good luck!