Hey Everyone,
Been trying to solve this obscure BSOD issue for a number of months now and can't seem to get to the bottom of it. Randomly why idle either overnight or while I'm at work the machine will BSOD and reboot with a bugcheck code of 126 (7E), this originally used to be a hard hang and further down the line of troubleshooting it became a reboot.
Before I replaced the mobo+cpu+psu, it started 1 day when I plugged an external 2.5" USB drive into 1 of the front header USB 3.0 port, was copying files to it overnight and next morning it did hard hang so I had to reboot it, didn't think too much of it at the time, over the coming months it started doing once a day while idle, or once every few days, maybe once a week or maybe once a month and of course it never created any dump files, this is part of the reason I can't seem to find what the issue is, it is worth noting I recently bought an RTX 2080 about 2-3 months before this started happening and my machine is custom water cooled by myself, I have monitored and temps have never been a problem.
1.) Roll-back to a known good working image of Windows I had, this is the same image that had been working for a few months
2.) Re-installed Windows from scratch, I used to be on LTSC 1809.
3.) Checked for updated drivers.
4.) Ran driver verifier which did find Logitech drivers were causing a hard hang shortly after logging in, removed that but didn't made a difference.
5.) Ran memtest86+ on each of the dimms, no issues
6.) Ran Furmark and Prime95 overnight, no issues
Some of the details are bit hazy as it's been ages and I did spend hours and hours trying different things, eventually the machine started rebooting instead of hard hanging and just 1 of the times it rebooted it created a minidump file which pointed to the nvlddmkm.sys driver, I possibly updated/reverted the nVidia drivers by this point, I was dusting out the PC and found that the slightest nudge on the ram sticks started hard hanging the machine, so I put some pressure on the dimms (4 dimms in total) to keep them seated firmly in plaace, eventually after a few days or a few weeks it had rebooted so that wasn't it, there was also the fact that the watercooled 2080 and the TITAN X before it had a heavy gpu sag on the PCI-E slot it was sitting in, also occassionally taking the 24 pin ATX connector out and dimms caused quite some bend in the motherboard, ultimately it got to the point where the machine would not even post, stripped down the PC, motherboard out on it's own on workbench, good known working PSU, known good video card, and still no post so I wrote that mobo off as I did not have another CPU to test with.
Anyways I bought a new mobo+cpu, the PSU is one of the first things I had replaced a while back when I started experiencing this issue, new build of Windows 10 1809, all drivers up to date, everything clean, working, no probs for a few months now, Friday evening I came home and plugged a USB camera into 1 of the front USB 2.0 header ports and overnight at 2:46AM it had crashed and rebooted, same bugcheck code 126, this time it did write a minidump file which points to the nvlddmkm.sys driver, however I also noticed that it created .dmp files in another location that showed a driver crashing called "USBHUB3", I then used Appcrashview which was interesting because at the very second that the event log reported the bugcheck error which was 2:46:53 there was 7 events that took place, started with the USBHUB3 driver causing a crash, I've now moved that USB camera to a USB hub I have connected (I did try changing hubs months back this did not help), I have not had any random reboots yet but it's getting to the point now where I really just want to drill down into what exactly is causing this, if it is a faulty RTX 2080 I'd like to find out because the RTX 2080 has a USB Type-C connector on it and 3 of the USB devices in Device Manager are registered under NVIDIA and using the USBHUB3 driver, also looking in the event log it's saying where the .dmp and .xml etc files are for the USBHUB3 crash but they never exist, it doesn't actually seem to create the files.
Apologies some of the details are a bit hazy, I've been trying to remember everything I've done over the past number of months. I've attached a link to the minidump file it created for nvlddmkm.sys as well as a previous USBHUB3 .dmp file it did manage to create, any help would be massively appreciated as I'm very much lost at this point, I have tried to look at both .dmp files using windbg but not experienced enough to understand what the assembly instructions are doing or what else is going on, there is also a MEMORY.DMP if it helps but it's 1.2GB so will upload upon request.
Minidump/msinfo32 below:
MEGA
If you need any more info let me know.
Thanks
Rainman65
Been trying to solve this obscure BSOD issue for a number of months now and can't seem to get to the bottom of it. Randomly why idle either overnight or while I'm at work the machine will BSOD and reboot with a bugcheck code of 126 (7E), this originally used to be a hard hang and further down the line of troubleshooting it became a reboot.
Before I replaced the mobo+cpu+psu, it started 1 day when I plugged an external 2.5" USB drive into 1 of the front header USB 3.0 port, was copying files to it overnight and next morning it did hard hang so I had to reboot it, didn't think too much of it at the time, over the coming months it started doing once a day while idle, or once every few days, maybe once a week or maybe once a month and of course it never created any dump files, this is part of the reason I can't seem to find what the issue is, it is worth noting I recently bought an RTX 2080 about 2-3 months before this started happening and my machine is custom water cooled by myself, I have monitored and temps have never been a problem.
1.) Roll-back to a known good working image of Windows I had, this is the same image that had been working for a few months
2.) Re-installed Windows from scratch, I used to be on LTSC 1809.
3.) Checked for updated drivers.
4.) Ran driver verifier which did find Logitech drivers were causing a hard hang shortly after logging in, removed that but didn't made a difference.
5.) Ran memtest86+ on each of the dimms, no issues
6.) Ran Furmark and Prime95 overnight, no issues
Some of the details are bit hazy as it's been ages and I did spend hours and hours trying different things, eventually the machine started rebooting instead of hard hanging and just 1 of the times it rebooted it created a minidump file which pointed to the nvlddmkm.sys driver, I possibly updated/reverted the nVidia drivers by this point, I was dusting out the PC and found that the slightest nudge on the ram sticks started hard hanging the machine, so I put some pressure on the dimms (4 dimms in total) to keep them seated firmly in plaace, eventually after a few days or a few weeks it had rebooted so that wasn't it, there was also the fact that the watercooled 2080 and the TITAN X before it had a heavy gpu sag on the PCI-E slot it was sitting in, also occassionally taking the 24 pin ATX connector out and dimms caused quite some bend in the motherboard, ultimately it got to the point where the machine would not even post, stripped down the PC, motherboard out on it's own on workbench, good known working PSU, known good video card, and still no post so I wrote that mobo off as I did not have another CPU to test with.
Anyways I bought a new mobo+cpu, the PSU is one of the first things I had replaced a while back when I started experiencing this issue, new build of Windows 10 1809, all drivers up to date, everything clean, working, no probs for a few months now, Friday evening I came home and plugged a USB camera into 1 of the front USB 2.0 header ports and overnight at 2:46AM it had crashed and rebooted, same bugcheck code 126, this time it did write a minidump file which points to the nvlddmkm.sys driver, however I also noticed that it created .dmp files in another location that showed a driver crashing called "USBHUB3", I then used Appcrashview which was interesting because at the very second that the event log reported the bugcheck error which was 2:46:53 there was 7 events that took place, started with the USBHUB3 driver causing a crash, I've now moved that USB camera to a USB hub I have connected (I did try changing hubs months back this did not help), I have not had any random reboots yet but it's getting to the point now where I really just want to drill down into what exactly is causing this, if it is a faulty RTX 2080 I'd like to find out because the RTX 2080 has a USB Type-C connector on it and 3 of the USB devices in Device Manager are registered under NVIDIA and using the USBHUB3 driver, also looking in the event log it's saying where the .dmp and .xml etc files are for the USBHUB3 crash but they never exist, it doesn't actually seem to create the files.
Apologies some of the details are a bit hazy, I've been trying to remember everything I've done over the past number of months. I've attached a link to the minidump file it created for nvlddmkm.sys as well as a previous USBHUB3 .dmp file it did manage to create, any help would be massively appreciated as I'm very much lost at this point, I have tried to look at both .dmp files using windbg but not experienced enough to understand what the assembly instructions are doing or what else is going on, there is also a MEMORY.DMP if it helps but it's 1.2GB so will upload upon request.
Minidump/msinfo32 below:
MEGA
If you need any more info let me know.
Thanks
Rainman65