We have a Dell PowerEdge R440 that started doing random crash/restarts in february. Then it was fine for 2 months and started happening again late april.
I am unable to figure out what the problem is, but think it is a software problem, since Dell Pre-boot system assessment pass all tests.
Event viewer doesn't give me anything worth wild. I get the event 41, kernel power. But that just tell me that the system shut down unexpectedly.
Dell server manager and dell IDRAC logs OEM software event at each crash. but its unable to tell me anything more about the OEM software event. And it logs "a runtime critical stop occured".
When I run chkdsk all drives are reported fine.
When I run SFC /scannow it does not find any intergrity violations
When I run dism /checkhealth it says repairable
When I run dism /restorehealth it says not enough storage to complete operation.
When i run dism /startcomponentcleanup it says can't find specified file.
I guess my first step will be to get the last two commands to succeed, even though I don't think thats the reason for the crashes, since SFC says everything is fine.
I will try to increase the IRPStackSize to solve the /restorehealth problem. But im not sure its going to work. And I can't restart the server during business hours.
All help here are appreciated.
I am unable to figure out what the problem is, but think it is a software problem, since Dell Pre-boot system assessment pass all tests.
Event viewer doesn't give me anything worth wild. I get the event 41, kernel power. But that just tell me that the system shut down unexpectedly.
Dell server manager and dell IDRAC logs OEM software event at each crash. but its unable to tell me anything more about the OEM software event. And it logs "a runtime critical stop occured".
When I run chkdsk all drives are reported fine.
When I run SFC /scannow it does not find any intergrity violations
When I run dism /checkhealth it says repairable
When I run dism /restorehealth it says not enough storage to complete operation.
When i run dism /startcomponentcleanup it says can't find specified file.
I guess my first step will be to get the last two commands to succeed, even though I don't think thats the reason for the crashes, since SFC says everything is fine.
I will try to increase the IRPStackSize to solve the /restorehealth problem. But im not sure its going to work. And I can't restart the server during business hours.
All help here are appreciated.