Dell PowerEdge R530 Crashing

brad8598

Well-known member
Joined
Dec 7, 2018
Posts
51
We have a Dell PowerEdge R530 that started doing random crash/restarts on Sunday. Then it was fine for 4 days and started happening again today.

I am unable to figure out what the problem is, but think it is a software problem, since Dell Pre-boot system assessment pass all tests.

Event viewer doesn't give me anythingd. I get the event 41, kernel power. But that just tell me that the system shut down unexpectedly.
Dell server manager and dell IDRAC logs OEM software event at each crash. but its unable to tell me anything more about the OEM software event.

Any help or what do I need to post to get any? Thanks in advance!
 
  • A brief description of your problem (but you can also include the steps you tried)
    • See original post. We have tried installing all updates, reviewed events in EventViewer and Dell system logs that implies a graceful shutdown. Currently have a ticket with Microsoft and Dell but have found nothing. Server is running on a backup VM so I am able to try whatever is needed on physical box.
  • System Manufacturer?
    • Dell
  • Laptop or Desktop?
    • Server
  • Exact model number (if laptop, check label on bottom)
    • PowerEdge R530
  • OS ? (Windows 10, 8.1, 8, 7, Vista)
    • Windows Server 2012 R2
  • x86 (32bit) or x64 (64bit)?
    • x64
  • (Only for Vista, Windows 7) Service pack?
    • N/A
  • What was original installed OS on system?
    • Windows Server 2012 R2
  • Is the OS an OEM version (came pre-installed on system) or full retail version (YOU purchased it from retailer)?
    • OEM
  • Age of system? (hardware)
    • 2016
  • Age of OS installation?
    • 2016
  • Have you re-installed the OS?
    • no
  • CPU
    • Intel(R) Xeon(R) CPU E31220 @ 3.10GHz, Model 42 Stepping 7
  • RAM (brand, EXACT model, what slots are you using?)
    • will get later if needed
  • Video Card
    • G200e Matrox G200e (ServerEngines) - English
  • MotherBoard - (if NOT a laptop)
    • Intel Corporation S1200BTL E98681-352
  • Power Supply - brand & wattage (if laptop, skip this one)
    • will get later if needed
  • Is driver verifier enabled or disabled?
    • disabled
  • What security software are you using? (Firewall, antivirus, antimalware, antispyware, and so forth)
    • SentinelOne
  • Are you using proxy, vpn, ipfilters or similar software?
    • No
  • Are you using Disk Image tools? (like daemon tools, alcohol 52% or 120%, virtual CloneDrive, roxio software)
    • not sure, axcient backup software is running
  • Are you currently under/overclocking? Are there overclocking software installed on your system?
    • No
 

Attachments

Rich (BB code):
Event[9436]:
  Log Name: System
  Source: Server Administrator
  Date: 2021-12-12T14:31:13.000
  Event ID: 5306
  Task: Instrumentation Service
  Level: Error
  Opcode: Info
  Keyword: Classic
  User: N/A
  User Name: N/A
  Computer: PT-FileServer.PathTec.local
  Description: 
Severity: Critical, Category: System Health, MessageID: RDU0012, Message: Power supply redundancy is lost.

Event[9437]:
  Log Name: System
  Source: Server Administrator
  Date: 2021-12-12T14:31:13.000
  Event ID: 5354
  Task: Instrumentation Service
  Level: Error
  Opcode: Info
  Keyword: Classic
  User: N/A
  User Name: N/A
  Computer: PT-FileServer.PathTec.local
  Description: 
Severity: Critical, Category: System Health, MessageID: PSU0908, Message: Power lost on power unit PS1 Status.

There appears to be quite a few of these power lost errors on Sunday at about 14:31. I couldn't find any more of them in the event logs despite the system reporting that it was unexpectedly shutdown on the 17th and 18th.

These were logged moments beforehand.

Rich (BB code):
Event[9434]:
  Log Name: System
  Source: Server Administrator
  Date: 2021-12-12T14:31:39.000
  Event ID: 5304
  Task: Instrumentation Service
  Level: Information
  Opcode: Info
  Keyword: Classic
  User: N/A
  User Name: N/A
  Computer: PT-FileServer.PathTec.local
  Description: 
Severity: Informational, Category: System Health, MessageID: RDU0011, Message: The power supplies are redundant.

Event[9435]:
  Log Name: System
  Source: Server Administrator
  Date: 2021-12-12T14:31:36.000
  Event ID: 5352
  Task: Instrumentation Service
  Level: Information
  Opcode: Info
  Keyword: Classic
  User: N/A
  User Name: N/A
  Computer: PT-FileServer.PathTec.local
  Description: 
Severity: Informational, Category: System Health, MessageID: PSU0017, Message: Power supply PS1 Status is operating normally.




Rich (BB code):
Event[2130]:
  Log Name: System
  Source: volmgr
  Date: 2021-12-17T14:26:25.858
  Event ID: 49
  Task: N/A
  Level: Error
  Opcode: N/A
  Keyword: Classic
  User: N/A
  User Name: N/A
  Computer: PT-FileServer.PathTec.local
  Description: 
Configuring the Page file for crash dump failed. Make sure there is a page file on the boot partition and that is large enough to contain all physical memory.

Could you please ensure that your dump file settings are set to their system defaults and the page file size is managed by the operating system? There was two instances of this log entry on the 17th which would have been on Friday.
 
The 12/12 events were when this first started and I tested that all the power supplies and redundancy was actually working.

The 12/17 paging file change was done by Microsoft so I'll need to wait to change that back to system defaults.
 
Unfortunately those are the only events which appear to be applicable to your problem. I couldn't see anything else. It'll be interesting to see what it was attempting to write to a dump file though.
 
I apologize for lack of updates but myself and another IT guy have been working tickets with Microsoft and Dell to solve this issue. Since they've asked me to do some things that I can't revert yet to get more information to you, I currently am at a stand still. I should be able to revert some changes and keep pursuing this possibly next week. Thank you for your help so far.
 

Has Sysnative Forums helped you? Please consider donating to help us support the site!

Back
Top