While investigating intermittent kernel panics on an x86-64 Linux server, you see a growing number of messages such as:
[Hardware Error]: Corrected error, no action required.
[Hardware Error]: CPU:0 Machine Check: 0 Bank 7: b600000000100153
Management wants to receive an alert whenever a single DIMM logs more than ten corrected ECC events in any 24-hour period so the module can be replaced during the next maintenance window.
Which ONE of the following actions will BEST meet this requirement and provide ongoing decoding of Machine Check Exceptions (MCEs)?
Install the mcelog package, enable its daemon, and set ce-error-threshold = 10 / 24h in /etc/mcelog.conf.
Configure smartd with -I 194 and -I 198 directives to watch for increasing disk-error counters.
Add memtest86+ to the GRUB menu so memory is automatically tested on every reboot.
Enable the kernel's NMI watchdog by setting kernel.nmi_watchdog=1 and monitor /proc/interrupts for anomalies.
The mcelog package is designed to decode Machine Check Exceptions on x86 systems, keep running error statistics, and trigger actions when thresholds are exceeded. Running mcelog as a daemon and setting ce-error-threshold = 10 / 24h in /etc/mcelog.conf will cause the built-in dimm-error-trigger script to fire when a DIMM accumulates more than ten corrected errors within 24 hours, allowing administrators to receive alerts and schedule replacement.
The NMI watchdog (second choice) helps detect hard lockups but does not count or report ECC events. Adding memtest86+ to GRUB (third choice) only tests memory during a reboot and offers no real-time monitoring or alerting. smartd with attributes 194 and 198 (fourth choice) monitors disk temperature and reallocated-sector counts, not memory errors. Therefore, only deploying mcelog with an appropriate threshold directly addresses the stated goal.
Ask Bash
Bash is our AI bot, trained to help you pass your exam. AI Generated Content may display inaccurate information, always double-check anything important.
What is mcelog and how does it work?
Open an interactive chat with Bash
What are ECC events, and why are they important to monitor?
Open an interactive chat with Bash
How does the `ce-error-threshold` setting in mcelog.conf improve system reliability?
Open an interactive chat with Bash
CompTIA Linux+ XK0-006 (V8)
Troubleshooting
Your Score:
Report Issue
Bash, the Crucial Exams Chat Bot
AI Bot
Loading...
Loading...
Loading...
IT & Cybersecurity Package Join Premium for Full Access