Releases: mchehab/rasdaemon
Releases · mchehab/rasdaemon
Release v0.8.4
Main New Features
- Common Event Record to CXL updated to match spec rev3.2
- Added support for CXL memory sparing events
- Added AER Uncorrectable Error reporting to SEL on OpenBMC
- Added SIGBUS event handler
- Added decode panic mce through ERST
- Added poison page and memory corrected error statistics
- Added event level on event record
- Added bash and zsh completions for ras-mc-ctl
- Added dmidecode Requires for riscv64
- Updated support for HiSilicon
Summary of Changes
Code Improvements
Makefile.am: package completions at the source tarballMakefile.am: add install rules for completions fileunified-sel.c: use tabs instead of spaces for indentMakefile.am: sort if groupsMakefile.am: use one file per line, in alphabetic order
Autotools Updates
INSTALL: update it with the autogenerated one from new autotools
RAS Daemon Core
rasdaemon: Fix CPU isolate errors with limited CPUs at startuprasdaemon: Update the supported modules in the HiSilicon common sectionrasdaemon: Add support for the CXL memory sparing eventsrasdaemon: cxl: Add validity check for parsing and logging the corrected memory error threshold event flags and error countrasdaemon: cxl: Update handling of Common Event Record to CXL spec rev3.2rasdaemon: fix post-processing optionsrasdaemon: cxl: Add validity check for parse and log hdr_maint_op_sub_class fieldrasdaemon: cxl: Update ras-report.c with missing maintenance operation subclass informationrasdaemon: cxl: Add parse and log linear cache address alias emission for cxl events
Completion Files
Makefile.am: install completions filescompletions: add completions files for zsh and bash
Event Handling
ras-events: Revert page_size to 4096 in get_pagesize()ras-events: fix for return error code in add_event_handler()ras-events: fix buffer overflow in add_event_handler read
Miscellaneous Changes
unified-sel: added AER Uncorrectable Error reporting to SEL on OpenBMCREADME.md: Fix spelling and grammar errors in README.mdrasdaemon: Fix coding style issuesrasdaemon: add event level for event recordaer: print pci device name and vendor/device idrasdaemon: erst: decode panic mce through erstrasdaemon: introduce poison page statisticsrasdaemon: support memory corrected error statistics
Compiler Requirements
dmidecode Requires on riscv64
Bug Fixes
rasdaemon: skip doesn't exist event
Configuration Files
config: add rsyslog configconfig: add syslog-ng and logrotate config
Release v0.8.3
The most relevant change on this version is support for CXL spec rev. 3.1.
Changes since version 0.8.2:
- ras-diskerror-handler: fix checkpatch warnings
- ras-diskerror-handler: Use the right dev_t decoding for diskerror handler
- rasdaemon: Add new modules supported by HiSilicon common section
- rasdaemon: Fix some static check warning
- rasdaemon: Fix few compilation warnings in non standard hisilicon code
- rasdaemon: Fix some compilation alarms in ras-record.h.
- rasdaemon: ras-mc-ctl: Update logging of CXL memory module data to align with CXL spec rev 3.1
- rasdaemon: ras-mc-ctl: Update logging of CXL DRAM event data to align with CXL spec rev 3.1
- rasdaemon: ras-mc-ctl: Update logging of CXL general media event data to align with CXL spec rev 3.1
- rasdaemon: ras-mc-ctl: Update logging of common event data to align with CXL spec rev 3.1
- rasdaemon: ras-mc-ctl: Fix logging of memory event type in CXL DRAM error table
- rasdaemon: cxl: Update memory module event to CXL spec rev 3.1
- rasdaemon: cxl: Update CXL DRAM event to CXL spec rev 3.1
- rasdaemon: cxl: Update CXL general media event to CXL spec rev 3.1
- rasdaemon: cxl: Add Component Identifier formatting for CXL spec rev 3.1
- rasdaemon: cxl: Update common event to CXL spec rev 3.1
- rasdaemon: cxl: Add automatic indexing for storing CXL fields in SQLite database
- rasdaemon: cxl: Fix mismatch in region field's name with kernel DRAM trace event
- rasdaemon: cxl: Fix logging of memory event type of DRAM trace event
- rasdaemon: Fix for parsing error when trace event's format file is larger than PAGE_SIZE
- rasdaemon: Add page offline support for cxl memory
- Add labels for ASRock X370 Taichi
- Add labels for ASRock X570 Creator
- Add labels for ASRock X570S PG Riptide
- rasdaemon: mce: decode io port for bus error
- rasdaemon: arm: do not print error msg if field not found
- rasdaemon: add DE error type for AMD
- rasdaemon: Fix the display format of JaguarMicro vendor no standard errors
Release v0.8.2
Version 0.8.2
- ChangeLog: fix 0.8.1 release date
- checkpatch.pl: warn also about strcat and sprintf usages
- ci.yml: run checkpatch when doing tests
- Makefile: add types.h to the list of headers
- Makefile: only enable rbtree if needed
- mce-intel: drop a code commented a long time ago with an action
- mce-intel-ivb/mce-intel-sb: remove code commented with #if 0
- New feature: support memory row CE threshold policy
- non-standard-jaguarmicro: avoid CamelCase
- ras-arm-handler: cope with latest upstream changes
- ras-arm-handler: Parse and log ARM Processor Error Info table
- ras-arm-handler: use GENMASK() macro
- rasdaemon: add .editorconfig file to follow our coding style
- rasdaemon: alphabetically sort includes
- rasdaemon: check if sscanf() processed all arguments on dev_name
- rasdaemon: cleanup coding style
- rasdaemon: coding style cleanup
- rasdaemon: CXL: Extract, log and record region info from cxl_general_media and cxl_dram events
- rasdaemon: CXL: Fix uncorrectable macro spelling
- rasdaemon: don't use braces for single statement blocks
- rasdaemon: don't use unsafe strcpy, strcat and sprintf
- rasdaemon: enforce SPDX license tags
- rasdaemon: fix a coding style issue
- rasdaemon: fix checkpatch warnings
- rasdaemon: fix initialization potential issues
- rasdaemon: fix for compilation warning in ras-memory-failure-handler.c
- rasdaemon: fix mem_fail_event build breakage
- rasdaemon: fix some coding style issues
- rasdaemon: fix some typos and correct spelling
- rasdaemon: ipmitool SEL logging of AER CEs on OpenBMC platforms
- rasdaemon: modify support for vendor-specific machine check error information
- rasdaemon: move type macros to a separate header (types.h)
- rasdaemon: rbtree: removed unused definition for RB_ROOT
- rasdaemon: use func instead of the name of the function
- ras-events: demote log information about trace being enabled/disabled
- ras-events: drop a dead code to check number of CPUs
- ras-events: fix -d option to work again
- ras-events: fix warning ‘filter_ras_mc_event’ defined but not used
- ras-events: make returned error code consistent
- ras-mc-ctl: log hpa and region info from cxl_general_media and cxl_dram tables
- ras-page-isolation: drop an unused variable
- ras-page-isolation: fix location_fields size
- ras-page-isolation: fix an overflow bug when units->val=1
- ras-page-isolation: use snprintf() instead of sprintf()
- scripts/checkpatch.pl: add support for checking SPDX
- scripts/checkpatch.pl: do some additional cleanups
- scripts/checkpatch.pl: set default mode to strict
- scripts/checkpatch.pl: some improvements to reduce false positives
- types.h: add an implementation for strscpy() and strscat()
- types.h: don't depend on linux/bits.h
- unified-sel: convert license boilerplate to SPDX
Release v0.8.1
- Lots of new labels added;
- Added a script to check coding style;
- Detect non-x86 CPUs on MCE handler;
- Added an error injection script for QEMU;
- Allow external scripts to handle memory errors;
- Fix ARM processor compatibility with upstream;
- Add support for CXL events;
- Add support for microcode and PPIN on MCE traces;
- Add non-standard decodes for Yitian and Jaguarmicro;
- Add EmeraldRapids Intel CPU;
- Add Corrected Internal Error for aer_cor_errors;
- Add support for disable ras events in runtime;
- Improve AMD SMCA handling;
- Fix several compilation and runtime issues;
- Add option on ras-mc-ctl to exclude old events from reports;
- Other improvements at ras-mc-ctl tool;
- Documentation updates;
- Code cleanups;
- Coding style cleanups.
Release v0.7.0
- Add labels for ASUS TUF GAMING B450-PLUS II
- Add four modules supported by HiSilicon common section
- Updated HiSilicon platform name
- Relocate reading and display Kunpeng920 errors to under Kunpeng9xx
- Add support to display the HiSilicon vendor errors for a specified module
- Add printing usage if necessary parameters are not passed for the vendor-error options
- Reformat error info of the HiSilicon Kunpeng920
- Modify error statistics for HiSilicon KunPeng9xx common errors
- Modify recording Hisilicon common error data
- Support cpu fault isolation for recoverable errors
- Support cpu fault isolation for corrected errors
- Use XSI version of strerror_r on non glibc systems
- Use the new block_rq_error tracepoint
- Fix bank limit types check
- Properly handle localtime() failure
- Fix for a memory out-of-bounds issue and optimized code to remove duplicate function.
- Fix possible but unlikely file descriptor leak
- Fix bashisms
Release v0.6.8
- Fix some issues related to sysconfigdir
- Some fixes for hisi boards
- Update ras-mc-ctl manpage to match current options
- Fix ras-mc-ctl when parsing some dimm sizes
- New asrock x570 motherboard label
- New Supermicro labels
- Support MCE for AMD CPU family 19h
- Add new SMCA bank types with error decoding
- Add error handling for Ampere-specific errors.
- Add support for multi-arch builds