Releases: mchehab/rasdaemon

Release v0.8.4

20 Nov 10:53

Main New Features

  • Updated CXL Common Event Record handling to match CXL spec rev 3.2
  • Added support for CXL memory sparing events
  • Added AER Uncorrectable Error reporting to SEL on OpenBMC
  • Added SIGBUS event handler
  • Added decoding of panic MCEs through ERST
  • Added poison page and memory corrected error statistics
  • Added an event level to the event record
  • Added bash and zsh completions for ras-mc-ctl
  • Added a dmidecode Requires entry for riscv64
  • Updated support for HiSilicon

Summary of Changes

Code Improvements

  • Makefile.am: package completions at the source tarball
  • Makefile.am: add install rules for completions file
  • unified-sel.c: use tabs instead of spaces for indent
  • Makefile.am: sort if groups
  • Makefile.am: use one file per line, in alphabetic order

Autotools Updates

  • INSTALL: update it with the autogenerated one from new autotools

RAS Daemon Core

  • rasdaemon: Fix CPU isolate errors with limited CPUs at startup
  • rasdaemon: Update the supported modules in the HiSilicon common section
  • rasdaemon: Add support for the CXL memory sparing events
  • rasdaemon: cxl: Add validity check for parsing and logging the corrected memory error threshold event flags and error count
  • rasdaemon: cxl: Update handling of Common Event Record to CXL spec rev3.2
  • rasdaemon: fix post-processing options
  • rasdaemon: cxl: Add validity check for parse and log hdr_maint_op_sub_class field
  • rasdaemon: cxl: Update ras-report.c with missing maintenance operation subclass information
  • rasdaemon: cxl: Add parse and log linear cache address alias emission for cxl events

Completion Files

  • Makefile.am: install completions files
  • completions: add completions files for zsh and bash

Event Handling

  • ras-events: Revert page_size to 4096 in get_pagesize()
  • ras-events: fix for return error code in add_event_handler()
  • ras-events: fix buffer overflow in add_event_handler read

Miscellaneous Changes

  • unified-sel: added AER Uncorrectable Error reporting to SEL on OpenBMC
  • README.md: Fix spelling and grammar errors in README.md
  • rasdaemon: Fix coding style issues
  • rasdaemon: add event level for event record
  • aer: print pci device name and vendor/device id
  • rasdaemon: erst: decode panic mce through erst
  • rasdaemon: introduce poison page statistics
  • rasdaemon: support memory corrected error statistics

Compiler Requirements

  • dmidecode Requires on riscv64

Bug Fixes

  • rasdaemon: skip events that don't exist

Configuration Files

  • config: add rsyslog config
  • config: add syslog-ng and logrotate config

Release v0.8.3

10 Mar 11:13

The most relevant change in this version is support for CXL spec rev 3.1.

Changes since version 0.8.2:

  • ras-diskerror-handler: fix checkpatch warnings
  • ras-diskerror-handler: Use the right dev_t decoding for diskerror handler
  • rasdaemon: Add new modules supported by HiSilicon common section
  • rasdaemon: Fix some static check warnings
  • rasdaemon: Fix a few compilation warnings in non-standard HiSilicon code
  • rasdaemon: Fix some compilation warnings in ras-record.h
  • rasdaemon: ras-mc-ctl: Update logging of CXL memory module data to align with CXL spec rev 3.1
  • rasdaemon: ras-mc-ctl: Update logging of CXL DRAM event data to align with CXL spec rev 3.1
  • rasdaemon: ras-mc-ctl: Update logging of CXL general media event data to align with CXL spec rev 3.1
  • rasdaemon: ras-mc-ctl: Update logging of common event data to align with CXL spec rev 3.1
  • rasdaemon: ras-mc-ctl: Fix logging of memory event type in CXL DRAM error table
  • rasdaemon: cxl: Update memory module event to CXL spec rev 3.1
  • rasdaemon: cxl: Update CXL DRAM event to CXL spec rev 3.1
  • rasdaemon: cxl: Update CXL general media event to CXL spec rev 3.1
  • rasdaemon: cxl: Add Component Identifier formatting for CXL spec rev 3.1
  • rasdaemon: cxl: Update common event to CXL spec rev 3.1
  • rasdaemon: cxl: Add automatic indexing for storing CXL fields in SQLite database
  • rasdaemon: cxl: Fix mismatch in region field's name with kernel DRAM trace event
  • rasdaemon: cxl: Fix logging of memory event type of DRAM trace event
  • rasdaemon: Fix for parsing error when trace event's format file is larger than PAGE_SIZE
  • rasdaemon: Add page offline support for cxl memory
  • Add labels for ASRock X370 Taichi
  • Add labels for ASRock X570 Creator
  • Add labels for ASRock X570S PG Riptide
  • rasdaemon: mce: decode io port for bus error
  • rasdaemon: arm: do not print error msg if field not found
  • rasdaemon: add DE error type for AMD
  • rasdaemon: Fix the display format of JaguarMicro vendor non-standard errors

Release v0.8.2

19 Nov 08:22

Version 0.8.2

  • ChangeLog: fix 0.8.1 release date
  • checkpatch.pl: warn also about strcat and sprintf usages
  • ci.yml: run checkpatch when doing tests
  • Makefile: add types.h to the list of headers
  • Makefile: only enable rbtree if needed
  • mce-intel: drop a code commented a long time ago with an action
  • mce-intel-ivb/mce-intel-sb: remove code commented with #if 0
  • New feature: support memory row CE threshold policy
  • non-standard-jaguarmicro: avoid CamelCase
  • ras-arm-handler: cope with latest upstream changes
  • ras-arm-handler: Parse and log ARM Processor Error Info table
  • ras-arm-handler: use GENMASK() macro
  • rasdaemon: add .editorconfig file to follow our coding style
  • rasdaemon: alphabetically sort includes
  • rasdaemon: check if sscanf() processed all arguments on dev_name
  • rasdaemon: cleanup coding style
  • rasdaemon: coding style cleanup
  • rasdaemon: CXL: Extract, log and record region info from cxl_general_media and cxl_dram events
  • rasdaemon: CXL: Fix uncorrectable macro spelling
  • rasdaemon: don't use braces for single statement blocks
  • rasdaemon: don't use unsafe strcpy, strcat and sprintf
  • rasdaemon: enforce SPDX license tags
  • rasdaemon: fix a coding style issue
  • rasdaemon: fix checkpatch warnings
  • rasdaemon: fix initialization potential issues
  • rasdaemon: fix for compilation warning in ras-memory-failure-handler.c
  • rasdaemon: fix mem_fail_event build breakage
  • rasdaemon: fix some coding style issues
  • rasdaemon: fix some typos and correct spelling
  • rasdaemon: ipmitool SEL logging of AER CEs on OpenBMC platforms
  • rasdaemon: modify support for vendor-specific machine check error information
  • rasdaemon: move type macros to a separate header (types.h)
  • rasdaemon: rbtree: removed unused definition for RB_ROOT
  • rasdaemon: use __func__ instead of the name of the function
  • ras-events: demote log information about trace being enabled/disabled
  • ras-events: drop a dead code to check number of CPUs
  • ras-events: fix -d option to work again
  • ras-events: fix warning ‘filter_ras_mc_event’ defined but not used
  • ras-events: make returned error code consistent
  • ras-mc-ctl: log hpa and region info from cxl_general_media and cxl_dram tables
  • ras-page-isolation: drop an unused variable
  • ras-page-isolation: fix location_fields size
  • ras-page-isolation: fix an overflow bug when units->val=1
  • ras-page-isolation: use snprintf() instead of sprintf()
  • scripts/checkpatch.pl: add support for checking SPDX
  • scripts/checkpatch.pl: do some additional cleanups
  • scripts/checkpatch.pl: set default mode to strict
  • scripts/checkpatch.pl: some improvements to reduce false positives
  • types.h: add an implementation for strscpy() and strscat()
  • types.h: don't depend on linux/bits.h
  • unified-sel: convert license boilerplate to SPDX

Release v0.8.1

16 Jul 09:04

  • Lots of new labels added;
  • Added a script to check coding style;
  • Detect non-x86 CPUs in the MCE handler;
  • Added an error injection script for QEMU;
  • Allow external scripts to handle memory errors;
  • Fix ARM processor compatibility with upstream;
  • Add support for CXL events;
  • Add support for microcode and PPIN on MCE traces;
  • Add non-standard decodes for Yitian and Jaguarmicro;
  • Add EmeraldRapids Intel CPU;
  • Add Corrected Internal Error for aer_cor_errors;
  • Add support for disabling RAS events at runtime;
  • Improve AMD SMCA handling;
  • Fix several compilation and runtime issues;
  • Add an option to ras-mc-ctl to exclude old events from reports;
  • Other improvements to the ras-mc-ctl tool;
  • Documentation updates;
  • Code cleanups;
  • Coding style cleanups.

Release v0.7.0

22 Jan 06:39

  • Add labels for ASUS TUF GAMING B450-PLUS II
  • Add four modules supported by HiSilicon common section
  • Updated HiSilicon platform name
  • Relocate reading and displaying of Kunpeng920 errors under Kunpeng9xx
  • Add support to display the HiSilicon vendor errors for a specified module
  • Print usage if required parameters are not passed for the vendor-error options
  • Reformat error info of the HiSilicon Kunpeng920
  • Modify error statistics for HiSilicon Kunpeng9xx common errors
  • Modify recording of HiSilicon common error data
  • Support cpu fault isolation for recoverable errors
  • Support cpu fault isolation for corrected errors
  • Use the XSI version of strerror_r on non-glibc systems
  • Use the new block_rq_error tracepoint
  • Fix bank limit types check
  • Properly handle localtime() failure
  • Fix a memory out-of-bounds issue and remove a duplicate function
  • Fix possible but unlikely file descriptor leak
  • Fix bashisms

Release v0.6.8

01 Apr 10:52

  • Fix some issues related to sysconfigdir
  • Some fixes for HiSilicon boards
  • Update ras-mc-ctl manpage to match current options
  • Fix ras-mc-ctl when parsing some dimm sizes
  • New ASRock X570 motherboard label
  • New Supermicro labels
  • Support MCE for AMD CPU family 19h
  • Add new SMCA bank types with error decoding
  • Add error handling for Ampere-specific errors
  • Add support for multi-arch builds