
v0.1.0b3

Pre-release


maskedsyntax released this 14 Mar 04:22

HashPrep v0.1.0b3

New features

  • Config file loading (#69): Load analysis settings from YAML, TOML, or JSON via --config. Supports runtime threshold overrides (e.g. outlier, missingness, correlation) so you can tune checks without code changes.
  • Mutual information and Shannon entropy (#68): New checks and summaries for feature-target and feature-feature mutual information, plus Shannon entropy for categorical columns. Helps spot low-information or redundant features.
  • Normality and variance homogeneity tests (#67): Built-in normality tests (e.g. Shapiro-Wilk) and variance homogeneity tests (e.g. Levene) for numeric columns. Surfaces non-normal or heteroscedastic variables that may need transforms.
  • First-class DateTime support (#66): Proper handling of datetime columns, covering inference, summaries, and checks (e.g. future dates, skew). Datetime columns are no longer treated as plain text.
  • Edge-case tests and CI (#64): Broader test coverage for correlation, leakage, and other edge cases, plus GitHub Actions CI so regressions are caught automatically.
  • Website UI and docs (#59): Updated hashprep.com with a clearer UI and documentation (installation, CLI, Python API, checks).
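
To make the mutual information and Shannon entropy item concrete, here is a minimal standard-library sketch of the two quantities for categorical data. It is not HashPrep's implementation; the function names `shannon_entropy` and `mutual_information` are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def shannon_entropy(values):
    """Shannon entropy, in bits, of a sequence of categorical values.

    Hypothetical helper for illustration; not part of the HashPrep API.
    """
    n = len(values)
    counts = Counter(values)
    # H(X) = sum over categories of p * log2(1 / p), with p = count / n
    return sum((c / n) * math.log2(n / c) for c in counts.values())


def mutual_information(xs, ys):
    """Mutual information, in bits, between two equal-length categorical sequences.

    Hypothetical helper for illustration; not part of the HashPrep API.
    """
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    # I(X;Y) = sum of p(x,y) * log2(p(x,y) / (p(x) * p(y))), written with raw counts
    return sum(
        (c / n) * math.log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


constant = ["a", "a", "a", "a"]   # carries no information: entropy is 0 bits
balanced = ["a", "b", "a", "b"]   # two equally likely categories: entropy is 1 bit
unrelated = [0, 0, 1, 1]          # statistically independent of `balanced`

print(shannon_entropy(constant), shannon_entropy(balanced))
print(mutual_information(balanced, balanced), mutual_information(balanced, unrelated))
```

A column with entropy near zero is a candidate low-information feature, and two features whose mutual information approaches the entropy of either one are candidates for redundancy.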

Fixes

  • PDF reports in limited environments: PDF generation is now optional. If WeasyPrint or its system libraries (e.g. libgobject) are missing, MD/JSON/HTML output still works, and the CLI reports a clear error for --format pdf instead of crashing.
  • Docs page light mode (#70): Fixed syntax highlighting on the docs site in light theme (contrast and colors) so code blocks are readable.
  • Mobile menu and routing (#60): Fixed mobile menu behavior, responsiveness, and routing issues on the website.
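
The optional-PDF behavior described in the first fix can be sketched with a guarded import. This is an assumed pattern, not HashPrep's actual code: `PdfUnavailableError`, `load_pdf_backend`, `render_report`, and the `importer` parameter are hypothetical names. The only real library call is WeasyPrint's `HTML(string=...).write_pdf()`.

```python
import importlib


class PdfUnavailableError(RuntimeError):
    """Raised when PDF output is requested but the PDF backend cannot be loaded."""


def load_pdf_backend(importer=importlib.import_module):
    """Return the weasyprint module, or None if it cannot be imported.

    `importer` is a hypothetical hook so the failure path can be exercised
    without uninstalling anything.
    """
    try:
        return importer("weasyprint")
    except (ImportError, OSError):
        # ImportError: the package is not installed.
        # OSError: a shared library such as libgobject could not be loaded.
        return None


def render_report(html, fmt, importer=importlib.import_module):
    """Render an HTML report string as 'html' or 'pdf' bytes (illustrative only)."""
    if fmt == "html":
        return html.encode("utf-8")
    if fmt == "pdf":
        backend = load_pdf_backend(importer)
        if backend is None:
            raise PdfUnavailableError(
                "PDF output needs WeasyPrint and its system libraries; "
                "choose another format (Markdown, JSON, or HTML)."
            )
        # With no target argument, write_pdf() returns the PDF as bytes.
        return backend.HTML(string=html).write_pdf()
    raise ValueError(f"unsupported format: {fmt!r}")
```

Deferring the import to the moment PDF output is requested keeps the other formats usable on machines where the PDF dependencies are absent.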

Refactors and quality

  • Renderers and text perf (#65): Deduplicated report renderers and optimized text output.
  • Structured logging and error handling (#63): More consistent logging and validation so invalid inputs and errors are clearer.
  • Config and codebase (#61): Centralized config handling and removed dead code.