Skip to content

Conversation

@aaronng91
Copy link
Contributor

@aaronng91 aaronng91 commented Dec 18, 2024

Use same WER calculation method from jiwer, but split CJK chars by spacing so the WER calculation method treats each CJK char as a unit.

Test via:
python3 -m asr_metrics.cli wer --mixed-error-rate --non-en --show-normalised --diff --show-errors asr_metrics/examples/reference-mer.txt asr_metrics/examples/hypothesis-mer.txt

@dan-cochrane
Copy link
Contributor

Speedy work

@dan-cochrane
Copy link
Contributor

Cool - I think this can be merged. We should also tag it and then start a release for PyPI

@aaronng91 aaronng91 self-assigned this Dec 18, 2024
@aaronng91 aaronng91 merged commit 4b00140 into master Dec 18, 2024
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

3 participants