html2text is a Python script that converts a page of HTML into clean, easy-to-read plain ASCII text
To me, ASCII means that this doesn't support Unicode and will fail on non-English html, but it seems to work just fine. Maybe the word ASCII should be removed?