Encode and decode inside Node::Text?

Hi!

In the `Text` variant of a `Node`, the text is stored as-is from the HTML source. This means that a source such as `a &gt; b` is represented as `Node::Text("a &gt; b")` rather than `Node::Text("a > b")`. While this makes sense for performance reasons, I feel it might be unintuitive for users. The `Node` data type is for manipulating HTML after it has been parsed into an abstract syntax tree, yet the `Text` variant stores the text unprocessed from the file rather than storing what the text represents.

Additionally, this means that one could easily construct a `Node::Text` instance by mistake that contains HTML fragments which, when serialized, either give invalid HTML or parse to a different tree structure (for example `Node::Text("a > b")` or `Node::Text("a <img> b")`).

From what I can see, a solution to this problem would simply be to add a dependency such as [`html-escape`](https://docs.rs/html-escape/latest/html_escape/), then call `decode_html_entities` in the parse function and `encode_html_entities` in the `Htmlifiable::html` implementation.

(All of this applies to attribute values as well.)
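
To make the proposal concrete, here is a minimal sketch of the decode-on-parse / encode-on-serialize round trip. Everything in it is an assumption for illustration: the `Node` enum is a simplified stand-in with only a `Text` variant, `parse_text` and `text_to_html` are hypothetical names rather than the crate's real functions, and `unescape_text` / `escape_text` are hand-rolled stand-ins for the `html-escape` calls that cover only a few entities. Attribute values are not handled.

```rust
// Simplified stand-in for the crate's `Node`; only `Text` is modelled here.
#[derive(Debug, PartialEq)]
enum Node {
    Text(String),
}

/// Decode a small, fixed set of entities (stand-in for a real decoder).
fn unescape_text(source: &str) -> String {
    // `&amp;` is decoded last so that `&amp;lt;` yields `&lt;`, not `<`.
    source
        .replace("&lt;", "<")
        .replace("&gt;", ">")
        .replace("&quot;", "\"")
        .replace("&amp;", "&")
}

/// Escape the characters that would change the tree structure in text content.
fn escape_text(text: &str) -> String {
    // `&` is escaped first so the `&` of the later replacements is not re-escaped.
    text.replace('&', "&amp;")
        .replace('<', "&lt;")
        .replace('>', "&gt;")
}

/// Hypothetical parse step: store what the text represents, not the raw source.
fn parse_text(source: &str) -> Node {
    Node::Text(unescape_text(source))
}

/// Hypothetical serialize step: re-escape so the output parses back to the same tree.
fn text_to_html(node: &Node) -> String {
    match node {
        Node::Text(text) => escape_text(text),
    }
}
```

With this shape, `parse_text("a &gt; b")` holds `a > b`, and a hand-built `Node::Text("a <img> b")` serializes as escaped text instead of producing an `<img>` element.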
