Support changing the encoding #3

Closed
opened 2021-09-27 13:25:03 -04:00 by jking · 0 comments
Owner

We do not currently support [changing the encoding](https://html.spec.whatwg.org/multipage/parsing.html#changing-the-encoding-while-parsing) while parsing.

It should be possible to implement this by assuming that only ASCII characters are compatible between encodings, and otherwise throwing out the document parsed so far. Special care should be taken for encodings which are not entirely ASCII-compatible (such as ISO-2022-JP and, I believe, Shift_JIS or possibly Big5).
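
As a rough sketch of that check (not the parser's actual code), the decision could look something like the following. The function name `can_switch_in_place` and the contents of `NOT_ASCII_COMPATIBLE` are assumptions made for illustration; in particular, whether Shift_JIS or Big5 belongs in that set is left open here, as it is above.

```python
# Hypothetical set of encoding labels treated as not ASCII-compatible.
# Its exact membership is an assumption for this sketch.
NOT_ASCII_COMPATIBLE = frozenset({"iso-2022-jp", "utf-16le", "utf-16be"})


def can_switch_in_place(consumed: bytes, current: str, new: str) -> bool:
    """Return True if the decoder could be swapped without re-parsing.

    `consumed` is every byte the current decoder has read so far.
    A False result means the document should be thrown out and
    parsed again from the start with the new encoding.
    """
    current, new = current.lower(), new.lower()
    if current == new:
        # Nothing to change.
        return True
    if current in NOT_ASCII_COMPATIBLE or new in NOT_ASCII_COMPATIBLE:
        # ASCII-range bytes may not mean ASCII characters here.
        return False
    # Only pure-ASCII input is assumed to decode identically in both.
    return consumed.isascii()
```

For example, `can_switch_in_place(b'<meta charset="shift_jis">', "windows-1252", "shift_jis")` would allow the switch, while any input containing a byte above 0x7F (such as `b"caf\xe9"`) would force a restart.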

jking closed this issue 2021-09-29 18:40:24 -04:00
Reference: MensBeam/HTML-Parser#3