diff --git a/README.md b/README.md index e96061c..24f4df2 100644 --- a/README.md +++ b/README.md @@ -24,11 +24,15 @@ This library and [masterminds/html5](https://packagist.org/packages/masterminds/ | | Masterminds | MensBeam | |-----------------------------------------------------|---------------------------------------|----------------------------------------| | Extensions required | dom, ctype, mbstring or iconv | dom | +| Supported encodings | System-dependent | [Per specification](https://html.spec.whatwg.org/multipage/parsing.html#character-encodings) | +| Encoding detection | None | Byte order mark, HTTP header, [pre-scan](https://html.spec.whatwg.org/multipage/parsing.html#prescan-a-byte-stream-to-determine-its-encoding) | +| Fallback encoding | UTF-8, configurable | Windows-1252, configurable | | Handling of invalid characters | Characters are dropped | [Per specification](https://encoding.spec.whatwg.org/#concept-encoding-process) | | Handling of invalid XML element names | Name is changed to "invalid" | [Per specification](https://html.spec.whatwg.org/multipage/parsing.html#coercing-an-html-dom-into-an-infoset) | | Handling of invalid XML attribute names | Attribute is dropped | [Per specification](https://html.spec.whatwg.org/multipage/parsing.html#coercing-an-html-dom-into-an-infoset) | | Handling of misnested tags | Parent end tags always close children | [Per specification](https://html.spec.whatwg.org/multipage/parsing.html#an-introduction-to-error-handling-and-strange-cases-in-the-parser) | | Handling of data between table cells | Left as-is | [Per specification](https://html.spec.whatwg.org/multipage/parsing.html#an-introduction-to-error-handling-and-strange-cases-in-the-parser) | | Handling of omitted start tags | Elements are not inserted | Per specification | +| Handling of processing instructions | Processing instructions are retained | Per specification | | Time needed to parse single-page HTML specification | 2.8 seconds | 7.0 seconds | | Peak memory needed for same | 38 MB | 13.9 MB |