J. King
3108808c10
Test more MathML properly
3 years ago
J. King
179a21441c
Select and template insertion modes
3 years ago
J. King
68317d838e
Fill in more insertion modes
3 years ago
J. King
dc9dc9953a
Partial adoption agency implementation
3 years ago
J. King
752ab05464
Implement rest of in-body insertion mode
3 years ago
J. King
9758c08da2
Various minor corrections
3 years ago
J. King
1228ecca17
Corrective pass over foreign content stuff
3 years ago
J. King
065f9c97d6
Handle non-foreign fragment cases
3 years ago
J. King
aeb08b5f5d
Fix remaining failures
Fragment-case tests still need to be harnessed to test all functionality
3 years ago
J. King
5376ed43a5
Don't trim whitespace other than newlines in tests
3 years ago
J. King
ab972a838c
Fix DOCTYPE serialization
Also patch all top-level comments out
3 years ago
J. King
eebc45cb82
Patch tests to suppress root comments
3 years ago
J. King
d35e4f909e
Correct error in test harness related to comments
3 years ago
J. King
5f1f02b552
Skip tests requiring unimplemented logic
3 years ago
J. King
1dc3d9c23e
Emit whitespace-only character tokens
This makes tree building simpler in certain circumstances
3 years ago
J. King
504731cba0
Bring coverage backend up to date
3 years ago
J. King
e3a271f06b
Fix first failure in tree builder
3 years ago
J. King
bb4002abcb
Stub the tree builder properly
3 years ago
J. King
eea70eccd8
Test harness for tree construction
3 years ago
J. King
a35e8c8ae5
Update character decoders
3 years ago
J. King
269d0ecc64
Patch tests based on input not unstable identifier
4 years ago
J. King
37aecf97ba
Remove scripted encoding test workaround
The test has been segregated, making the workaround unnecessary
4 years ago
J. King
28f0bbfe72
Suppress only one scripting test
4 years ago
J. King
1f3c33ad9e
Better coverage of BOM-based detection
4 years ago
J. King
06e43504d0
Tweaks
4 years ago
J. King
164e5ff1e8
Add standard charset detection tests
- Various new tests needed for full coverage, noted in comment
- Byte Order Mark detection method added
- Japanese encodings not yet supported, so tests marked incomplete
- Tests requiring scripting suppressed
4 years ago
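
The Byte Order Mark detection named in the commit above can be sketched roughly as follows. This is only an illustration, assuming the three BOMs recognised by the WHATWG Encoding standard; the function name `detect_bom` and the returned labels are hypothetical and are not taken from the project.

```python
from typing import Optional

# BOM byte sequences and the encodings they signal (assumed set: the three
# recognised by the WHATWG Encoding standard).
_BOMS = (
    (b"\xef\xbb\xbf", "UTF-8"),
    (b"\xfe\xff", "UTF-16BE"),
    (b"\xff\xfe", "UTF-16LE"),
)


def detect_bom(data: bytes) -> Optional[str]:
    """Return the encoding signalled by a leading BOM, or None if there is none.

    Hypothetical helper for illustration only.
    """
    for bom, name in _BOMS:
        if data.startswith(bom):
            return name
    return None
```

A caller would typically fall back to other detection steps (transport-layer hints, `<meta>` prescan) when this returns `None`.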
J. King
a7e1083681
Prototype character encoding detection
4 years ago
J. King
c1162f962f
Add missing test
4 years ago
J. King
2aa6bb2dea
Remove unnecessary test abstraction
4 years ago
J. King
49f31015ac
Start on character encoding detection
4 years ago
J. King
318d7bd7ad
Patch remaining test failures away
4 years ago
J. King
00bf9974c5
Fix up most error reporting positions
4 years ago
J. King
58a1177888
Address errors and omissions in error emission
One test still fails, though it is arguably immaterial. This does not
account for line and column number, which are known to be mostly
off by one.
4 years ago
J. King
5a12fa8ad7
Tidying
4 years ago
J. King
ff4447e986
Include pending spec changes tests
4 years ago
J. King
e8f35e92fb
Character reference fixes
One test in the "entities.test" file is still failing
4 years ago
J. King
67c7f382e2
Prep for character references
- Add missing state constants
- Break all existing deviations for character refs
- Add assertions before use of $attribute
- Also fix DOCTYPE state
4 years ago
J. King
43f380c1f9
Fix EOF and end tags
- End tags now emit errors if they have attributes
- End tags now emit errors if they are self-closing
- The last character before EOF is now correctly reconsumed
Also changed the tokenizer debug log to be zero-cost
4 years ago
J. King
4e4aee2edd
Update intl dependency
4 years ago
J. King
6b42f08fbc
Change some if-then-exception blocks to assertions
This has only been done in some parts of the code that are internal
to the parser at large.
4 years ago
J. King
af57117c23
Silence parse errors for now
4 years ago
J. King
bb2a7b5a95
Rewrite how parse errors are handled
Everything which can emit a parse error should have the error handler
and data stream as properties and use the ParseErrorEmitter trait to
avoid complicating the task of actually producing an error.
Normally the Parser would be expected to set the error handler before it
begins (this commit does not do this) and unset it after it's done.
Alternatively, the entire means of reporting errors can now be easily
replaced.
4 years ago
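
The arrangement described in the commit above (each error-emitting component holds the error handler and data stream as properties and shares one emitting routine) might look something like this sketch. It uses a Python mixin in place of a trait; every name here (`ParseErrorEmitter` as a mixin, `emit_error`, `Data`, `Tokenizer`, the handler signature) is an assumption made for illustration, not the project's actual API.

```python
from typing import Callable, Optional

# Assumed handler signature: (error code, line, column).
ErrorHandler = Callable[[str, int, int], None]


class Data:
    """Hypothetical stand-in for the data stream; it tracks the current position."""

    def __init__(self, text: str) -> None:
        self.text = text
        self.line = 1
        self.column = 0


class ParseErrorEmitter:
    """Mixin playing the role the commit assigns to the trait."""

    error_handler: Optional[ErrorHandler] = None
    data: Optional[Data] = None

    def emit_error(self, code: str) -> bool:
        # With no handler set, errors are dropped silently.
        if self.error_handler is None or self.data is None:
            return False
        self.error_handler(code, self.data.line, self.data.column)
        return True


class Tokenizer(ParseErrorEmitter):
    """Any component that can emit parse errors just mixes the emitter in."""

    def __init__(self, data: Data, error_handler: Optional[ErrorHandler] = None) -> None:
        self.data = data
        self.error_handler = error_handler
```

Because the handler is an ordinary property, the parser can set it before parsing and unset it afterwards, or swap in a different reporting mechanism entirely.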
J. King
d93fe25e58
Combine character tokens in test harness
4 years ago
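
Combining character tokens in a test harness usually means merging adjacent character tokens before comparing against expected output, since a tokenizer may emit them one character at a time. A minimal sketch, assuming tokens are represented as `(kind, data)` tuples (a hypothetical representation, not the project's):

```python
from typing import List, Tuple

Token = Tuple[str, str]  # (kind, data); assumed shape for illustration


def combine_character_tokens(tokens: List[Token]) -> List[Token]:
    """Merge each run of consecutive "Character" tokens into a single token."""
    combined: List[Token] = []
    for kind, data in tokens:
        if kind == "Character" and combined and combined[-1][0] == "Character":
            # Extend the previous character token instead of appending a new one.
            combined[-1] = ("Character", combined[-1][1] + data)
        else:
            combined.append((kind, data))
    return combined
```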
J. King
1beb934789
Add more tests
4 years ago
J. King
f360206a34
Basic endless loop helper
4 years ago
J. King
1386eb103c
Fix test transformer
4 years ago
J. King
1971892635
Basic skeleton of test suite
4 years ago
J. King
9df201f663
Remove erroneously added sub-repository
6 years ago
J. King
0d7a0a3367
Change tokenizer constant references to self::
Changing static:: to self:: makes constant de-referencing a
compile-time operation rather than run-time, potentially improving
performance. As constants cannot be overridden by extending classes,
there is no advantage to using static:: for these constants
6 years ago