J. King
be2134cc71
API re-organization
4 years ago
J. King
d580e93e52
ISO 2022-JP encoder tests and fixes
4 years ago
J. King
a57dde6dbd
Style fixes
4 years ago
J. King
16f411c767
Prototype ISO 2022-JP encoder
The encoder currently operates only on single code points, but will later be
expanded to operate on iterables to construct complete strings. For encodings
other than ISO 2022-JP this is merely a convenience, but the algorithm for
that encoding mandates that encoded strings terminate in a switch to ASCII
mode, which a single-character encoder cannot accomplish by itself.
4 years ago
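The constraint described in the commit message above can be illustrated with a minimal Python sketch. This is not the library's PHP code. The class name Iso2022JpSketch and its methods are hypothetical, and the sketch assumes input limited to ASCII and JIS X 0208 characters; it borrows Python's euc_jp codec to look up JIS X 0208 bytes and ignores the Roman and Katakana modes and the error cases a full encoder has to handle. A per-character call can emit the escape into JIS X 0208 mode, but only a separate end-of-string step can emit the final switch back to ASCII.

```python
ESC_ASCII = b"\x1b(B"      # escape sequence: switch to ASCII mode
ESC_JIS0208 = b"\x1b$B"    # escape sequence: switch to JIS X 0208 mode


class Iso2022JpSketch:
    """Hypothetical stateful encoder; ASCII and JIS X 0208 only."""

    def __init__(self):
        self.mode = "ascii"

    def encode_char(self, ch):
        """Encode one code point, emitting a mode switch first if needed."""
        out = b""
        if ord(ch) < 0x80:
            if self.mode != "ascii":
                out += ESC_ASCII
                self.mode = "ascii"
            return out + ch.encode("ascii")
        # Assumption: a two-byte EUC-JP sequence is the JIS X 0208 pair
        # with 0x80 added to each byte; a 0x8E lead byte marks half-width
        # katakana, which this sketch does not support.
        raw = ch.encode("euc_jp")
        if len(raw) != 2 or raw[0] == 0x8E:
            raise ValueError("character is outside this sketch's scope")
        if self.mode != "jis0208":
            out += ESC_JIS0208
            self.mode = "jis0208"
        return out + bytes(b - 0x80 for b in raw)

    def finish(self):
        """Emit the closing switch to ASCII, if the encoder is not there."""
        if self.mode == "ascii":
            return b""
        self.mode = "ascii"
        return ESC_ASCII

    def encode(self, text):
        """Encode a whole string, including the terminating switch."""
        return b"".join(self.encode_char(c) for c in text) + self.finish()
```

Calling encode_char() alone on a Japanese character leaves the output in JIS X 0208 mode; only encode(), which sees the end of the input, can append the terminating escape sequence.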
J. King
cdd1c0182b
Corrected ISO 2022-JP decoder and seeker
4 years ago
J. King
53b27d1a55
Correct buggy Shift_JIS tests
4 years ago
J. King
96846d061c
Complete Shift_JIS testing
4 years ago
J. King
d45e0be7c3
Typo
4 years ago
J. King
915aa7ca93
Finally fix Shift_JIS seeker
4 years ago
J. King
ef9932ffcb
Correct various Shift_JIS errors
4 years ago
J. King
d9b8cd8dd1
Fixes for multi-byte index-based encoders
- array_flip() retains the last duplicate, when we need the first
- Indexes are now prepared with a list of first-duplicate code points
  to search before flipping
- This affected only U+3000 in GBK
- Big5 did not use array_flip(), but its list of override code points
  did not include U+2561; Big5 now flips like the others
- EUC-JP had a long list of errors, but this encoding was not
  previously released
- Shift_JIS indexes are probably still incorrect
4 years ago
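The duplicate problem described in the commit message above can be shown with a small Python analogue. This is not the project's PHP code, and it does not reproduce the commit's actual fix, which searches a prepared list of first-duplicate code points before flipping; it simply contrasts a naive flip with a first-wins flip. The function names and the toy pointer values are hypothetical. Inverting a pointer-to-code-point index with a plain dictionary comprehension keeps the last pointer for a duplicated code point, much as array_flip() does, whereas an encoder index needs the first.

```python
def flip_last_wins(index):
    """Naive inversion: a later pointer overwrites an earlier one."""
    return {code_point: pointer for pointer, code_point in index.items()}


def flip_first_wins(index):
    """Inversion that keeps the lowest pointer for each code point."""
    flipped = {}
    for pointer in sorted(index):
        # setdefault() stores the pointer only if the code point is new
        flipped.setdefault(index[pointer], pointer)
    return flipped


# Toy index (hypothetical pointers): U+3000 appears at pointers 1 and 5
toy_index = {1: 0x3000, 2: 0x3001, 5: 0x3000}

naive = flip_last_wins(toy_index)
correct = flip_first_wins(toy_index)
```

With the toy data, the naive flip maps U+3000 to pointer 5, while the first-wins flip maps it to pointer 1.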
J. King
9e812ffdf8
Second stab at Shift_JIS
- Decoder implemented, with correct table
- Modernized decoder; may have bugs
- Backwards seeker implemented, hopefully, though it does not yet pass the fuzzer
4 years ago
J. King
b284056644
Encode correct duplicate pointers in EUC-JP
4 years ago
J. King
46b6ac3c44
Complete and correct EUC-JP implementation
4 years ago
J. King
85f06186f2
Partial Shift_JIS implementation
5 years ago
J. King
f9e3d795a7
Add label matcher
5 years ago
J. King
106167ab39
Cleanup
6 years ago
J. King
74d8e07a65
Fully corrected WPT test data for EUC-JP
6 years ago
J. King
8dfb1ba984
Initial implementation of EUC-JP
6 years ago
J. King
2810ed9b2a
Full tests for EUC-KR
6 years ago
J. King
929d55cffe
Encode whitespace code points correctly in browser tests
6 years ago
J. King
1121f32e96
Minor Big5 corrections
6 years ago
J. King
c4cdbdd5c8
Initial implementation of EUC-KR
6 years ago
J. King
bfc6c677c5
Complete Big5 tests, with numerous fixes
6 years ago
J. King
32d7fc47b0
Fix HTML test generator; clean up
6 years ago
J. King
55cbc915c3
Refactor HTML-based test generators
6 years ago
J. King
5967d148c0
Consolidate index generation into a single, better script
6 years ago
J. King
3b8db5822a
Add all available tests to platform test generator
6 years ago
J. King
4a091610e9
Initial implementation of Big5 encoding
Only the decoder is tested, and even that requires more thorough testing.
Testing of seeking and encoding is still to come.
6 years ago
J. King
4c686aa8a1
Complete battery of tests for gb18030
6 years ago
J. King
1b9889914a
Fix numerous bugs with gb18030
6 years ago
J. King
467c565e8c
Implement gb18030 seeking
Also fix some bugs in EOF handling
6 years ago
J. King
40d0054bd1
Implement gb18030 and GBK encoders
6 years ago
J. King
d6747532cd
Implement gb18030 decoder
6 years ago
J. King
7de6d7a6fc
Implement ISO-8859-6 single-byte encoding
Other single-byte encodings to follow
6 years ago