MensBeam/Lit - Lit

Author	SHA1	Message	Date
Dustin Wilson	b2ae3be4a7	Subpattern tokenization now maintains line length	3 years ago
Dustin Wilson	6eccc22196	Minor fixes, added capture token splicing	3 years ago
Dustin Wilson	b7e1353821	Subpatterns now limited to their parent pattern's length (if necessary) • Removed Token class in favor of associative arrays in anticipation of token manipulation in captures. (ugh)	3 years ago
Dustin Wilson	005e394076	Removed weak references from grammars • Originally I had a concept of a readonly node tree for grammars with nodes owning other nodes thinking it would be necessary when tokenizing. It isn't, so they're more trouble than they're worth. • "ownership" in Grammar\Reference objects is handled by an ownerGrammarScopeName property which is then used to get the grammar from the GrammarRegistry.	3 years ago
Dustin Wilson	eb9c34a024	Minor nit picking	3 years ago
Dustin Wilson	cba093dd68	Removed getting data from file • Added pattern match anchor support. • Data is now an instanced class with support only for string input. • Data now has firstLine, lastLine, and lastLineBeforeFinalNewLine properties to facilitate anchoring • Highlight now has a static toDOM method for highlighting to a DOM tree instead of the withFile and withString methods for accepting different kinds of input • Tokenizer now only outputs newline tokens if not the last line • Tokenizer now throws out pattern match regexes if their anchors are invalid for the current line. • Tokenizer now won't mistakenly emit empty string tokens.	3 years ago
Dustin Wilson	c055e9f3ba	Fixed tokenization of pattern and capture leftovers	3 years ago
Dustin Wilson	088a28270b	Progress	3 years ago
Dustin Wilson	adf7cd7331	Misunderstood matching process, still broken lol • Before the first pattern's regex to match the line would be processed into tokens. This apparently is incorrect. Instead, the pattern regex that has an offset that is closest to the offset wins. Changes reflect this.	3 years ago
Dustin Wilson	0dbbc67f1a	More minor tweaks/fixes	3 years ago
Dustin Wilson	8f56d15b68	Very minor cleanup	3 years ago
Dustin Wilson	699aeebf93	Minor fixes, still broken lol • When parsing JSON grammars match regexes now only escape unescaped forward slashes. • When parsing JSON grammars match regexes now truncate unicode character codes larger than 0x10ffff to 0x10ffff, the largest possible unicode character. • Content names should only be applied to what is between begin/end patterns. Might need to fix to not apply to end patterns themselves.	3 years ago
Dustin Wilson	2577852090	Still broken • Added a flag for begin patterns • Trying to handle begin/end patterns better. Begin patterns shouldn't automatically remove themselves from the stack, their corresponding end pattern should instead.	3 years ago
Dustin Wilson	4f09139e3b	Various fixes, tokenization is however now an infinite loop :/ • Added preliminary transformation of out-of-range codepoints in matches • Fixed adoption of Grammar\Pattern objects. • Fixed retrieval of Grammar\RepositoryReferences.	3 years ago
Dustin Wilson	5adf6b3107	A bit more	3 years ago
Dustin Wilson	5027113596	Started injections, broken lol	3 years ago
Dustin Wilson	2f7f14dea1	Scope names now resolve, starting on first line and last line anchors	3 years ago
Dustin Wilson	63a5fb7367	One full line tokenizes lol	3 years ago
Dustin Wilson	ad23bf4c4d	Tokenization progress	3 years ago
Dustin Wilson	4ed8ffcd26	Reverting to using UTF-8 and preg_match. mb_ereg is garbage	3 years ago
Dustin Wilson	5a3322a0cb	Many changes • Lines are now converted to UTF-32 while tokenizing so that byte offsets may be cleanly converted to character offsets • Now when grammars are parsed into Grammar objects begin and end matches are converted to regular matches by adding end matches to the pattern's pattern list to simplify tokenization. • Highlight::withFile and Highlight::withString now accept an encoding parameter which defaults to UTF-8.	3 years ago
Dustin Wilson	7717827259	Breaking Tokenizer	3 years ago
Dustin Wilson	bb4b90a7b0	Setting up Tokenizer for recursion	3 years ago
Dustin Wilson	a12ec9dbfc	Fixes to references in grammars, adoption of new owner grammars	3 years ago
Dustin Wilson	53151a674c	Break all the things!	3 years ago
Dustin Wilson	1763653eca	Tokenizing stuff... maybe? :)	3 years ago
Dustin Wilson	457cf39a56	Move Grammar\Registry to GrammarRegistry	3 years ago
Dustin Wilson	33e411ec63	Trying to start code tokenization	3 years ago
Dustin Wilson	f592d93e23	Added prefix retrieval	3 years ago
Dustin Wilson	379d9c791d	Maybe have scope selector matchers working	3 years ago
Dustin Wilson	7a3d0e5d2e	Starting to add matchers	3 years ago
Dustin Wilson	6a9fd79f11	Scope parsing fixes	3 years ago
Dustin Wilson	a4707060c0	Fixing bugs in scope serialization	3 years ago
Dustin Wilson	4147a207d4	Started adding serialization for scopes	3 years ago
Dustin Wilson	8d6c51a2d0	Use assertions for debugging scope parsing	3 years ago
Dustin Wilson	7e918f48be	Starting rewriting scope parsing	3 years ago
Dustin Wilson	cdc53456e0	Added cache for scope parsing	3 years ago
Dustin Wilson	d83916a57e	Includes now work like lazy WeakReferences except self	3 years ago
Dustin Wilson	dcb00c001f	Changed Pattern to Rule to be consistent with other implementations	3 years ago
Dustin Wilson	d6b55c8678	Grammar registry now automatically pulls from JSON pool	3 years ago
Dustin Wilson	e5869f8a8e	Reorganized data folder	3 years ago
Dustin Wilson	676971850e	Minor cleanup	3 years ago
Dustin Wilson	0a7b0e4f28	Cleaned up JSON capture list reading	3 years ago
Dustin Wilson	2be674cd2a	Added some comments to grammars and cleaned up exceptions	3 years ago
Dustin Wilson	900e462bcd	Wrote JSON to Grammar converter, successfully parses all grammars tested	3 years ago
Dustin Wilson	e2cc9adbd8	Working on Grammars	3 years ago
Dustin Wilson	5ae2d256d3	Cleaning up a bit	3 years ago
Dustin Wilson	e27408c662	Improvements to run command, starting on Grammar	3 years ago
Dustin Wilson	785c03b1f8	Trying to figure out structure	3 years ago
Dustin Wilson	5edc6d32b3	Changed project name to Lit lol	3 years ago

1 2

75 Commits (b2ae3be4a70e3bab4d981465f08a25ba13e3b8be)