• I -really- hate debugging this because there's no reference to go by to ensure things are correct except trial and error.
• Sometimes when resolving scope names the wrong match would end up in the name.
• Because of how references are handled in this implementation, popping the rule and scope stacks can sometimes leave behind a leftover pattern containing a single reference. That caused havoc, so a workaround is needed to circumvent it. It can probably be simplified in the future because checking against the end pattern the way it does now probably isn't necessary, but it works at present.
• Fixed bug where nonexistent grammars would cause tokenizer to fail.
• Added mensbeam/html as a dependency, removed docopt/docopt and
• Discovered bug when injections are removed from the stack when
• When calculating the offset after handling overlapping tokens, the tokenizer is now aware of invalid capture offsets (meaning the capture matched nothing).
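As an illustrative sketch of the idea (Python, with assumed names — not the project's actual code): Oniguruma- and PCRE-style engines report an offset pair of (-1, -1) for capture groups that did not participate in a match, and treating those as real positions would rewind the scanner.

```python
def next_offset(match_end: int, capture_offsets: list[tuple[int, int]]) -> int:
    """Pick the next scan offset, skipping captures that matched nothing.

    A capture reported as (-1, -1) did not participate in the match;
    using it as a position would incorrectly pull the offset backwards.
    """
    valid_ends = [end for (start, end) in capture_offsets
                  if start != -1 and end != -1]
    return max([match_end, *valid_ends])
```

Here the invalid capture is simply ignored rather than contributing a bogus offset of -1 or 0.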
• Tokenizer::tokenizeLine now correctly does not continue looking for new matches when the newly tokenized pattern was an end pattern.
• Grammars no longer have beginCaptures incorrectly applied to end patterns.
• Originally I had a concept of a readonly node tree for grammars, with nodes owning other nodes, thinking it would be necessary when tokenizing. It isn't, so the nodes are more trouble than they're worth.
• "ownership" in Grammar\Reference objects is handled by an ownerGrammarScopeName property which is then used to get the grammar from the GrammarRegistry.
• Added pattern match anchor support.
• Data is now an instanced class with support only for string input.
• Data now has firstLine, lastLine, and lastLineBeforeFinalNewLine properties to facilitate anchoring.
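A minimal sketch of what those properties might look like (Python; the property names mirror the ones above, but the logic here is an assumption, not the project's implementation):

```python
class Data:
    """Line-oriented view of string input for anchoring decisions."""

    def __init__(self, data: str) -> None:
        # Splitting on "\n" leaves a trailing empty string when the
        # input ends with a final newline.
        self._lines = data.split("\n")

    def firstLine(self, index: int) -> bool:
        return index == 0

    def lastLine(self, index: int) -> bool:
        return index == len(self._lines) - 1

    def lastLineBeforeFinalNewLine(self, index: int) -> bool:
        # The line before the trailing empty string is the last "real"
        # line when the input ends with a newline.
        if self._lines and self._lines[-1] == "":
            return index == len(self._lines) - 2
        return self.lastLine(index)
```

Distinguishing the last line from the last line before a final newline matters for anchors like \z, which should only be valid at the true end of the input.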
• Highlight now has a static toDOM method for highlighting to a DOM tree, replacing the withFile and withString methods for accepting different kinds of input.
• Tokenizer now only outputs newline tokens if not on the last line.
• Tokenizer now throws out pattern match regexes if their anchors are invalid for the current line.
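The check could look something like this sketch (Python, assumed names; the real implementation may rewrite or compile the regexes differently): anchors like \A can only ever match on the first line of the input and \z / \Z only on the last, so patterns carrying them are discarded for any other line.

```python
import re

def anchors_valid(regex: str, first_line: bool, last_line: bool) -> bool:
    """Decide whether a pattern's anchors can match on the current line.

    \\A only matches at the start of the document and \\z / \\Z only at
    its end; on other lines such patterns can never match and are
    thrown out. Escaped backslashes (e.g. the literal text "\\\\A") are
    not treated as anchors.
    """
    # Find anchor escapes preceded by an even number of backslashes.
    unescaped = re.findall(r'(?<!\\)(?:\\\\)*\\([AzZ])', regex)
    if 'A' in unescaped and not first_line:
        return False
    if ('z' in unescaped or 'Z' in unescaped) and not last_line:
        return False
    return True
```

TextMate-style grammars also use \G (match at the position where the begin pattern ended), which would need similar per-position handling; it is omitted here for brevity.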
• Tokenizer now won't mistakenly emit empty string tokens.
• Previously, the first pattern whose regex matched the line would be processed into tokens. This apparently is incorrect. Instead, the pattern whose match offset is closest to the current offset wins. Changes reflect this.
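The selection rule above can be sketched like this (Python, assumed names; tie-breaking on pattern order is an assumption mirroring typical TextMate behaviour):

```python
import re

def best_match(patterns: list[str], line: str, offset: int):
    """Among all patterns matching at or after `offset`, return the
    match that starts closest to `offset`; ties go to the pattern that
    appears earlier in the list."""
    best = None
    for pattern in patterns:
        m = re.compile(pattern).search(line, offset)
        if m is not None and (best is None or m.start() < best.start()):
            best = m
    return best
```

So given the line "xxaab", a pattern matching "a+" at offset 2 beats a pattern matching "b+" at offset 4 even if "b+" is listed first, which is the behaviour the change describes.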
• When parsing JSON grammars match regexes now only escape unescaped
• When parsing JSON grammars, match regexes now truncate Unicode
character codes larger than 0x10FFFF to 0x10FFFF, the largest possible
Unicode code point.
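A sketch of that clamping step (Python; the \x{...} escape syntax is Oniguruma's, and the function name is hypothetical):

```python
import re

MAX_CODEPOINT = 0x10FFFF  # largest valid Unicode code point

def clamp_codepoints(regex: str) -> str:
    """Truncate \\x{...} escapes above U+10FFFF down to U+10FFFF so the
    regex engine is never handed an impossible code point."""
    def repl(m: re.Match) -> str:
        value = int(m.group(1), 16)
        return r'\x{%X}' % min(value, MAX_CODEPOINT)
    return re.sub(r'\\x\{([0-9A-Fa-f]+)\}', repl, regex)
```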
• Content names should only be applied to what is between begin/end
patterns. Might need a fix so they are not applied to the end patterns
themselves.