1
I Use This!
Low Activity

Commits : Listings

Analyzed 1 day ago. based on code collected 1 day ago.
Oct 17, 2024 — Oct 17, 2025
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
* Begin work on full PTB-compatible English tokenization More... over 11 years ago
* Compile string_tools module More... over 11 years ago
* Tests passing for reorganized version More... over 11 years ago
* Tests passing for reorganized version More... over 11 years ago
* Reorganized, moving language-independent stuff to spacy. The functions in spacy ask for the dictionaries and split function on input, but the language-specific modules are curried versions that use the globals More... over 11 years ago
* Working tokenization. en doesn't match PTB perfectly. Need to reorganize before adding more schemes. More... over 11 years ago
* Reading in tokenization rules correctly. Passing tests. More... over 11 years ago
* Rejigged tests. Working possessives, but no other contractions More... over 11 years ago
* Fixes to tokenization. Now segment sequences of the same punctuation. More... over 11 years ago
* Possessive test passing More... over 11 years ago
* Initial commit. Tests passing for punctuation handling. Need contractions, file transport, tokenize function, etc. More... over 11 years ago
* Add ext stuff, while I figure out how to get it working as a different project More... over 11 years ago
* Add gitignore More... over 11 years ago
* Add initial docs stuff More... over 11 years ago
* Add build/setup stuff More... over 11 years ago
Initial commit More... over 11 years ago