0
I Use This!
Moderate Activity

Commits : Listings

Analyzed 1 day ago. based on code collected 1 day ago.
Jan 25, 2025 — Jan 25, 2026
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
Add github codespace support (#312) More... 21 days ago
Ignore .claude More... 21 days ago
Remove Python 3.9 support, add Python 3.14 support (#311) More... about 1 month ago
Move heuristics to SingleByteCharSetGroupProber More... 2 months ago
xfail by EncodingEra in test.py More... 2 months ago
Remove Windows-1250 Romanian files that were actually ISO-8859-16 More... 2 months ago
Add active_probers property to fix bug More... 2 months ago
Tweak tests so that we use ALL encoding era by default More... 2 months ago
Add nested_probers property to simplify some post-processing More... 2 months ago
Properly fill out language_filter for each charset More... 2 months ago
Remove EUC-TW test data since we no longer support it More... 2 months ago
Remove HEURISTICS.md since it was just ideas I was working on with Claude More... 2 months ago
Simplify encoding era handling some more More... 2 months ago
Add explanations for filter_international_words to help future devs More... 2 months ago
Put back INTERNATIONAL_WORDS_PATTERN because it would skip parts of words otherwise More... 2 months ago
More little cleanup things around encoding eras and tie breaking More... 2 months ago
Update EXPECTED_FAILURES for now More... 2 months ago
Add encoding preference tiers for tie-breaking More... 2 months ago
Improve MacRoman vs ISO-8859/Windows detection tie-breaking More... 2 months ago
Add test to verify all single-byte encodings have probers More... 2 months ago
Add missing CP500 EBCDIC model registrations More... 2 months ago
Add missing CP037 EBCDIC model registrations More... 2 months ago
Fix UTF-16/32 detection for non-ASCII heavy text More... 2 months ago
Remove a bunch of state machines from the all-invalid-sequences test because the state machines are too simple to actually do that More... 2 months ago
Retrain models after latest updates More... 2 months ago
Add a bunch of new test files More... 2 months ago
Add all supported encodings and languages to charsets and languages More... 2 months ago
Update charsets metadata to have a dict that maps from names to metadata More... 2 months ago
Retrain models with legacy substitutions factored into char_to_order_maps More... 2 months ago
Update metadata about supported encodings More... 2 months ago