C

chardet

Settings | Report Duplicate

0

I Use This!

Moderate Activity

Commits : Listings

Analyzed 1 day ago. based on code collected 1 day ago.

Commit Message	Contributor	Files Modified	Lines Added	Lines Removed	Code Location	Date
Jan 25, 2025 — Jan 25, 2026 Showing page 1 of 18 Search / Filter on:
Add github codespace support (#312)	oxygen dioxide	More...				21 days ago
Ignore .claude	Dan Blanchard	More...				21 days ago
Remove Python 3.9 support, add Python 3.14 support (#311)	Dan Blanchard	More...				about 1 month ago
Move heuristics to SingleByteCharSetGroupProber	Dan Blanchard	More...				2 months ago
xfail by EncodingEra in test.py	Dan Blanchard	More...				2 months ago
Remove Windows-1250 Romanian files that were actually ISO-8859-16	Dan Blanchard	More...				2 months ago
Add active_probers property to fix bug	Dan Blanchard	More...				2 months ago
Tweak tests so that we use ALL encoding era by default	Dan Blanchard	More...				2 months ago
Add nested_probers property to simplify some post-processing	Dan Blanchard	More...				2 months ago
Properly fill out language_filter for each charset	Dan Blanchard	More...				2 months ago
Remove EUC-TW test data since we no longer support it	Dan Blanchard	More...				2 months ago
Remove HEURISTICS.md since it was just ideas I was working on with Claude	Dan Blanchard	More...				2 months ago
Simplify encoding era handling some more	Dan Blanchard	More...				2 months ago
Add explanations for filter_international_words to help future devs	Dan Blanchard	More...				2 months ago
Put back INTERNATIONAL_WORDS_PATTERN because it would skip parts of words otherwise	Dan Blanchard	More...				2 months ago
More little cleanup things around encoding eras and tie breaking	Dan Blanchard	More...				2 months ago
Update EXPECTED_FAILURES for now	Dan Blanchard	More...				2 months ago
Add encoding preference tiers for tie-breaking	Dan Blanchard	More...				2 months ago
Improve MacRoman vs ISO-8859/Windows detection tie-breaking	Dan Blanchard	More...				2 months ago
Add test to verify all single-byte encodings have probers	Dan Blanchard	More...				2 months ago
Add missing CP500 EBCDIC model registrations	Dan Blanchard	More...				2 months ago
Add missing CP037 EBCDIC model registrations	Dan Blanchard	More...				2 months ago
Fix UTF-16/32 detection for non-ASCII heavy text	Dan Blanchard	More...				2 months ago
Remove a bunch of state machines from the all-invalid-sequences test because the state machines are too simple to actually do that	Dan Blanchard	More...				2 months ago
Retrain models after latest updates	Dan Blanchard	More...				2 months ago
Add a bunch of new test files	Dan Blanchard	More...				2 months ago
Add all supported encodings and languages to charsets and languages	Dan Blanchard	More...				2 months ago
Update charsets metadata to have a dict that maps from names to metadata	Dan Blanchard	More...				2 months ago
Retrain models with legacy substitutions factored into char_to_order_maps	Dan Blanchard	More...				2 months ago
Update metadata about supported encodings	Dan Blanchard	More...				2 months ago

←
1
2
3
4
5
6
7
8
9
…
17
18
→