openhub.net
Black Duck Software, Inc.
Open Hub
chardet
Moderate Activity
Commits: Listings
Analyzed 1 day ago, based on code collected 1 day ago.
Jan 25, 2025 — Jan 25, 2026
Showing page 1 of 18
| Commit Message | Contributor | Date |
| --- | --- | --- |
| Add github codespace support (#312) | oxygen dioxide | 21 days ago |
| Ignore .claude | Dan Blanchard | 21 days ago |
| Remove Python 3.9 support, add Python 3.14 support (#311) | Dan Blanchard | about 1 month ago |
| Move heuristics to SingleByteCharSetGroupProber | Dan Blanchard | 2 months ago |
| xfail by EncodingEra in test.py | Dan Blanchard | 2 months ago |
| Remove Windows-1250 Romanian files that were actually ISO-8859-16 | Dan Blanchard | 2 months ago |
| Add active_probers property to fix bug | Dan Blanchard | 2 months ago |
| Tweak tests so that we use ALL encoding era by default | Dan Blanchard | 2 months ago |
| Add nested_probers property to simplify some post-processing | Dan Blanchard | 2 months ago |
| Properly fill out language_filter for each charset | Dan Blanchard | 2 months ago |
| Remove EUC-TW test data since we no longer support it | Dan Blanchard | 2 months ago |
| Remove HEURISTICS.md since it was just ideas I was working on with Claude | Dan Blanchard | 2 months ago |
| Simplify encoding era handling some more | Dan Blanchard | 2 months ago |
| Add explanations for filter_international_words to help future devs | Dan Blanchard | 2 months ago |
| Put back INTERNATIONAL_WORDS_PATTERN because it would skip parts of words otherwise | Dan Blanchard | 2 months ago |
| More little cleanup things around encoding eras and tie breaking | Dan Blanchard | 2 months ago |
| Update EXPECTED_FAILURES for now | Dan Blanchard | 2 months ago |
| Add encoding preference tiers for tie-breaking | Dan Blanchard | 2 months ago |
| Improve MacRoman vs ISO-8859/Windows detection tie-breaking | Dan Blanchard | 2 months ago |
| Add test to verify all single-byte encodings have probers | Dan Blanchard | 2 months ago |
| Add missing CP500 EBCDIC model registrations | Dan Blanchard | 2 months ago |
| Add missing CP037 EBCDIC model registrations | Dan Blanchard | 2 months ago |
| Fix UTF-16/32 detection for non-ASCII heavy text | Dan Blanchard | 2 months ago |
| Remove a bunch of state machines from the all-invalid-sequences test because the state machines are too simple to actually do that | Dan Blanchard | 2 months ago |
| Retrain models after latest updates | Dan Blanchard | 2 months ago |
| Add a bunch of new test files | Dan Blanchard | 2 months ago |
| Add all supported encodings and languages to charsets and languages | Dan Blanchard | 2 months ago |
| Update charsets metadata to have a dict that maps from names to metadata | Dan Blanchard | 2 months ago |
| Retrain models with legacy substitutions factored into char_to_order_maps | Dan Blanchard | 2 months ago |
| Update metadata about supported encodings | Dan Blanchard | 2 months ago |