openhub.net
Black Duck Software, Inc.
Open Hub
Follow @
OH
Sign In
Join Now
Projects
People
Organizations
Tools
Blog
BDSA
Projects
People
Projects
Organizations
P
python readability lxml
Settings
|
Report Duplicate
1
I Use This!
×
Login Required
Log in to Open Hub
Remember Me
Very Low Activity
Commits
: Listings
Analyzed
1 day
ago. based on code collected
1 day
ago.
Apr 02, 2025 — Apr 02, 2026
Showing page 9 of 9
Search / Filter on:
Commit Message
Contributor
Files Modified
Lines Added
Lines Removed
Code Location
Date
Added command-line usage
Yuri Baburov
More...
almost 15 years ago
Debug utilities.
Yuri Baburov
More...
almost 15 years ago
Add chardet to installation requirements
Jerry Charumilind
More...
almost 15 years ago
Expose Document in readability package
Jerry Charumilind
More...
almost 15 years ago
Change to automatically find packages
Jerry Charumilind
More...
almost 15 years ago
Add version number to track changes more easily
Jerry Charumilind
More...
almost 15 years ago
Allow passing unicode objects
Lee Semel
More...
almost 15 years ago
Updated setup.py to my fork, changed package name to lxml-readability
Yuri Baburov
More...
almost 15 years ago
Renamed encodings to encoding to avoid conflicts with system module.
Yuri Baburov
More...
almost 15 years ago
Added usage
Yuri Baburov
More...
almost 15 years ago
Updated scoring algorithm to match readability.js v1.7.1
Yuri Baburov
More...
almost 15 years ago
Improved title shortener method, and added it to the Document class.
Yuri Baburov
More...
almost 15 years ago
Corrected README
Yuri Baburov
More...
almost 15 years ago
Moved to lxml (based on decruft version); better encoding recognition.
Yuri Baburov
More...
almost 15 years ago
well that was quick; first fork added
gfxmonk
More...
about 15 years ago
added note to readme to make it clear that I'm not actively working on this library
gfxmonk
More...
about 15 years ago
made setup.py executable
Tim Cuthbertson
More...
over 15 years ago
added setup.py
Sean Brant
More...
over 15 years ago
removing empty paragraphs is not very useful, and can break some (stupid) websites
gfxmonk
More...
almost 16 years ago
fixed bug where only immediate text was being considered for weights, instead of all nested text
gfxmonk
More...
almost 16 years ago
failsafe parsing and more logging
gfxmonk
More...
almost 16 years ago
unicode, dammit!
gfxmonk
More...
almost 16 years ago
minor
gfxmonk
More...
almost 16 years ago
modified readme
gfxmonk
More...
almost 16 years ago
split out into content and summary methods
gfxmonk
More...
almost 16 years ago
clean up content method and debug
gfxmonk
More...
almost 16 years ago
use a more leniant parser
gfxmonk
More...
almost 16 years ago
initial
gfxmonk
More...
almost 16 years ago
←
1
2
3
4
5
6
7
8
9
→
This site uses cookies to give you the best possible experience. By using the site, you consent to our use of cookies. For more information, please see our
Privacy Policy
Agree