openhub.net
Black Duck Software, Inc.
Open Hub
python_crawler
Inactive
Commits: Listings
Analyzed 1 day ago, based on code collected 1 day ago.
Feb 13, 2025 — Feb 13, 2026
Added 'requestInterval' parameter to globalData, which adds a delay between each URL request.
David Arbuckle
over 14 years ago
Now parsing control characters out of urls prior to saving to database, and converting to lower case.
David Arbuckle
over 14 years ago
improved efficiency of updatecanonical() by an order of magnitude, replaced One Big Query with a bunch of little indexed selects.
David Arbuckle
over 14 years ago
debug output was throwing errors when an error occurred while loading the page. handled.
David Arbuckle
over 14 years ago
added debug toggle and url-specific print statements to spice up the console output.
David Arbuckle
over 14 years ago
Added indexes to tables. Runs about 10x faster (for now)
David Arbuckle
over 14 years ago
updated description, issues/functionality
David Arbuckle
over 14 years ago
moved queue.join() call to the appropriate location.
David Arbuckle
over 14 years ago
Added sleeps to initial URL retrieval
David Arbuckle
over 14 years ago
Added threading module for URL requests.
David Arbuckle
over 14 years ago
Updated README with better description of project.
David Arbuckle
over 14 years ago
tests continuing...
David Arbuckle
over 14 years ago
testing
David Arbuckle
over 14 years ago
test
David Arbuckle
over 14 years ago
first commit
David Arbuckle
over 14 years ago
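Several of the commit messages above describe crawler mechanics: stripping control characters from URLs and lower-casing them before storage, fetching URLs from worker threads, delaying between requests, and calling queue.join() at the right point. The project's source is not shown on this page, so the following is only a minimal sketch of how those pieces might fit together. The names normalize_url, crawl, fetch, request_interval, and num_workers are hypothetical and are not taken from the repository; request_interval stands in for the 'requestInterval' parameter mentioned in the log.

```python
import queue
import threading
import time


def normalize_url(url):
    """Drop ASCII control characters, trim whitespace, and lower-case the URL."""
    cleaned = "".join(ch for ch in url if ord(ch) >= 32 and ord(ch) != 127)
    return cleaned.strip().lower()


def crawl(urls, fetch, request_interval=0.0, num_workers=4):
    """Fetch each distinct normalized URL on worker threads.

    `fetch` is a caller-supplied function (hypothetical here) that takes a URL
    and returns the page content. Each worker sleeps `request_interval` seconds
    after a request, so the delay applies per worker thread.
    """
    pending = queue.Queue()
    results = {}
    lock = threading.Lock()

    def worker():
        while True:
            url = pending.get()
            if url is None:  # sentinel: no more work for this thread
                pending.task_done()
                return
            try:
                page = fetch(url)
            except Exception as exc:  # record the failure instead of crashing
                page = exc
            with lock:
                results[url] = page
            pending.task_done()
            time.sleep(request_interval)

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for thread in threads:
        thread.start()

    # dict.fromkeys removes duplicates while keeping first-seen order.
    for url in dict.fromkeys(normalize_url(u) for u in urls):
        pending.put(url)

    # Join only after every URL has been queued, so the call blocks until
    # all queued work has been marked done.
    pending.join()

    for _ in threads:
        pending.put(None)
    for thread in threads:
        thread.join()
    return results
```

A caller would pass its own fetch function, for example one built on urllib.request.urlopen. Lower-casing the whole URL follows the commit message; note that URL paths can be case-sensitive on some servers, so this choice trades accuracy for simpler deduplication.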