0
I Use This!
Very Low Activity

Commits : Listings

Analyzed about 22 hours ago. based on code collected about 22 hours ago.
Jan 09, 2025 — Jan 09, 2026
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
Update dependency More... almost 6 years ago
Add lengths to CDX test lines ahead of attempting to fix length indexing. More... almost 6 years ago
Adding initial reusable benchmakr. More... about 6 years ago
Add notebook gathering SolrCloud stats. More... about 6 years ago
Add query performance exploration. More... about 6 years ago
Merge pull request #211 from netarchivesuite/timepad More... over 6 years ago
Merges master and fixes conflicts More... over 6 years ago
Merge pull request #210 from netarchivesuite/compression More... over 6 years ago
Adds unit test for waybackDate padding More... over 6 years ago
Uses the padded crawlDate instead of as-is WARC-Date for waybackDate. This handles issue #188 (no test yet) More... over 6 years ago
Refactors WARCIndexer heavily to clean up and provide compression header to decompress Brotli responses More... over 6 years ago
Switches to archive.org's ArchiveUtils.isGzipped for broader GZip support More... over 6 years ago
Introduces proper test for GZip & Brotli support in the analyzer workflow. Fails on Brotli (as expected) More... over 6 years ago
Merge branch 'master' into compression More... over 6 years ago
Separates headers from body in compression support test More... over 6 years ago
Adds skeleton code for testing compression support More... over 6 years ago
Adds WARCs for testing compression support More... over 6 years ago
Merge pull request #198 from netarchivesuite/droid_url More... over 6 years ago
Merge pull request #208 from netarchivesuite/webrecorder More... over 6 years ago
Merge pull request #209 from netarchivesuite/exif_default More... over 6 years ago
Adds exif_extraction = true to avoid errors on old configs. As EXIF extraction is fairly lightweight this should not introduce performance problems More... over 6 years ago
HtmlAnalyser and TikaPayloadAnalyze will unzip inputstream (if mime-type is not excluded). Warc files created with WebRecorder will have gzip encoding. More... over 6 years ago
HtmlAnalyser and TikaPayloadAnalyze will unzip inputstream (if mime-type is not excluded). Warc files created with WebRecorder will have gzip encoding. More... over 6 years ago
HtmlAnalyser and TikaPayloadAnalyze will unzip inputstream (if mime-type is not excluded). Warc files created with WebRecorder will have gzip encoding. More... over 6 years ago
Don't wait for the job to finish. More... over 6 years ago
Added a test case for the outlink extractor, and cleaned up a bit. More... over 6 years ago
Added me...@name extraction and made it and link@rel normalise to lower case. More... over 6 years ago
Droid WARC URL header sanitize More... about 7 years ago
Merge pull request #196 from netarchivesuite/windowsPathFix More... about 7 years ago
WindowsOS path fix. This closes #194 More... about 7 years ago