0
I Use This!
Low Activity

Commits : Listings

Analyzed 1 day ago. based on code collected 1 day ago.
Oct 14, 2024 — Oct 14, 2025
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
CdxRecord: add format(), surt(), toString(), values() More... 28 days ago
WarcReader: use a little-endian view of the buffer for magic probing More... about 1 month ago
WarcReader: Don't restore buffer ByteOrder More... about 1 month ago
Add support for reading warc-zstd files More... about 1 month ago
[maven-release-plugin] prepare for next development iteration More... 3 months ago
[maven-release-plugin] prepare release v0.32.0 More... 3 months ago
Update CHANGELOG.md for 0.32.0 release More... 3 months ago
Set tagNameFormat in pom.xml More... 3 months ago
Add scm section to pom.xml More... 3 months ago
Switch to Maven Central portal from OSSRH More... 3 months ago
ListTool: Handle ParsingException More... 3 months ago
Avoid double filename/offset in CdxWriter exception output More... 3 months ago
HttpParser: Accept (but ignore) folded reason phrases in lenient mode More... 3 months ago
WarcParser: Don't treat alexa/dat ARC records as HTTP More... 3 months ago
Include source filename and record offset in ParsingException message More... 3 months ago
HttpParser: Handle header lines missing ":" More... 3 months ago
HttpParser: Support non-standard NCSA/1.5.1 HTTP responses missing version More... 3 months ago
ValidateTool: Remove prefix from indented log lines More... 3 months ago
ValidateTool: Document -j, --threads option More... 3 months ago
ValidateTool: Add filename prefix to log output More... 3 months ago
ValidateTool: Implement multithreaded validation More... 3 months ago
Update actions to latest version More... 3 months ago
Remove SOURCE as we aren't building the native binary anymore More... 3 months ago
DedupeTool: Use multiple threads More... 3 months ago
DedupeTool: Print stats, add --dry-run and --quiet options More... 3 months ago
DedupeTool: Add in-memory cache for digest deduplication More... 4 months ago
ExtractTool: Add --concurrent option More... 10 months ago
Add HeaderValidator with a ruleset based on the WARC 1.1 standard More... 11 months ago
Replace broken IPv6 formatting with Guava's toAddrString More... 11 months ago
Use RFC5952 canonical form for IPv6 addresses in WARC-IP-Address More... 11 months ago