0
I Use This!
Very Low Activity

Commits : Listings

Analyzed about 22 hours ago. based on code collected about 22 hours ago.
Jan 09, 2025 — Jan 09, 2026
Commit Message Contributor Files Modified Lines Added Lines Removed Code Location Date
Adding in more data, support JSONL output. More... over 3 years ago
Remove old MDX code, support JSONL output from indexer jobs. More... over 3 years ago
Add JSON encoder, and add Parquet dependency. More... over 3 years ago
Fixes and additions from Hadoop 3 experiments. More... over 3 years ago
Bump jsoup from 1.14.2 to 1.15.3 in /warc-indexer More... over 3 years ago
Add separate Solr9 runner. More... over 3 years ago
Switch to suffix model. More... over 3 years ago
Update ci-build-and-push.yml More... over 3 years ago
Add a separate Solr9 image build. More... over 3 years ago
Strip down Typesafe Config integration. More... over 3 years ago
Ensure HTTP Content-Type is used over the Record-level Content-Type. More... over 3 years ago
Add test for #289 and fall-back on the served content type when format ID fails. More... over 3 years ago
Update headers. More... over 3 years ago
Merge pull request #292 from ukwa/dependabot/maven/digipres-tika/com.itextpdf-itextpdf-5.5.12 More... over 3 years ago
Bump itextpdf from 5.2.0 to 5.5.12 in /digipres-tika More... over 3 years ago
Merge pull request #269 from aponb/elastic2opensearch_migration More... almost 4 years ago
Merge branch 'master' into elastic2opensearch_migration More... almost 4 years ago
Merge pull request #263 from netarchivesuite/elastic_text More... almost 4 years ago
Merge pull request #270 from netarchivesuite/warcit More... almost 4 years ago
Merge pull request #257 from netarchivesuite/field_rewrite More... almost 4 years ago
Merge branch 'master' into field_rewrite More... almost 4 years ago
Fix whitespace collapsing and add unit test More... almost 4 years ago
Make tests from #283 consistent with newer JSoup behaviour. More... almost 4 years ago
Merge pull request #278 from ukwa/dependabot/maven/warc-indexer/xerces-xercesImpl-2.12.2 More... almost 4 years ago
Merge pull request #262 from ukwa/dependabot/maven/warc-indexer/org.elasticsearch-elasticsearch-7.14.0 More... almost 4 years ago
Merge pull request #260 from ukwa/dependabot/maven/warc-indexer/org.jsoup-jsoup-1.14.2 More... almost 4 years ago
Merge pull request #283 from netarchivesuite/host_normalize More... almost 4 years ago
Extend host and domain unit test More... almost 4 years ago
Add JavaDoc and comments More... almost 4 years ago
Harden host-extraction/validation. This closes #281 but might be too strict More... almost 4 years ago