Open Hub
Web Archive Discovery
Very Low Activity
Commits: Listings
Analyzed about 22 hours ago, based on code collected about 22 hours ago.
Jan 09, 2025 — Jan 09, 2026
Showing page 5 of 54
Commit Message | Contributor | Date
Tweak to build on pull requests. | Andy Jackson | about 4 years ago
Merge pull request #268 from netarchivesuite/maven381 | Andy Jackson | about 4 years ago
Adding a GitHub Action build. | Andy Jackson | about 4 years ago
Switch the version99 repo from http to https. This closes #267 | Toke Eskildsen | about 4 years ago
Bugfix: The SingleFileDocumenConsumer did not flush on endWARC() calls | Toke Eskildsen | over 4 years ago
Bugfix: Reduce log statement to the number of docs instead of docs content | Toke Eskildsen | over 4 years ago
Remove extra .jar from the produced witt-dependencies JAR | Toke Eskildsen | over 4 years ago
Enable text output when using Elasticsearch, mirroring Solr behaviour | Toke Eskildsen | over 4 years ago
Bump elasticsearch from 7.13.3 to 7.14.0 in /warc-indexer | dependabot[bot] | over 4 years ago
Move explicit handling of URL length to the generic handling of Solr field content | Toke Eskildsen | over 4 years ago
Merged from master | Toke Eskildsen | over 4 years ago
Set max length for url and url_norm to 2000 (down from 2048) to mimick old behaviour | Toke Eskildsen | over 4 years ago
Bump jsoup from 1.13.1 to 1.14.2 in /warc-indexer | dependabot[bot] | over 4 years ago
Merge pull request #248 from ukwa/dependabot/maven/warc-indexer/commons-io-commons-io-2.7 | Andy Jackson | over 4 years ago
Merge pull request #258 from netarchivesuite/doc-consumer | Andy Jackson | over 4 years ago
Merge pull request #259 from ukwa/dependabot/maven/warc-indexer/org.elasticsearch-elasticsearch-7.13.3 | Andy Jackson | over 4 years ago
Bump elasticsearch from 7.12.0 to 7.13.3 in /warc-indexer | dependabot[bot] | over 4 years ago
Add batch_size to reference config | Toke Eskildsen | over 4 years ago
Use TypeSafe's getBytes instead og getint and getLong for batch sizes | Toke Eskildsen | over 4 years ago
Bugfix timing feedback | Toke Eskildsen | over 4 years ago
Clean up code | Toke Eskildsen | over 4 years ago
Improve feedback | Toke Eskildsen | over 4 years ago
toString() | Toke Eskildsen | over 4 years ago
Harden argument check | Toke Eskildsen | over 4 years ago
Enable DocumentConsumer for the WARCIndexerCommand | Toke Eskildsen | over 4 years ago
Instrument document additions | Toke Eskildsen | over 4 years ago
Handle 1 file/warc vs. 1 file/warc-record | Toke Eskildsen | over 4 years ago
Implement DocumentConsumerFactory | Toke Eskildsen | over 4 years ago
Flesh out document consumer implementations | Toke Eskildsen | over 4 years ago
Merge branch 'master' into field_rewrite | Toke Eskildsen | over 4 years ago