Commits : Listings

Analyzed about 22 hours ago, based on code collected about 22 hours ago.
Jan 09, 2025 — Jan 09, 2026
Commit Message (Date)
Added in the generic RegEx indexer code, along with some other code fragments concerning entity extraction. (over 13 years ago)
Simplified logging. (over 13 years ago)
Improved logging configuration. (over 13 years ago)
Added logging when a new file is opened, and changed debug logging in the mapper. (over 13 years ago)
Cleaner error handling, and switched the ArchiveReader into 'strict' mode to avoid infinite loop. (over 13 years ago)
Made RecordReader error logging less verbose (no stacktraces needed). Updated Maven build to Tika 1.1. (over 13 years ago)
Fixed up defaults and record skipping when there is an empty string. Added some basic logging. (over 13 years ago)
Added in Hadoop jar as 'provided'. (almost 14 years ago)
Switched Hadoop jar to 'provided'. (almost 14 years ago)
Updated Restlet code to use 2.0.12 (stable) API instead of unsupported 2.0-M3 API (which is not available in Maven central or even maven.restlet.org AFAICT). (almost 14 years ago)
Removed unnecessary files, and tweaked the Hadoop pom, adding archive file deps. (almost 14 years ago)
Removed unnecessary files, and tweaked the Hadoop pom, adding archive file deps. (almost 14 years ago)
Resolved conflict and added archive format deps. (almost 14 years ago)
Fixed permissions, as many files were mistakenly set as executable. (almost 14 years ago)
Fixed permissions, as many files were mistakenly set as executable. (almost 14 years ago)
Fixed up POM with wayback dependency. (almost 14 years ago)
Merge branch 'master' into anj-experiments (almost 14 years ago)
Forcing sync. (almost 14 years ago)
Forcing sync. (almost 14 years ago)
Syncing up minor tweaks to build etc. (almost 14 years ago)
Syncing up minor tweaks to build etc. (almost 14 years ago)
FileInputFormat for parsing (W)ARC files, returning CDX lines. NB. Uses Hadoop's non-deprecated classes, hence the new packages. (almost 14 years ago)
Implemented QueueingHttpSolrServer. (almost 14 years ago)
Removed Properties parsing: JobConf should now be set in driver class. (almost 14 years ago)
Added notes on Zip reading. (almost 14 years ago)
Added notes on Zip reading. (almost 14 years ago)
Some tweaks and experimentation. (almost 14 years ago)
Some tweaks and experimentation. (almost 14 years ago)
Various fixes related to direct submission to SolrServer. (almost 14 years ago)
Added apache-solrj dependency. (almost 14 years ago)