This project contains some Hadoop code for working with the TREC Knowledge Base Acceleration dataset. In particular, it provides classes to read/write topic files, read/write run files, and expose the documents in the Thrift files as Hadoop-readable objects.
30 Day SummaryOct 3 2024 — Nov 2 2024
|
12 Month SummaryNov 2 2023 — Nov 2 2024
|