| Class | Description |
|---|---|
| Gov2DocnoMapping | |
| RepackTrecWebCollection | Tool to repack TREC web collections (wt10g, gov2) into SequenceFiles. |
| TrecWebDocnoMappingBuilder | Tool that builds the mapping from docids (String identifiers) to docnos (sequentially-numbered ints) for TREC web collections (wt10g, gov2). |
| TrecWebDocument | |
| TrecWebDocumentInputFormat | |
| TrecWebDocumentInputFormat.TrecWebDocumentRecordReader | |
| TrecWebDocumentInputFormatOld | Hadoop InputFormat for processing the TREC collection. |
| TrecWebDocumentInputFormatOld.TrecWebRecordReader | Hadoop RecordReader for reading TREC-formatted documents. |
| Wt10gDocnoMapping | |

| Enum | Description |
|---|---|
| TrecWebDocnoMappingBuilder.Documents | |

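
The TrecWebDocnoMappingBuilder entry describes a mapping from docids (String identifiers) to docnos (sequentially-numbered ints). The following is a minimal sketch of that idea only, not the code in this package: the class name `DocnoMappingSketch`, its methods, the choice to number from 1 in sorted order, and the sample docids are all assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.Collection;
import java.util.TreeSet;

// Hypothetical illustration; not a class from this package.
final class DocnoMappingSketch {
  // Sorted, de-duplicated docids; index i holds the docid for docno i + 1.
  private final String[] docids;

  DocnoMappingSketch(Collection<String> rawDocids) {
    // TreeSet sorts the identifiers lexicographically and drops duplicates.
    this.docids = new TreeSet<String>(rawDocids).toArray(new String[0]);
  }

  // Returns the 1-based docno (an assumed convention), or -1 if the docid is unknown.
  int getDocno(String docid) {
    int i = Arrays.binarySearch(docids, docid);
    return i < 0 ? -1 : i + 1;
  }

  // Returns the docid for a 1-based docno, or null if the docno is out of range.
  String getDocid(int docno) {
    return (docno < 1 || docno > docids.length) ? null : docids[docno - 1];
  }

  // Number of distinct docids in the mapping.
  int size() {
    return docids.length;
  }

  public static void main(String[] args) {
    // Placeholder identifiers, made up for this example.
    DocnoMappingSketch mapping = new DocnoMappingSketch(
        Arrays.asList("WTX001-B01-2", "WTX001-B01-1", "WTX001-B01-3"));
    System.out.println(mapping.getDocno("WTX001-B01-2"));
    System.out.println(mapping.getDocid(1));
  }
}
```

Keeping the docids in a sorted array means the forward lookup is a binary search and the reverse lookup is a plain array index, so no second map is needed.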
Copyright © 2015. All rights reserved.