public class TrecWebDocumentInputFormatOld extends IndexableFileInputFormatOld<org.apache.hadoop.io.LongWritable,TrecWebDocument>
InputFormat for processing the TREC collection.| Modifier and Type | Class and Description |
|---|---|
static class |
TrecWebDocumentInputFormatOld.TrecWebRecordReader
Hadoop
RecordReader for reading TREC-formatted documents. |
| Constructor and Description |
|---|
TrecWebDocumentInputFormatOld() |
| Modifier and Type | Method and Description |
|---|---|
org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,TrecWebDocument> |
getRecordReader(org.apache.hadoop.mapred.InputSplit inputSplit,
org.apache.hadoop.mapred.JobConf conf,
org.apache.hadoop.mapred.Reporter reporter)
Returns a
RecordReader for this InputFormat. |
protected boolean |
isSplitable(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.fs.Path filename) |
protected boolean isSplitable(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.fs.Path filename)
isSplitable in class org.apache.hadoop.mapred.FileInputFormat<org.apache.hadoop.io.LongWritable,TrecWebDocument>public org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,TrecWebDocument> getRecordReader(org.apache.hadoop.mapred.InputSplit inputSplit, org.apache.hadoop.mapred.JobConf conf, org.apache.hadoop.mapred.Reporter reporter) throws IOException
RecordReader for this InputFormat.getRecordReader in interface org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.LongWritable,TrecWebDocument>getRecordReader in class org.apache.hadoop.mapred.FileInputFormat<org.apache.hadoop.io.LongWritable,TrecWebDocument>IOExceptionCopyright © 2015. All rights reserved.