public class WikipediaPageInputFormatOld extends IndexableFileInputFormatOld<org.apache.hadoop.io.LongWritable,WikipediaPage>
InputFormat for processing Wikipedia pages from the XML dumps.

| Modifier and Type | Class and Description |
|---|---|
| static class | WikipediaPageInputFormatOld.WikipediaPageRecordReader: Hadoop RecordReader for reading Wikipedia pages from the XML dumps. |

| Constructor and Description |
|---|
| WikipediaPageInputFormatOld() |

| Modifier and Type | Method and Description |
|---|---|
| org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,WikipediaPage> | getRecordReader(org.apache.hadoop.mapred.InputSplit inputSplit, org.apache.hadoop.mapred.JobConf conf, org.apache.hadoop.mapred.Reporter reporter): Returns a RecordReader for this InputFormat. |
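
To make the idea of "one record per Wikipedia page" concrete, the following is a minimal, library-free sketch of how a reader might carve an XML dump into page records keyed by a long offset. It is not the implementation of WikipediaPageRecordReader: the class name PageSplitSketch, the method readPages, and the choice of a character offset as the key are all assumptions made for illustration, and this page does not state what the real reader uses as its LongWritable key.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not the library's WikipediaPageRecordReader.
class PageSplitSketch {
    static final String START = "<page>";
    static final String END = "</page>";

    // Returns (character offset of the opening <page> tag, raw page XML) pairs
    // in document order. The offset key is an assumption for this sketch.
    static List<Map.Entry<Long, String>> readPages(String xml) {
        List<Map.Entry<Long, String>> records = new ArrayList<Map.Entry<Long, String>>();
        int pos = 0;
        while (true) {
            int start = xml.indexOf(START, pos);
            if (start < 0) {
                break; // no further pages
            }
            int end = xml.indexOf(END, start + START.length());
            if (end < 0) {
                break; // truncated page: stop rather than emit a partial record
            }
            int stop = end + END.length();
            records.add(new SimpleEntry<Long, String>((long) start, xml.substring(start, stop)));
            pos = stop;
        }
        return records;
    }

    public static void main(String[] args) {
        String dump = "<mediawiki><page><title>A</title></page>\n"
                + "<page><title>B</title></page></mediawiki>";
        for (Map.Entry<Long, String> record : readPages(dump)) {
            System.out.println(record.getKey() + "\t" + record.getValue());
        }
    }
}
```

In an actual job, the reader returned by getRecordReader would play the role of readPages, handing each (key, page) pair to the mapper one at a time instead of building a list in memory.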
public org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,WikipediaPage> getRecordReader(org.apache.hadoop.mapred.InputSplit inputSplit, org.apache.hadoop.mapred.JobConf conf, org.apache.hadoop.mapred.Reporter reporter) throws IOException
Returns a RecordReader for this InputFormat.

Specified by: getRecordReader in interface org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.LongWritable,WikipediaPage>

Specified by: getRecordReader in class org.apache.hadoop.mapred.FileInputFormat<org.apache.hadoop.io.LongWritable,WikipediaPage>

Throws: IOException

Copyright © 2015. All rights reserved.