public class ClueWarcInputFormat extends org.apache.hadoop.mapred.FileInputFormat<org.apache.hadoop.io.LongWritable,ClueWarcRecord>
| Modifier and Type | Class and Description |
|---|---|
static class |
ClueWarcInputFormat.ClueWarcRecordReader |
| Constructor and Description |
|---|
ClueWarcInputFormat() |
| Modifier and Type | Method and Description |
|---|---|
org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,ClueWarcRecord> |
getRecordReader(org.apache.hadoop.mapred.InputSplit split,
org.apache.hadoop.mapred.JobConf conf,
org.apache.hadoop.mapred.Reporter reporter)
Just return the record reader
|
protected boolean |
isSplitable(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.fs.Path filename)
Don't allow the files to be split!
|
protected boolean isSplitable(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.fs.Path filename)
isSplitable in class org.apache.hadoop.mapred.FileInputFormat<org.apache.hadoop.io.LongWritable,ClueWarcRecord>public org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,ClueWarcRecord> getRecordReader(org.apache.hadoop.mapred.InputSplit split, org.apache.hadoop.mapred.JobConf conf, org.apache.hadoop.mapred.Reporter reporter) throws IOException
getRecordReader in interface org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.LongWritable,ClueWarcRecord>getRecordReader in class org.apache.hadoop.mapred.FileInputFormat<org.apache.hadoop.io.LongWritable,ClueWarcRecord>IOExceptionCopyright © 2015. All rights reserved.