| Package | Description |
|---|---|
| edu.umd.cloud9.collection.wikipedia |
Provides classes for working with Wikipedia XML dumps.
|
| edu.umd.cloud9.collection.wikipedia.language |
Provides language dependent classes for working with Wikipedia XML dumps.
|
| Modifier and Type | Method and Description |
|---|---|
WikipediaPage |
WikipediaPageInputFormatOld.WikipediaPageRecordReader.createValue()
Creates an object for the value.
|
WikipediaPage |
WikipediaPageInputFormat.WikipediaPageRecordReader.getCurrentValue() |
WikipediaPage |
WikipediaForwardIndex.getDocument(int docno) |
WikipediaPage |
WikipediaForwardIndex.getDocument(String docid) |
| Modifier and Type | Method and Description |
|---|---|
org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,WikipediaPage> |
WikipediaPageInputFormat.createRecordReader(org.apache.hadoop.mapreduce.InputSplit split,
org.apache.hadoop.mapreduce.TaskAttemptContext context) |
org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,WikipediaPage> |
WikipediaPageInputFormatOld.getRecordReader(org.apache.hadoop.mapred.InputSplit inputSplit,
org.apache.hadoop.mapred.JobConf conf,
org.apache.hadoop.mapred.Reporter reporter)
Returns a
RecordReader for this InputFormat. |
| Modifier and Type | Method and Description |
|---|---|
boolean |
WikipediaPageInputFormatOld.WikipediaPageRecordReader.next(org.apache.hadoop.io.LongWritable key,
WikipediaPage value)
Reads the next key-value pair.
|
boolean |
WikipediaPagesBz2InputStream.readNext(WikipediaPage page)
Reads the next Wikipedia page.
|
static void |
WikipediaPage.readPage(WikipediaPage page,
String s)
Reads a raw XML string into a
WikipediaPage object. |
| Modifier and Type | Class and Description |
|---|---|
class |
ArabicWikipediaPage
An Arabic page from Wikipedia.
|
class |
ChineseWikipediaPage
An Chinese page from Wikipedia.
|
class |
CzechWikipediaPage
An Czech page from Wikipedia.
|
class |
EnglishWikipediaPage
An English page from Wikipedia.
|
class |
GermanWikipediaPage
An German page from Wikipedia.
|
class |
SpanishWikipediaPage
An Spanish page from Wikipedia.
|
class |
SwedishWikipediaPage
A Swedish page from Wikipedia.
|
class |
TurkishWikipediaPage
An Turkish page from Wikipedia.
|
| Modifier and Type | Method and Description |
|---|---|
static WikipediaPage |
WikipediaPageFactory.createWikipediaPage(String language)
Returns a
WikipediaPage for this language. |
| Modifier and Type | Method and Description |
|---|---|
static Class<? extends WikipediaPage> |
WikipediaPageFactory.getWikipediaPageClass(String language) |
Copyright © 2015. All rights reserved.