| Package | Description |
|---|---|
| edu.umd.cloud9.collection.aquaint2 |
Provides classes for working with the AQUAINT-2 collection.
|
| edu.umd.cloud9.collection.medline |
Provides classes for working with MEDLINE citations in XML format
(particularly, for the TREC 2004-5 genomics tracks).
|
| edu.umd.cloud9.collection.trec |
Provides classes for working with the TREC collection (particularly
disks 4 and 5).
|
| edu.umd.cloud9.collection.trecweb |
Provides classes for working with the GOV2 collection.
|
| edu.umd.cloud9.collection.wikipedia |
Provides classes for working with Wikipedia XML dumps.
|
| Modifier and Type | Class and Description |
|---|---|
class |
Aquaint2DocumentInputFormat |
| Modifier and Type | Class and Description |
|---|---|
class |
MedlineCitationInputFormat
Hadoop
InputFormat for processing the MEDLINE citations in XML format (new API). |
| Modifier and Type | Class and Description |
|---|---|
class |
TrecDocumentInputFormat |
| Modifier and Type | Class and Description |
|---|---|
class |
TrecWebDocumentInputFormat |
| Modifier and Type | Class and Description |
|---|---|
class |
WikipediaPageInputFormat
Hadoop
InputFormat for processing Wikipedia pages from the XML dumps. |
Copyright © 2015. All rights reserved.