Package opennlp.tools.util.featuregen
Class BrownCluster
java.lang.Object
opennlp.tools.util.featuregen.BrownCluster
- All Implemented Interfaces:
SerializableArtifact
Loads a Brown cluster document whose lines have the tab-separated form: word\tword_class\tprob
http://metaoptimize.com/projects/wordreprs/
The file containing the clustering lexicon has to be passed as the
value of the dict attribute of each BrownCluster feature generator.
-
Nested Class Summary
Nested Classes -
Constructor Summary
Constructors
Constructor, Description:
BrownCluster(InputStream in) - Generates the token-to-cluster map from a Brown cluster input file.
-
Method Summary
Modifier and Type, Method, Description:
Class<?> getArtifactSerializerClass() - Retrieves the class which can serialize and recreate this artifact.
String lookupToken(String string) - Checks whether a token is in the Brown token-to-cluster map.
void serialize(OutputStream out)
-
Constructor Details
-
BrownCluster
Generates the token-to-cluster map from a Brown cluster input file. Note: only tokens with a frequency greater than 5 are added.
- Parameters:
in - the input stream to read the Brown cluster file from
- Throws:
IOException - if an I/O error occurs
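The loading rule described above can be illustrated with a small standalone sketch. It is not the OpenNLP implementation: the class name `ClusterLexiconSketch` and its `load` method are hypothetical, the column order follows the class description (word, word class, third column), and the third column is assumed to hold an integer frequency. Lines that do not match this shape are skipped, which is also an assumption. Check the column order of your own cluster file before relying on it.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for illustration only, NOT the OpenNLP class.
final class ClusterLexiconSketch {

    // Threshold taken from the note above: keep tokens with frequency > 5.
    static final int MIN_FREQUENCY_EXCLUSIVE = 5;

    // Reads "word<TAB>word_class<TAB>frequency" lines into a token -> class map.
    // The column order and the integer third column are assumptions.
    static Map<String, String> load(InputStream in) throws IOException {
        Map<String, String> tokenToCluster = new HashMap<>();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                String[] cols = line.split("\t");
                if (cols.length != 3) {
                    continue; // assumption: ignore lines without three columns
                }
                int frequency;
                try {
                    frequency = Integer.parseInt(cols[2].trim());
                } catch (NumberFormatException e) {
                    continue; // assumption: ignore lines with a non-integer third column
                }
                if (frequency > MIN_FREQUENCY_EXCLUSIVE) {
                    tokenToCluster.put(cols[0], cols[1]);
                }
            }
        }
        return tokenToCluster;
    }

    public static void main(String[] args) throws IOException {
        String sample = "house\t0101\t12\nrare\t0110\t3\n";
        Map<String, String> map = load(
                new ByteArrayInputStream(sample.getBytes(StandardCharsets.UTF_8)));
        System.out.println(map.get("house")); // prints 0101
        System.out.println(map.get("rare"));  // prints null (frequency 3 is filtered out)
    }
}
```

In this sketch a filtered or unknown token simply has no entry in the map, so a lookup returns null for it.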
-
-
Method Details
-
lookupToken
Checks whether a token is in the Brown token-to-cluster map.
- Parameters:
string - the token to look up
- Returns:
the Brown class of the token, if the token is in the Brown cluster map
-
serialize
- Throws:
IOException
-
getArtifactSerializerClass
Description copied from interface: SerializableArtifact
Retrieves the class which can serialize and recreate this artifact.
Note: The serializer class must have a public zero-argument constructor; otherwise an exception is thrown during model serialization or loading.
- Specified by:
getArtifactSerializerClass in interface SerializableArtifact
- Returns:
the corresponding ArtifactSerializer class.
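The zero-argument constructor requirement is typical of code that creates objects from a `Class<?>` token. This page does not show how OpenNLP instantiates the serializer, so the following is only a sketch under the assumption that standard Java reflection is involved. The classes `GoodSerializer` and `BadSerializer` and the helper `canInstantiate` are hypothetical names made up for illustration.

```java
import java.lang.reflect.Constructor;

// Hypothetical classes for illustration only; none of them are part of OpenNLP.
final class ZeroArgConstructorSketch {

    // Has a public zero-argument constructor, so reflection can create it.
    public static class GoodSerializer {
        public GoodSerializer() { }
    }

    // Only has a constructor with a parameter, so the zero-argument lookup fails.
    public static class BadSerializer {
        public BadSerializer(String required) { }
    }

    // Returns true if the type can be created through a public zero-argument constructor.
    static boolean canInstantiate(Class<?> type) {
        try {
            Constructor<?> ctor = type.getConstructor(); // finds public constructors only
            ctor.newInstance();
            return true;
        } catch (ReflectiveOperationException e) {
            // Covers NoSuchMethodException, InstantiationException,
            // IllegalAccessException and InvocationTargetException.
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(canInstantiate(GoodSerializer.class)); // prints true
        System.out.println(canInstantiate(BadSerializer.class));  // prints false
    }
}
```

A class returned from getArtifactSerializerClass that resembles `BadSerializer` would be expected to trigger the exception mentioned in the note.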
-