public class Wordlist
extends java.lang.Object
Wordlists
for details of available files. The constructors allow the specification of
lower case and that words contain a minimum number of characters. The latter
is supports some applications where short tokens can lead to noisy results.| Constructor and Description |
|---|
Wordlist(Wordlists wordListName)
Creates a list of 'words' from the specified source file.
|
Wordlist(Wordlists wordListName,
boolean normalised)
Creates a list of 'words' from the specified source file and allows
the words to be normalised to lower case.
|
Wordlist(Wordlists wordListName,
boolean normalised,
int minimumLength)
Creates a list of 'words' from the specified source file, that can be
normalised to lower case, and consists of at least a minimum number of
characters.
|
| Modifier and Type | Method and Description |
|---|---|
java.util.List<java.lang.String> |
list()
Returns the list of words.
|
public Wordlist(Wordlists wordListName)
wordListName - a constant from the Wordlists enumerationpublic Wordlist(Wordlists wordListName, boolean normalised)
wordListName - a constant from the Wordlists enumerationnormalised - a boolean that indicates whether words in the
recovered list should be normalised to lower case.public Wordlist(Wordlists wordListName, boolean normalised, int minimumLength)
wordListName - a constant from the Wordlists enumerationnormalised - a boolean that indicates whether words in the
recovered list should be normalised to lower case.minimumLength - the length of the shortest token to include in the
wordlist. 1 loads every token, and is the default for the other constructors.