| Analyzer |
An Analyzer builds TokenStreams, which analyze text.
|
| Analyzer.ReuseStrategy |
|
| Analyzer.TokenStreamComponents |
This class encapsulates the outer components of a token stream.
|
| AnalyzerWrapper |
Extension to Analyzer suitable for Analyzers which wrap
other Analyzers.
|
| CachingTokenFilter |
This class can be used if the token attributes of a TokenStream
are intended to be consumed more than once.
|
| CharacterUtils |
Utility class to write tokenizers or token filters.
|
| CharacterUtils.CharacterBuffer |
|
| CharArrayMap<V> |
A simple class that stores key Strings as char[]'s in a
hash table.
|
| CharArraySet |
A simple class that stores Strings as char[]'s in a
hash table.
|
| CharFilter |
Subclasses of CharFilter can be chained to filter a Reader
They can be used as Reader with additional offset
correction.
|
| DelegatingAnalyzerWrapper |
An analyzer wrapper, that doesn't allow to wrap components or readers.
|
| FilteringTokenFilter |
Abstract base class for TokenFilters that may remove tokens.
|
| GraphTokenFilter |
|
| LowerCaseFilter |
Normalizes token text to lower case.
|
| StopFilter |
Removes stop words from a token stream.
|
| StopwordAnalyzerBase |
Base class for Analyzers that need to make use of stopword sets.
|
| TokenFilter |
A TokenFilter is a TokenStream whose input is another TokenStream.
|
| Tokenizer |
A Tokenizer is a TokenStream whose input is a Reader.
|
| TokenStream |
A TokenStream enumerates the sequence of tokens, either from
Fields of a Document or from query text.
|
| TokenStreamToAutomaton |
Consumes a TokenStream and creates an Automaton
where the transition labels are UTF8 bytes (or Unicode
code points if unicodeArcs is true) from the TermToBytesRefAttribute.
|
| WordlistLoader |
Loader for text files that represent a list of stopwords.
|