Package org.apache.lucene.analysis.ru
Class RussianLetterTokenizer
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.Tokenizer
org.apache.lucene.analysis.util.CharTokenizer
org.apache.lucene.analysis.ru.RussianLetterTokenizer
- All Implemented Interfaces:
Closeable,AutoCloseable
Deprecated.
A RussianLetterTokenizer is a
Tokenizer that extends LetterTokenizer
by also allowing the basic Latin digits 0-9.
You must specify the required Version compatibility when creating
RussianLetterTokenizer:
- As of 3.1,
CharTokenizeruses an int based API to normalize and detect token characters. SeeCharTokenizer.isTokenChar(int)andCharTokenizer.normalize(int)for details.
-
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
AttributeSource.AttributeFactory, AttributeSource.State -
Constructor Summary
ConstructorsConstructorDescriptionRussianLetterTokenizer(Version matchVersion, Reader in) Deprecated.Construct a new RussianLetterTokenizer.RussianLetterTokenizer(Version matchVersion, AttributeSource.AttributeFactory factory, Reader in) Deprecated.Construct a new RussianLetterTokenizer using a givenAttributeSource.AttributeFactory. -
Method Summary
Methods inherited from class org.apache.lucene.analysis.util.CharTokenizer
end, incrementToken, resetMethods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
-
Constructor Details
-
RussianLetterTokenizer
Deprecated.Construct a new RussianLetterTokenizer. * @param matchVersion Lucene version to match See}invalid @link
{@link <a href="#version">above</a>- Parameters:
in- the input to split up into tokens
-
RussianLetterTokenizer
public RussianLetterTokenizer(Version matchVersion, AttributeSource.AttributeFactory factory, Reader in) Deprecated.Construct a new RussianLetterTokenizer using a givenAttributeSource.AttributeFactory. * @param matchVersion Lucene version to match See}invalid @link
{@link <a href="#version">above</a>- Parameters:
factory- the attribute factory to use for thisTokenizerin- the input to split up into tokens
-
StandardTokenizerinstead, which has the same functionality. This filter will be removed in Lucene 5.0