Class RussianLetterTokenizer

All Implemented Interfaces:
Closeable, AutoCloseable

@Deprecated public class RussianLetterTokenizer extends CharTokenizer
Deprecated.
(3.1) Use StandardTokenizer instead, which has the same functionality. This filter will be removed in Lucene 5.0
A RussianLetterTokenizer is a Tokenizer that extends LetterTokenizer by also allowing the basic Latin digits 0-9.

You must specify the required Version compatibility when creating RussianLetterTokenizer:

  • As of 3.1, CharTokenizer uses an int based API to normalize and detect token characters. See CharTokenizer.isTokenChar(int) and CharTokenizer.normalize(int) for details.
  • Constructor Details

    • RussianLetterTokenizer

      public RussianLetterTokenizer(Version matchVersion, Reader in)
      Deprecated.
      Construct a new RussianLetterTokenizer. * @param matchVersion Lucene version to match See
      invalid @link
      {@link <a href="#version">above</a>
      }
      Parameters:
      in - the input to split up into tokens
    • RussianLetterTokenizer

      public RussianLetterTokenizer(Version matchVersion, AttributeSource.AttributeFactory factory, Reader in)
      Deprecated.
      Construct a new RussianLetterTokenizer using a given AttributeSource.AttributeFactory. * @param matchVersion Lucene version to match See
      invalid @link
      {@link <a href="#version">above</a>
      }
      Parameters:
      factory - the attribute factory to use for this Tokenizer
      in - the input to split up into tokens