Class AnalyzerNames
- java.lang.Object
-
- org.hibernate.search.engine.backend.analysis.AnalyzerNames
-
public final class AnalyzerNames extends Object
Constants for the names of built-in analyzers.
-
-
Field Summary
Fields Modifier and Type Field Description static StringDEFAULTThe default analyzer.static StringKEYWORDThe keyword analyzer.static StringSIMPLEThe simple analyzer.static StringSTANDARDThe standard analyzer.static StringSTOPThe stop analyzer.static StringWHITESPACEThe whitespace analyzer.
-
-
-
Field Detail
-
DEFAULT
public static final String DEFAULT
The default analyzer.This analyzer will generally be used for full-text field that don't require specific analysis.
Unless overridden by explicit analysis configuration, the default analyzer will be the standard analyzer:
- First, tokenize using the standard tokenizer, which follows Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
- Then, lowercase each token.
- See Also:
- Constant Field Values
-
STANDARD
public static final String STANDARD
The standard analyzer.Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize using the standard tokenizer, which follows Word Break rules from the Unicode Text Segmentation algorithm, as specified in Unicode Standard Annex #29.
- Then, lowercase each token.
- See Also:
- Constant Field Values
-
SIMPLE
public static final String SIMPLE
The simple analyzer.Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming non-letter characters are separators.
- Then, lowercase each token.
- See Also:
- Constant Field Values
-
WHITESPACE
public static final String WHITESPACE
The whitespace analyzer.Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming whitespace characters are separators.
- Do not change the tokens.
- See Also:
- Constant Field Values
-
STOP
public static final String STOP
The stop analyzer.Unless overridden by explicit analysis configuration, this analyzer behaves as follows:
- First, tokenize by assuming non-letter characters are separators.
- Then, lowercase each token.
- finally, remove english stop words.
- See Also:
- Constant Field Values
-
KEYWORD
public static final String KEYWORD
The keyword analyzer.Unless overridden by explicit analysis configuration, this analyzer does not change the text in any way.
With this analyzer, a full text field would behave similarly to a keyword field, but with fewer features: no terms aggregations, for example.
Consider using a keyword field instead.
- See Also:
- Constant Field Values
-
-