Class HunspellStemmer
java.lang.Object
org.apache.lucene.analysis.hunspell.HunspellStemmer
HunspellStemmer uses the affix rules declared in the HunspellDictionary to generate one or more stems for a word. It
conforms to the algorithm in the original hunspell algorithm, including recursive suffix stripping.
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic classStem represents all information known about a stem of a word. -
Constructor Summary
ConstructorsConstructorDescriptionHunspellStemmer(HunspellDictionary dictionary) Constructs a new HunspellStemmer which will use the provided HunspellDictionary to create its stems.HunspellStemmer(HunspellDictionary dictionary, int recursionCap) Constructs a new HunspellStemmer which will use the provided HunspellDictionary to create its stems -
Method Summary
Modifier and TypeMethodDescriptionapplyAffix(char[] strippedWord, int length, HunspellAffix affix, int recursionDepth) Applies the affix rule to the given word, producing a list of stems if any are foundstem(char[] word, int length) Find the stem(s) of the provided wordFind the stem(s) of the provided worduniqueStems(char[] word, int length) Find the unique stem(s) of the provided word
-
Constructor Details
-
HunspellStemmer
Constructs a new HunspellStemmer which will use the provided HunspellDictionary to create its stems. Uses the default recursion cap of2(based on Hunspell documentation).- Parameters:
dictionary- HunspellDictionary that will be used to create the stems
-
HunspellStemmer
Constructs a new HunspellStemmer which will use the provided HunspellDictionary to create its stems- Parameters:
dictionary- HunspellDictionary that will be used to create the stemsrecursionCap- maximum level of recursion stemmer can go into
-
-
Method Details
-
stem
Find the stem(s) of the provided word- Parameters:
word- Word to find the stems for- Returns:
- List of stems for the word
-
stem
Find the stem(s) of the provided word- Parameters:
word- Word to find the stems for- Returns:
- List of stems for the word
-
uniqueStems
Find the unique stem(s) of the provided word- Parameters:
word- Word to find the stems for- Returns:
- List of stems for the word
-
applyAffix
public List<HunspellStemmer.Stem> applyAffix(char[] strippedWord, int length, HunspellAffix affix, int recursionDepth) Applies the affix rule to the given word, producing a list of stems if any are found- Parameters:
strippedWord- Word the affix has been removed and the strip addedaffix- HunspellAffix representing the affix rule itselfrecursionDepth- Level of recursion this stemming step is at- Returns:
- List of stems for the word, or an empty list if none are found
-