Class CompoundWordTokenFilterBase.AbstractBuilder<BuilderT extends CompoundWordTokenFilterBase.AbstractBuilder<BuilderT>>
java.lang.Object
co.elastic.clients.util.ObjectBuilderBase
co.elastic.clients.util.WithJsonObjectBuilderBase<BuilderT>
co.elastic.clients.elasticsearch._types.analysis.TokenFilterBase.AbstractBuilder<BuilderT>
co.elastic.clients.elasticsearch._types.analysis.CompoundWordTokenFilterBase.AbstractBuilder<BuilderT>
- All Implemented Interfaces:
WithJson<BuilderT>
- Direct Known Subclasses:
DictionaryDecompounderTokenFilter.Builder,HyphenationDecompounderTokenFilter.Builder
- Enclosing class:
- CompoundWordTokenFilterBase
public abstract static class CompoundWordTokenFilterBase.AbstractBuilder<BuilderT extends CompoundWordTokenFilterBase.AbstractBuilder<BuilderT>>
extends TokenFilterBase.AbstractBuilder<BuilderT>
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionfinal BuilderTmaxSubwordSize(Integer value) Maximum subword character length.final BuilderTminSubwordSize(Integer value) Minimum subword character length.final BuilderTminWordSize(Integer value) Minimum word character length.final BuilderTonlyLongestMatch(Boolean value) Iftrue, only include the longest matching subword.final BuilderTA list of subwords to look for in the token stream.final BuilderTA list of subwords to look for in the token stream.final BuilderTwordListPath(String value) Path to a file that contains a list of subwords to find in the token stream.Methods inherited from class co.elastic.clients.elasticsearch._types.analysis.TokenFilterBase.AbstractBuilder
self, versionMethods inherited from class co.elastic.clients.util.WithJsonObjectBuilderBase
withJsonMethods inherited from class co.elastic.clients.util.ObjectBuilderBase
_checkSingleUse, _listAdd, _listAddAll, _mapPut, _mapPutAll
-
Constructor Details
-
AbstractBuilder
public AbstractBuilder()
-
-
Method Details
-
maxSubwordSize
Maximum subword character length. Longer subword tokens are excluded from the output. Defaults to15.API name:
max_subword_size -
minSubwordSize
Minimum subword character length. Shorter subword tokens are excluded from the output. Defaults to2.API name:
min_subword_size -
minWordSize
Minimum word character length. Shorter word tokens are excluded from the output. Defaults to5.API name:
min_word_size -
onlyLongestMatch
Iftrue, only include the longest matching subword. Defaults tofalse.API name:
only_longest_match -
wordList
A list of subwords to look for in the token stream. If found, the subword is included in the token output. Either this parameter orword_list_pathmust be specified.API name:
word_listAdds all elements of
listtowordList. -
wordList
A list of subwords to look for in the token stream. If found, the subword is included in the token output. Either this parameter orword_list_pathmust be specified.API name:
word_listAdds one or more values to
wordList. -
wordListPath
Path to a file that contains a list of subwords to find in the token stream. If found, the subword is included in the token output. This path must be absolute or relative to the config location, and the file must be UTF-8 encoded. Each token in the file must be separated by a line break. Either this parameter orword_listmust be specified.API name:
word_list_path
-