Class CompoundWordTokenFilterBase.AbstractBuilder<BuilderT extends CompoundWordTokenFilterBase.AbstractBuilder<BuilderT>>

java.lang.Object
  co.elastic.clients.util.ObjectBuilderBase
    co.elastic.clients.util.WithJsonObjectBuilderBase<BuilderT>
      co.elastic.clients.elasticsearch._types.analysis.TokenFilterBase.AbstractBuilder<BuilderT>
        co.elastic.clients.elasticsearch._types.analysis.CompoundWordTokenFilterBase.AbstractBuilder<BuilderT>
All Implemented Interfaces:
WithJson<BuilderT>
Direct Known Subclasses:
DictionaryDecompounderTokenFilter.Builder, HyphenationDecompounderTokenFilter.Builder
Enclosing class:
CompoundWordTokenFilterBase

public abstract static class CompoundWordTokenFilterBase.AbstractBuilder<BuilderT extends CompoundWordTokenFilterBase.AbstractBuilder<BuilderT>> extends TokenFilterBase.AbstractBuilder<BuilderT>
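
The self-referencing type parameter in the declaration above lets setters declared on the abstract builder return the concrete builder type, so calls can be chained with subclass-only setters. The following is a minimal sketch of that pattern, not the client's actual source: the class names, the `label` property, and the `self()` hook are hypothetical stand-ins chosen for illustration.

```java
// Hypothetical stand-in for an abstract builder; only the generic shape is modelled.
abstract class BaseBuilderSketch<BuilderT extends BaseBuilderSketch<BuilderT>> {
    protected Integer minWordSize;

    // Assumed hook: each concrete builder returns itself with its own type.
    protected abstract BuilderT self();

    // Declared once on the base class, yet typed to return the concrete builder.
    public final BuilderT minWordSize(Integer value) {
        this.minWordSize = value;
        return self();
    }
}

// Hypothetical concrete builder with one property of its own.
final class DictionaryBuilderSketch extends BaseBuilderSketch<DictionaryBuilderSketch> {
    private String label;

    @Override
    protected DictionaryBuilderSketch self() {
        return this;
    }

    public DictionaryBuilderSketch label(String value) {
        this.label = value;
        return this;
    }

    public String describe() {
        return label + ":" + minWordSize;
    }
}

class SelfTypedBuilderDemo {
    static String demo() {
        // minWordSize(...) returns DictionaryBuilderSketch, so label(...) still compiles.
        return new DictionaryBuilderSketch().minWordSize(5).label("dict").describe();
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints "dict:5"
    }
}
```

Without the recursive bound, `minWordSize(...)` would have to return the base type and the chained `label(...)` call would not compile.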
  • Constructor Details

    • AbstractBuilder

      public AbstractBuilder()
  • Method Details

    • maxSubwordSize

      public final BuilderT maxSubwordSize(@Nullable Integer value)
      Maximum subword character length. Longer subword tokens are excluded from the output. Defaults to 15.

      API name: max_subword_size

    • minSubwordSize

      public final BuilderT minSubwordSize(@Nullable Integer value)
      Minimum subword character length. Shorter subword tokens are excluded from the output. Defaults to 2.

      API name: min_subword_size

    • minWordSize

      public final BuilderT minWordSize(@Nullable Integer value)
      Minimum word character length. Shorter word tokens are excluded from the output. Defaults to 5.

      API name: min_word_size

    • onlyLongestMatch

      public final BuilderT onlyLongestMatch(@Nullable Boolean value)
      If true, only include the longest matching subword. Defaults to false.

      API name: only_longest_match
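
To make the interaction of the subword length bounds and the longest-match flag concrete, here is a simplified, self-contained model of a dictionary lookup over one token. It is a sketch under stated assumptions, not the analyzer's actual implementation: the class and method names are hypothetical, matching is assumed to be case-insensitive, "longest" is taken per start position, and min_word_size is not modelled.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

class SubwordSelectionSketch {
    // Returns dictionary entries found inside the token, honouring the length bounds.
    static List<String> subwords(String token, Set<String> dictionary,
                                 int minSubwordSize, int maxSubwordSize,
                                 boolean onlyLongestMatch) {
        String lower = token.toLowerCase(Locale.ROOT); // assumption: case-insensitive lookup
        List<String> out = new ArrayList<>();
        for (int start = 0; start + minSubwordSize <= lower.length(); start++) {
            String longest = null;
            for (int len = minSubwordSize;
                 len <= maxSubwordSize && start + len <= lower.length();
                 len++) {
                String candidate = lower.substring(start, start + len);
                if (!dictionary.contains(candidate)) {
                    continue;
                }
                if (onlyLongestMatch) {
                    longest = candidate; // len only grows, so the last hit is the longest
                } else {
                    out.add(candidate);
                }
            }
            if (longest != null) {
                out.add(longest);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> dict = new HashSet<>(Arrays.asList("dampf", "schiff", "dampfschiff"));
        System.out.println(subwords("Dampfschiff", dict, 2, 15, false)); // [dampf, dampfschiff, schiff]
        System.out.println(subwords("Dampfschiff", dict, 2, 15, true));  // [dampfschiff, schiff]
        System.out.println(subwords("Dampfschiff", dict, 2, 6, false));  // [dampf, schiff]
    }
}
```

In this model, lowering the maximum to 6 drops the 11-character entry, and enabling the longest-match flag drops the shorter entry that starts at the same position.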

    • wordList

      public final BuilderT wordList(List<String> list)
      A list of subwords to look for in the token stream. If found, the subword is included in the token output. Either this parameter or word_list_path must be specified.

      API name: word_list

      Adds all elements of list to wordList.

    • wordList

      public final BuilderT wordList(String value, String... values)
      A list of subwords to look for in the token stream. If found, the subword is included in the token output. Either this parameter or word_list_path must be specified.

      API name: word_list

      Adds one or more values to wordList.
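
Both wordList overloads are described as adding to the list rather than replacing it, so repeated calls accumulate. The sketch below models only that additive behaviour with a hypothetical `WordListSketch` class; it is not the client's builder and makes no claim about how the real one stores its values.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in that mirrors the two overload shapes.
class WordListSketch {
    private final List<String> wordList = new ArrayList<>();

    // Adds all elements of list to wordList.
    WordListSketch wordList(List<String> list) {
        wordList.addAll(list);
        return this;
    }

    // Adds one or more values to wordList.
    WordListSketch wordList(String value, String... values) {
        wordList.add(value);
        wordList.addAll(Arrays.asList(values));
        return this;
    }

    List<String> build() {
        return new ArrayList<>(wordList); // defensive copy
    }

    static List<String> demo() {
        return new WordListSketch()
                .wordList(Arrays.asList("Donau", "dampf"))
                .wordList("meer", "schiff")
                .build();
    }

    public static void main(String[] args) {
        System.out.println(demo()); // [Donau, dampf, meer, schiff]
    }
}
```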

    • wordListPath

      public final BuilderT wordListPath(@Nullable String value)
      Path to a file that contains a list of subwords to find in the token stream. If found, the subword is included in the token output. This path must be absolute or relative to the config location, and the file must be UTF-8 encoded. Each token in the file must be separated by a line break. Either this parameter or word_list must be specified.

      API name: word_list_path
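
The file format described above (UTF-8, one subword per line) can be produced with the standard library. The following sketch writes such a file and reads it back. The class name, the helper methods, and the use of a temporary file are assumptions for illustration; a real word list would need to sit at an absolute path or under the config location.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;

class WordListFileSketch {
    // Writes each subword on its own line, encoded as UTF-8.
    static Path writeWordList(List<String> subwords) throws IOException {
        Path file = Files.createTempFile("word_list", ".txt"); // hypothetical location
        Files.write(file, subwords, StandardCharsets.UTF_8);
        return file;
    }

    // Writes the list, reads it back line by line, then removes the temporary file.
    static List<String> roundTrip(List<String> subwords) {
        try {
            Path file = writeWordList(subwords);
            try {
                return Files.readAllLines(file, StandardCharsets.UTF_8);
            } finally {
                Files.deleteIfExists(file);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(roundTrip(Arrays.asList("Donau", "dampf", "schiff")));
    }
}
```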