Class WordDelimiterTokenFilterBase
java.lang.Object
co.elastic.clients.elasticsearch._types.analysis.TokenFilterBase
co.elastic.clients.elasticsearch._types.analysis.WordDelimiterTokenFilterBase
- All Implemented Interfaces:
JsonpSerializable
- Direct Known Subclasses:
WordDelimiterGraphTokenFilter,WordDelimiterTokenFilter
Nested Class Summary
Nested Classes
static class WordDelimiterTokenFilterBase.AbstractBuilder<BuilderT extends WordDelimiterTokenFilterBase.AbstractBuilder<BuilderT>>
Constructor Summary
Constructors
protected WordDelimiterTokenFilterBase
Method Summary
final Boolean catenateAll()
If true, the filter produces catenated tokens for chains of alphanumeric characters separated by non-alphabetic delimiters.
final Boolean catenateNumbers()
If true, the filter produces catenated tokens for chains of numeric characters separated by non-alphabetic delimiters.
final Boolean catenateWords()
If true, the filter produces catenated tokens for chains of alphabetical characters separated by non-alphabetic delimiters.
final Boolean generateNumberParts()
If true, the filter includes tokens consisting of only numeric characters in the output.
final Boolean generateWordParts()
If true, the filter includes tokens consisting of only alphabetical characters in the output.
final Boolean preserveOriginal()
If true, the filter includes the original version of any split tokens in the output.
protectedWords()
Array of tokens the filter won’t split.
final String protectedWordsPath()
Path to a file that contains a list of tokens the filter won’t split.
protected void serializeInternal(jakarta.json.stream.JsonGenerator generator, JsonpMapper mapper)
protected static <BuilderT extends WordDelimiterTokenFilterBase.AbstractBuilder<BuilderT>> void setupWordDelimiterTokenFilterBaseDeserializer(ObjectDeserializer<BuilderT> op)
final Boolean splitOnCaseChange()
If true, the filter splits tokens at letter case transitions.
final Boolean splitOnNumerics()
If true, the filter splits tokens at letter-number transitions.
final Boolean stemEnglishPossessive()
If true, the filter removes the English possessive ('s) from the end of each token.
typeTable()
Array of custom type mappings for characters.
final String typeTablePath()
Path to a file that contains custom type mappings for characters.
Methods inherited from class co.elastic.clients.elasticsearch._types.analysis.TokenFilterBase
serialize, setupTokenFilterBaseDeserializer, toString, version
Constructor Details
WordDelimiterTokenFilterBase
Method Details
catenateAll
If true, the filter produces catenated tokens for chains of alphanumeric characters separated by non-alphabetic delimiters. Defaults to false.
API name: catenate_all
catenateNumbers
If true, the filter produces catenated tokens for chains of numeric characters separated by non-alphabetic delimiters. Defaults to false.
API name: catenate_numbers
catenateWords
If true, the filter produces catenated tokens for chains of alphabetical characters separated by non-alphabetic delimiters. Defaults to false.
API name: catenate_words
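The three catenate options above differ only in which kind of part they join. The following is a minimal, simplified sketch of that idea in plain Java. It is not the filter's actual implementation: the class name CatenateSketch and its helper methods are hypothetical, it handles ASCII letters and digits only, and it returns just the catenated tokens, not the full token stream the filter would emit.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Predicate;

// Hypothetical illustration class; not part of the Elasticsearch Java client.
final class CatenateSketch {

    // Joins each chain of two or more consecutive delimiter-separated parts
    // that match `kind` into a single catenated token.
    static List<String> catenate(String token, Predicate<String> kind) {
        List<String> out = new ArrayList<>();
        StringBuilder chain = new StringBuilder();
        int partsInChain = 0;
        // Assumption: any run of non-alphanumeric ASCII characters is a delimiter.
        for (String part : token.split("[^\\p{Alnum}]+")) {
            if (!part.isEmpty() && kind.test(part)) {
                chain.append(part);
                partsInChain++;
            } else {
                if (partsInChain > 1) {
                    out.add(chain.toString());
                }
                chain.setLength(0);
                partsInChain = 0;
            }
        }
        if (partsInChain > 1) {
            out.add(chain.toString());
        }
        return out;
    }

    static boolean isWord(String part) {
        return part.chars().allMatch(Character::isLetter);
    }

    static boolean isNumber(String part) {
        return part.chars().allMatch(Character::isDigit);
    }

    static boolean isAny(String part) {
        return true;
    }

    public static void main(String[] args) {
        // Analogue of catenate_all: letters and digits are joined together.
        System.out.println(catenate("super-duper-xl-500", CatenateSketch::isAny));
        // Analogue of catenate_words: only the alphabetical parts are joined.
        System.out.println(catenate("super-duper-xl-500", CatenateSketch::isWord));
        // Analogue of catenate_numbers: only the numeric parts are joined.
        System.out.println(catenate("01-02-03", CatenateSketch::isNumber));
    }
}
```

In this sketch a chain of a single part produces no catenated token, since there is nothing to join.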
generateNumberParts
If true, the filter includes tokens consisting of only numeric characters in the output. If false, the filter excludes these tokens from the output. Defaults to true.
API name: generate_number_parts
generateWordParts
If true, the filter includes tokens consisting of only alphabetical characters in the output. If false, the filter excludes these tokens from the output. Defaults to true.
API name: generate_word_parts
preserveOriginal
If true, the filter includes the original version of any split tokens in the output. This original version includes non-alphanumeric delimiters. Defaults to false.
API name: preserve_original
protectedWords
Array of tokens the filter won’t split.
API name: protected_words
protectedWordsPath
Path to a file that contains a list of tokens the filter won’t split. This path must be absolute or relative to the config location, and the file must be UTF-8 encoded. Each token in the file must be separated by a line break.
API name: protected_words_path
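The file format described above (UTF-8, one token per line) can be illustrated with a small reader in plain Java. This is a sketch under stated assumptions, not how Elasticsearch itself loads the file: the class name ProtectedWordsFileSketch is hypothetical, and trimming whitespace and skipping blank lines are choices made here for illustration only.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration class; not part of the Elasticsearch Java client.
final class ProtectedWordsFileSketch {

    // Turns the lines of a protected-words file into a set of tokens.
    // Assumption: surrounding whitespace is trimmed and blank lines are ignored.
    static Set<String> parse(List<String> lines) {
        Set<String> words = new LinkedHashSet<>();
        for (String line : lines) {
            String token = line.trim();
            if (!token.isEmpty()) {
                words.add(token);
            }
        }
        return words;
    }

    // Reads the file as UTF-8, one token per line.
    static Set<String> load(Path file) {
        try {
            return parse(Files.readAllLines(file, StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) throws IOException {
        // Write a throwaway file so the example runs on its own.
        Path file = Files.createTempFile("protected_words", ".txt");
        Files.write(file, "wi-fi\nPowerShot\n".getBytes(StandardCharsets.UTF_8));
        System.out.println(load(file));
        Files.delete(file);
    }
}
```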
splitOnCaseChange
If true, the filter splits tokens at letter case transitions. For example: camelCase -> [ camel, Case ]. Defaults to true.
API name: split_on_case_change
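The camelCase example above can be reproduced with a short regular-expression sketch. This is a simplified illustration, not the filter's implementation: the class name CaseChangeSplitSketch is hypothetical, and the sketch assumes a split only where a lowercase letter is immediately followed by an uppercase letter.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration class; not part of the Elasticsearch Java client.
final class CaseChangeSplitSketch {

    // Splits at every position where a lowercase letter is followed by an
    // uppercase letter. Both lookarounds are zero-width, so no characters are lost.
    static List<String> split(String token) {
        return Arrays.asList(token.split("(?<=\\p{Ll})(?=\\p{Lu})"));
    }

    public static void main(String[] args) {
        System.out.println(split("camelCase")); // [camel, Case]
        System.out.println(split("lowercase")); // [lowercase]
    }
}
```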
splitOnNumerics
If true, the filter splits tokens at letter-number transitions. For example: j2se -> [ j, 2, se ]. Defaults to true.
API name: split_on_numerics
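The j2se example above can be sketched the same way, splitting wherever a letter meets a digit or a digit meets a letter. This is a simplified illustration in plain Java with a hypothetical class name (NumericSplitSketch); it is not the filter's implementation.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration class; not part of the Elasticsearch Java client.
final class NumericSplitSketch {

    // Splits at letter-to-digit and digit-to-letter boundaries using
    // zero-width lookarounds, so every character ends up in some part.
    static List<String> split(String token) {
        return Arrays.asList(token.split("(?<=\\p{L})(?=\\p{Nd})|(?<=\\p{Nd})(?=\\p{L})"));
    }

    public static void main(String[] args) {
        System.out.println(split("j2se")); // [j, 2, se]
        System.out.println(split("abc"));  // [abc]
    }
}
```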
stemEnglishPossessive
If true, the filter removes the English possessive ('s) from the end of each token. For example: O'Neil's -> [ O, Neil ]. Defaults to true.
API name: stem_english_possessive
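To make the effect of the possessive option concrete, the sketch below compares the parts produced with and without removing a trailing 's. It is a simplified plain-Java illustration with a hypothetical class name (PossessiveSketch), and it assumes that any run of non-alphanumeric ASCII characters acts as a delimiter.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of the Elasticsearch Java client.
final class PossessiveSketch {

    // Optionally drops a trailing 's, then splits on non-alphanumeric characters.
    static List<String> split(String token, boolean stemPossessive) {
        String input = stemPossessive ? token.replaceFirst("'[sS]$", "") : token;
        List<String> parts = new ArrayList<>();
        for (String part : input.split("[^\\p{Alnum}]+")) {
            if (!part.isEmpty()) {
                parts.add(part);
            }
        }
        return parts;
    }

    public static void main(String[] args) {
        System.out.println(split("O'Neil's", true));  // [O, Neil]
        System.out.println(split("O'Neil's", false)); // [O, Neil, s]
    }
}
```

Without the stemming step, the possessive suffix survives as a stray "s" part, which is the noise this option is meant to avoid.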
typeTable
Array of custom type mappings for characters. This allows you to map non-alphanumeric characters as numeric or alphanumeric to avoid splitting on those characters.
API name: type_table
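In the Elasticsearch reference documentation, type table entries are written as strings of the form "character => TYPE", for example "+ => ALPHA". The sketch below parses such entries into a map. It is an illustration under assumptions: the class name TypeTableSketch is hypothetical, only single literal characters are handled (no escape sequences), and the type name is stored as-is without validation.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Arrays;

// Hypothetical illustration class; not part of the Elasticsearch Java client.
final class TypeTableSketch {

    // Parses rules such as "+ => ALPHA" into a character-to-type map.
    static Map<Character, String> parse(List<String> rules) {
        Map<Character, String> table = new LinkedHashMap<>();
        for (String rule : rules) {
            String[] sides = rule.split("=>", 2);
            if (sides.length != 2) {
                throw new IllegalArgumentException("Invalid mapping rule: " + rule);
            }
            String character = sides[0].trim();
            String type = sides[1].trim();
            // Assumption: exactly one literal character on the left-hand side.
            if (character.length() != 1 || type.isEmpty()) {
                throw new IllegalArgumentException("Invalid mapping rule: " + rule);
            }
            table.put(character.charAt(0), type);
        }
        return table;
    }

    public static void main(String[] args) {
        // Treat '+' and '-' as letters so tokens are not split on them.
        System.out.println(parse(Arrays.asList("+ => ALPHA", "- => ALPHA")));
    }
}
```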
typeTablePath
Path to a file that contains custom type mappings for characters. This allows you to map non-alphanumeric characters as numeric or alphanumeric to avoid splitting on those characters.
API name: type_table_path
serializeInternal
- Overrides:
serializeInternal in class TokenFilterBase
setupWordDelimiterTokenFilterBaseDeserializer
protected static <BuilderT extends WordDelimiterTokenFilterBase.AbstractBuilder<BuilderT>> void setupWordDelimiterTokenFilterBaseDeserializer(ObjectDeserializer<BuilderT> op)