Class CodepointCountFilter
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.util.FilteringTokenFilter
org.apache.lucene.analysis.miscellaneous.CodepointCountFilter
- All Implemented Interfaces:
Closeable, AutoCloseable
Removes words that are too long or too short from the stream.
Note: Length is calculated as the number of Unicode codepoints.
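To illustrate what counting in codepoints means, here is a minimal sketch using only the JDK. The class name CodepointLengthDemo and the helper codepointLength are hypothetical names chosen for this example and are not part of Lucene. A character outside the Basic Multilingual Plane takes two UTF-16 chars but counts as one codepoint.

```java
// Hypothetical demo class; not part of Lucene.
final class CodepointLengthDemo {

    /** Counts Unicode codepoints in the text, as opposed to UTF-16 code units. */
    static int codepointLength(String text) {
        return text.codePointCount(0, text.length());
    }

    public static void main(String[] args) {
        String plain = "lucene";
        // 'a', U+1F600 (encoded as a surrogate pair), 'b'
        String withSupplementary = "a\uD83D\uDE00b";

        // String.length() reports UTF-16 code units; codepointLength reports codepoints.
        System.out.println(plain.length() + " vs " + codepointLength(plain));
        System.out.println(withSupplementary.length() + " vs " + codepointLength(withSupplementary));
    }
}
```

For text that contains only BMP characters the two counts agree; they differ only when surrogate pairs are present.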
Nested Class Summary
Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource
AttributeSource.AttributeFactory, AttributeSource.State
Constructor Summary
Constructors
CodepointCountFilter(Version version, TokenStream in, int min, int max)
Create a new CodepointCountFilter.
Method Summary
Methods inherited from class org.apache.lucene.analysis.util.FilteringTokenFilter
end, getEnablePositionIncrements, incrementToken, reset, setEnablePositionIncrements
Methods inherited from class org.apache.lucene.analysis.TokenFilter
close
Methods inherited from class org.apache.lucene.util.AttributeSource
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, copyTo, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, reflectAsString, reflectWith, restoreState, toString
Constructor Details
CodepointCountFilter
Create a new CodepointCountFilter. This will filter out tokens whose CharTermAttribute is either too short (Character.codePointCount(char[], int, int) < min) or too long (Character.codePointCount(char[], int, int) > max). Tokens whose codepoint count lies between min and max, inclusive, are kept.
- Parameters:
version - the Lucene match version
in - the TokenStream to consume
min - the minimum length, in Unicode codepoints
max - the maximum length, in Unicode codepoints
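The length check described for this constructor can be sketched with the JDK alone. This is an illustration of the documented condition, not the filter's actual source: the class CodepointRangeCheck and the method withinRange are hypothetical names, and the buffer and length arguments stand in for a token's term characters and term length.

```java
// Hypothetical sketch of the documented min/max check; not Lucene's implementation.
final class CodepointRangeCheck {

    /**
     * Returns true when the first `length` chars of `buffer` contain between
     * `min` and `max` codepoints, inclusive. A token failing this check would
     * be treated as too short or too long.
     */
    static boolean withinRange(char[] buffer, int length, int min, int max) {
        int count = Character.codePointCount(buffer, 0, length);
        return count >= min && count <= max;
    }

    public static void main(String[] args) {
        String[] tokens = {"a", "lucene", "internationalization"};
        for (String token : tokens) {
            char[] chars = token.toCharArray();
            // Keep only tokens with 2 to 10 codepoints.
            System.out.println(token + " kept: " + withinRange(chars, chars.length, 2, 10));
        }
    }
}
```

Because the count is taken in codepoints, a token made of two supplementary characters occupies four chars yet still passes a maximum of 3.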
Method Details
accept
public boolean accept()