Package org.apache.lucene.analysis.core
Class StopFilterFactory
- java.lang.Object
-
- org.apache.lucene.analysis.util.AbstractAnalysisFactory
-
- org.apache.lucene.analysis.util.TokenFilterFactory
-
- org.apache.lucene.analysis.core.StopFilterFactory
-
- All Implemented Interfaces:
ResourceLoaderAware
public class StopFilterFactory extends TokenFilterFactory implements ResourceLoaderAware
Factory forStopFilter.<fieldType name="text_stop" class="solr.TextField" positionIncrementGap="100" autoGeneratePhraseQueries="true"> <analyzer> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" format="wordset" /> </analyzer> </fieldType>All attributes are optional:
ignoreCasedefaults tofalsewordsshould be the name of a stopwords file to parse, if not specified the factory will useStopAnalyzer.ENGLISH_STOP_WORDS_SETformatdefines how thewordsfile will be parsed, and defaults towordset. Ifwordsis not specified, thenformatmust not be specified.
The valid values for the
formatoption are:wordset- This is the default format, which supports one word per line (including any intra-word whitespace) and allows whole line comments begining with the "#" character. Blank lines are ignored. SeeWordlistLoader.getLinesfor details.snowball- This format allows for multiple words specified on each line, and trailing comments may be specified using the vertical line ("|"). Blank lines are ignored. SeeWordlistLoader.getSnowballWordSetfor details.
-
-
Field Summary
Fields Modifier and Type Field Description static java.lang.StringFORMAT_SNOWBALLstatic java.lang.StringFORMAT_WORDSET-
Fields inherited from class org.apache.lucene.analysis.util.AbstractAnalysisFactory
LUCENE_MATCH_VERSION_PARAM
-
-
Constructor Summary
Constructors Constructor Description StopFilterFactory(java.util.Map<java.lang.String,java.lang.String> args)Creates a new StopFilterFactory
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description TokenStreamcreate(TokenStream input)Transform the specified input TokenStreamCharArraySetgetStopWords()voidinform(ResourceLoader loader)Initializes this component with the provided ResourceLoader (used for loading classes, files, etc).booleanisEnablePositionIncrements()booleanisIgnoreCase()-
Methods inherited from class org.apache.lucene.analysis.util.TokenFilterFactory
availableTokenFilters, forName, lookupClass, reloadTokenFilters
-
Methods inherited from class org.apache.lucene.analysis.util.AbstractAnalysisFactory
get, get, get, get, get, getChar, getClassArg, getLuceneMatchVersion, getOriginalArgs, getSet, isExplicitLuceneMatchVersion, require, require, require, requireChar, setExplicitLuceneMatchVersion
-
-
-
-
Field Detail
-
FORMAT_WORDSET
public static final java.lang.String FORMAT_WORDSET
- See Also:
- Constant Field Values
-
FORMAT_SNOWBALL
public static final java.lang.String FORMAT_SNOWBALL
- See Also:
- Constant Field Values
-
-
Method Detail
-
inform
public void inform(ResourceLoader loader) throws java.io.IOException
Description copied from interface:ResourceLoaderAwareInitializes this component with the provided ResourceLoader (used for loading classes, files, etc).- Specified by:
informin interfaceResourceLoaderAware- Throws:
java.io.IOException
-
isEnablePositionIncrements
public boolean isEnablePositionIncrements()
-
isIgnoreCase
public boolean isIgnoreCase()
-
getStopWords
public CharArraySet getStopWords()
-
create
public TokenStream create(TokenStream input)
Description copied from class:TokenFilterFactoryTransform the specified input TokenStream- Specified by:
createin classTokenFilterFactory
-
-