Package org.apache.lucene.analysis.fa
Class PersianNormalizer
java.lang.Object
org.apache.lucene.analysis.fa.PersianNormalizer
Normalizer for Persian.
Normalization is done in-place for efficiency, operating on a term buffer.
Normalization is defined as:
- Normalization of various heh + hamza forms and heh goal to heh.
- Normalization of Farsi yeh and yeh barree to Arabic yeh.
- Normalization of Persian keheh to Arabic kaf.
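The three rules above can be sketched as a single pass over a char buffer. This is a minimal, self-contained illustration and not the class's actual source: the class name PersianNormalizationSketch is hypothetical, the code points are assumptions taken from the Unicode names of the letters, and treating the combining hamza above as a deleted character is one reading of "heh + hamza forms". The sketch compacts the buffer with a read index and a write index, which may differ from how PersianNormalizer does it.

```java
class PersianNormalizationSketch {

    /** Rewrites s[0..len) in place and returns the new logical length. */
    static int normalize(char[] s, int len) {
        int out = 0; // next write position; never ahead of the read position i
        for (int i = 0; i < len; i++) {
            char c = s[i];
            switch (c) {
                case '\u06CC': // Farsi yeh
                case '\u06D2': // yeh barree
                    s[out++] = '\u064A'; // Arabic yeh
                    break;
                case '\u06A9': // Persian keheh
                    s[out++] = '\u0643'; // Arabic kaf
                    break;
                case '\u06C0': // heh with yeh above (a heh + hamza form)
                case '\u06C1': // heh goal
                    s[out++] = '\u0647'; // heh
                    break;
                case '\u0654': // combining hamza above: dropped (assumption)
                    break;
                default:
                    s[out++] = c; // everything else is copied unchanged
                    break;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // keheh, teh, alef, beh, Farsi yeh
        char[] buf = "\u06A9\u062A\u0627\u0628\u06CC".toCharArray();
        int newLen = normalize(buf, buf.length);
        System.out.println(new String(buf, 0, newLen));
    }
}
```

Because the mapping only replaces or removes characters, the output never grows, so the input array can double as the output buffer.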
Field Summary
Fields
Modifier and Type    Field          Description
static final char    FARSI_YEH
static final char    HAMZA_ABOVE
static final char    HEH
static final char    HEH_GOAL
static final char    HEH_YEH
static final char    KAF
static final char    KEHEH
static final char    YEH
static final char    YEH_BARREE
Constructor Summary
Constructors
Constructor
PersianNormalizer()
Method Summary
Modifier and Type    Method                          Description
int                  normalize(char[] s, int len)    Normalize an input buffer of Persian text
Field Details
YEH
public static final char YEH
FARSI_YEH
public static final char FARSI_YEH
YEH_BARREE
public static final char YEH_BARREE
KEHEH
public static final char KEHEH
KAF
public static final char KAF
HAMZA_ABOVE
public static final char HAMZA_ABOVE
HEH_YEH
public static final char HEH_YEH
HEH_GOAL
public static final char HEH_GOAL
HEH
public static final char HEH
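The field entries above do not show the constant values on this page. As a hedged aid, the sketch below pairs each field name with the Unicode code point whose character name matches it. These values are assumptions inferred from the names, and the class PersianConstantsSketch is hypothetical. Character.getName is used so that the JDK's own Unicode data confirms which character each value denotes.

```java
class PersianConstantsSketch {
    // Assumed values, chosen by matching field names to Unicode character names.
    static final char YEH = '\u064A';
    static final char FARSI_YEH = '\u06CC';
    static final char YEH_BARREE = '\u06D2';
    static final char KEHEH = '\u06A9';
    static final char KAF = '\u0643';
    static final char HAMZA_ABOVE = '\u0654';
    static final char HEH_YEH = '\u06C0';
    static final char HEH_GOAL = '\u06C1';
    static final char HEH = '\u0647';

    public static void main(String[] args) {
        char[] all = {YEH, FARSI_YEH, YEH_BARREE, KEHEH, KAF,
                      HAMZA_ABOVE, HEH_YEH, HEH_GOAL, HEH};
        for (char c : all) {
            // Print the code point and the name the JDK reports for it.
            System.out.printf("U+%04X %s%n", (int) c, Character.getName(c));
        }
    }
}
```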
Constructor Details
PersianNormalizer
public PersianNormalizer()
Method Details
normalize
public int normalize(char[] s, int len)
Normalize an input buffer of Persian text.
Parameters:
s - input buffer
len - length of input buffer
Returns:
length of input buffer after normalization
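The return value matters because the array itself is never resized: when a character is removed, the valid text shrinks but the buffer keeps its capacity. The sketch below illustrates that contract with a stand-in that handles only one case, removing a combining hamza above. The names InPlaceLengthSketch, delete, and dropHamzaAbove are hypothetical, and the code point for hamza above is an assumption. A caller of normalize(char[] s, int len) would treat its result the same way as newLen here.

```java
class InPlaceLengthSketch {
    static final char HAMZA_ABOVE = '\u0654'; // assumed code point

    /** Hypothetical helper: removes s[pos] by shifting the tail left. */
    static int delete(char[] s, int pos, int len) {
        if (pos < len - 1) {
            System.arraycopy(s, pos + 1, s, pos, len - pos - 1);
        }
        return len - 1;
    }

    /** Stand-in for normalize(char[] s, int len), limited to one rule. */
    static int dropHamzaAbove(char[] s, int len) {
        for (int i = 0; i < len; i++) {
            if (s[i] == HAMZA_ABOVE) {
                len = delete(s, i, len);
                i--; // re-examine the character that moved into position i
            }
        }
        return len;
    }

    public static void main(String[] args) {
        char[] buf = {'\u0647', HAMZA_ABOVE, 'x'}; // heh, hamza above, x
        int newLen = dropHamzaAbove(buf, buf.length);
        // Only the first newLen chars are valid; the rest of buf is stale.
        String term = new String(buf, 0, newLen);
        System.out.println(newLen + " of " + buf.length + " chars valid: " + term);
    }
}
```

Building the result with new String(buf, 0, newLen) rather than new String(buf) avoids picking up the stale tail left behind by the shift.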