public final class UCharacterProperty extends Object
Internal class used for Unicode character property database.
This classes store binary data read from uprops.icu. It does not have the capability to parse the data into more high-level information. It only returns bytes of information when required.
Due to the form most commonly used for retrieval, array of char is used to store the binary data.
UCharacterPropertyDB also contains information on accessing indexes to significant points in the binary data.
Responsibility for molding the binary data into more meaning form lies on UCharacter.
| Modifier and Type | Field and Description |
|---|---|
static char |
LATIN_CAPITAL_LETTER_I_WITH_DOT_ABOVE_
Latin capital letter i with dot above
|
static char |
LATIN_SMALL_LETTER_DOTLESS_I_
Latin small letter i with dot above
|
static char |
LATIN_SMALL_LETTER_I_
Latin lowercase i
|
CharTrie |
m_trie_
Trie data
|
char[] |
m_trieData_
Optimization
CharTrie data array
|
char[] |
m_trieIndex_
Optimization
CharTrie index array
|
int |
m_trieInitialValue_
Optimization
CharTrie data offset
|
VersionInfo |
m_unicodeVersion_
Unicode version
|
static int |
NT_COUNT |
static int |
NT_FRACTION |
static int |
NT_LARGE |
static int |
SRC_BIDI
From ubidi_props.c/ubidi.icu
|
static int |
SRC_CASE
From ucase.c/ucase.icu
|
static int |
SRC_CHAR
From uchar.c/uprops.icu main trie
|
static int |
SRC_CHAR_AND_PROPSVEC
From uchar.c/uprops.icu main trie as well as properties vectors trie
|
static int |
SRC_COUNT
One more than the highest UPropertySource (SRC_) constant.
|
static int |
SRC_HST
Hangul_Syllable_Type, from uchar.c/uprops.icu
|
static int |
SRC_NAMES
From unames.c/unames.icu
|
static int |
SRC_NONE
No source, not a supported property.
|
static int |
SRC_NORM
From unorm.cpp/unorm.icu
|
static int |
SRC_PROPSVEC
From uchar.c/uprops.icu properties vectors trie
|
static int |
TYPE_MASK
Character type mask
|
| Modifier and Type | Method and Description |
|---|---|
int |
getAdditional(int codepoint,
int column)
Gets the unicode additional properties.
|
VersionInfo |
getAge(int codepoint)
Get the "age" of the code point.
|
static UCharacterProperty |
getInstance()
Loads the property data and initialize the UCharacterProperty instance.
|
static int |
getMask(int type)
Gets the type mask
|
int |
getMaxValues(int column)
Get the the maximum values for some enum/int properties.
|
int |
getProperty(int ch)
Gets the property value at the index.
|
static int |
getRawSupplementary(char lead,
char trail)
Forms a supplementary code point from the argument character
Note this is for internal use hence no checks for the validity of the surrogate characters are done |
int |
getSource(int which) |
static int |
getUnsignedValue(int prop)
Getting the unsigned numeric value of a character embedded in the property
argument
|
static boolean |
isRuleWhiteSpace(int c)
Checks if the argument c is to be treated as a white space in ICU
rules.
|
void |
setIndexData(CharTrie.FriendAgent friendagent)
Java friends implementation
|
public CharTrie m_trie_
public char[] m_trieIndex_
public char[] m_trieData_
public int m_trieInitialValue_
public VersionInfo m_unicodeVersion_
public static final char LATIN_CAPITAL_LETTER_I_WITH_DOT_ABOVE_
public static final char LATIN_SMALL_LETTER_DOTLESS_I_
public static final char LATIN_SMALL_LETTER_I_
public static final int TYPE_MASK
public static final int SRC_NONE
public static final int SRC_CHAR
public static final int SRC_PROPSVEC
public static final int SRC_HST
public static final int SRC_NAMES
public static final int SRC_NORM
public static final int SRC_CASE
public static final int SRC_BIDI
public static final int SRC_CHAR_AND_PROPSVEC
public static final int SRC_COUNT
public static final int NT_FRACTION
public static final int NT_LARGE
public static final int NT_COUNT
public void setIndexData(CharTrie.FriendAgent friendagent)
public final int getProperty(int ch)
ch - code point whose property value is to be retrievedpublic static int getUnsignedValue(int prop)
prop - the characterpublic int getAdditional(int codepoint,
int column)
codepoint - codepoint whose additional properties is to be
retrievedcolumn - public VersionInfo getAge(int codepoint)
Get the "age" of the code point.
The "age" is the Unicode version when the code point was first designated (as a non-character or for Private Use) or assigned a character.
This can be useful to avoid emitting code points to receiving processes that do not accept newer characters.
The data is from the UCD file DerivedAge.txt.
This API does not check the validity of the codepoint.
codepoint - The code point.public final int getSource(int which)
public static int getRawSupplementary(char lead,
char trail)
lead - lead surrogate charactertrail - trailing surrogate characterpublic static UCharacterProperty getInstance()
MissingResourceException - when data is missing or data has been corruptedpublic static boolean isRuleWhiteSpace(int c)
c - codepoint to checkpublic int getMaxValues(int column)
public static final int getMask(int type)
type - character typeCopyright © 2010 - 2020 Adobe. All Rights Reserved