- get() - Method in enum org.apache.tika.parser.strings.StringsEncoding
-
- getAllDetectableCharsets() - Static method in class org.apache.tika.parser.txt.CharsetDetector
-
Get the names of all charsets supported by the CharsetDetector class.
- getCharset() - Method in class org.apache.tika.parser.csv.CSVParams
-
- getConfidence() - Method in class org.apache.tika.parser.csv.CSVResult
-
- getConfidence() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Get an indication of the confidence in the charset detected.
- getDelimiter() - Method in class org.apache.tika.parser.csv.CSVParams
-
- getDelimiter() - Method in class org.apache.tika.parser.csv.CSVResult
-
- getDetectableCharsets() - Method in class org.apache.tika.parser.txt.CharsetDetector
-
- getEncoding() - Method in class org.apache.tika.parser.strings.StringsConfig
-
Returns the character encoding of the strings that are to be found.
- getLanguage() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Get the ISO code for the language of the detected charset.
- getMarkLimit() - Method in class org.apache.tika.parser.txt.Icu4jEncodingDetector
-
- getMarkLimit() - Method in class org.apache.tika.parser.txt.UniversalEncodingDetector
-
- getMediaType() - Method in class org.apache.tika.parser.csv.CSVParams
-
- getMediaType() - Method in class org.apache.tika.parser.csv.CSVResult
-
- getMinLength() - Method in class org.apache.tika.parser.strings.StringsConfig
-
Returns the minimum sequence length (characters) to print.
- getMinLength() - Method in class org.apache.tika.parser.strings.StringsParser
-
- getMinSize() - Method in class org.apache.tika.parser.strings.Latin1StringsParser
-
Returns the minimum size of a character sequence to be extracted.
- getName() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Get the name of the detected charset.
- getNormalizedName() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
strips e.g.
- getReader(InputStream, String) - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Autodetect the charset of an InputStream and return a Java Reader
for accessing the converted input data.
- getReader() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Create a java.io.Reader for reading the Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
- getString(byte[], String) - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Autodetect the charset of a byte array and return a String
containing the converted input data.
- getString() - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Create a Java String from Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
- getString(int) - Method in class org.apache.tika.parser.txt.CharsetMatch
-
Create a Java String from Unicode character data corresponding
to the original byte data supplied to the Charset detect operation.
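The getString entries above describe turning the original bytes into a Java String once a charset name is known. As a rough, JDK-only illustration of that step (it does not use or reproduce CharsetMatch; the class `DecodeSketch` and the helper `decode` are hypothetical names), the sketch below decodes the same bytes under two charset names to show why the detected name matters:

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class DecodeSketch {

    // Hypothetical helper: decode raw bytes using a charset identified by name,
    // such as a name reported by a detector.
    static String decode(byte[] raw, String charsetName) {
        return new String(raw, Charset.forName(charsetName));
    }

    public static void main(String[] args) {
        // "caf\u00e9" encoded as ISO-8859-1: the last character is the single byte 0xE9.
        byte[] raw = "caf\u00e9".getBytes(StandardCharsets.ISO_8859_1);

        // Decoding with the matching charset recovers the original text.
        System.out.println(decode(raw, "ISO-8859-1").equals("caf\u00e9"));

        // Decoding with a mismatched charset does not: 0xE9 alone is malformed UTF-8.
        System.out.println(decode(raw, "UTF-8").equals("caf\u00e9"));
    }
}
```

The charset names here are fixed by hand for the illustration; in practice the name would come from whichever match the detector reports.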
- getStringsEncoding() - Method in class org.apache.tika.parser.strings.StringsParser
-
- getStringsPath() - Method in class org.apache.tika.parser.strings.StringsParser
-
- getStringsProg() - Static method in class org.apache.tika.parser.strings.StringsParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.csv.TextAndCSVParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.strings.Latin1StringsParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.strings.StringsParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.txt.TXTParser
-
- getTimeoutSeconds() - Method in class org.apache.tika.parser.strings.StringsConfig
-
Returns the maximum time (in seconds) to wait for the "strings" command
to terminate.
- getTimeoutSeconds() - Method in class org.apache.tika.parser.strings.StringsParser
-
- setCharset(Charset) - Method in class org.apache.tika.parser.csv.CSVParams
-
- setDeclaredEncoding(String) - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Set the declared encoding for charset detection.
- setDelimiter(Character) - Method in class org.apache.tika.parser.csv.CSVParams
-
- setDetectableCharset(String, boolean) - Method in class org.apache.tika.parser.txt.CharsetDetector
-
- setEncoding(StringsEncoding) - Method in class org.apache.tika.parser.strings.StringsConfig
-
Sets the character encoding of the strings that are to be found.
- setEncoding(String) - Method in class org.apache.tika.parser.strings.StringsParser
-
- setIgnoreCharsets(List<String>) - Method in class org.apache.tika.parser.txt.Icu4jEncodingDetector
-
- setMarkLimit(int) - Method in class org.apache.tika.parser.txt.Icu4jEncodingDetector
-
How far into the stream to read for charset detection.
- setMarkLimit(int) - Method in class org.apache.tika.parser.txt.UniversalEncodingDetector
-
How far into the stream to read for charset detection.
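The mark limit described above bounds how many bytes are examined before the stream is rewound for the real parse. A minimal JDK-only sketch of that mark/read/reset pattern follows; it is not the detectors' actual code, and `MarkLimitSketch` and `peek` are hypothetical names introduced for illustration:

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class MarkLimitSketch {

    // Hypothetical helper: read at most markLimit bytes for inspection,
    // then reset the stream so later readers see it from the start.
    static byte[] peek(InputStream in, int markLimit) {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        try {
            in.mark(markLimit);
            byte[] buf = new byte[markLimit];
            int total = 0;
            while (total < markLimit) {
                int n = in.read(buf, total, markLimit - total);
                if (n < 0) {
                    break; // end of stream before the limit was reached
                }
                total += n;
            }
            in.reset(); // rewind to the marked position
            return Arrays.copyOf(buf, total);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) throws IOException {
        InputStream in = new BufferedInputStream(
                new ByteArrayInputStream("hello, world".getBytes(StandardCharsets.US_ASCII)));
        byte[] head = peek(in, 5);
        System.out.println(new String(head, StandardCharsets.US_ASCII)); // prints "hello"
        System.out.println((char) in.read()); // prints "h": the stream was rewound
    }
}
```

A larger limit gives a detector more evidence at the cost of more buffering; the value 5 above is only for demonstration.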
- setMediaType(MediaType) - Method in class org.apache.tika.parser.csv.CSVParams
-
- setMinLength(int) - Method in class org.apache.tika.parser.strings.StringsConfig
-
Sets the minimum sequence length (characters) to print.
- setMinLength(int) - Method in class org.apache.tika.parser.strings.StringsParser
-
- setMinSize(int) - Method in class org.apache.tika.parser.strings.Latin1StringsParser
-
Sets the minimum size of a character sequence to be extracted.
- setStringsPath(String) - Method in class org.apache.tika.parser.strings.StringsParser
-
Sets the "strings" installation folder.
- setStripMarkup(boolean) - Method in class org.apache.tika.parser.txt.Icu4jEncodingDetector
-
Whether to attempt to strip HTML-like markup
from the stream before sending it to the underlying
detector.
- setText(byte[]) - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Set the input text (byte) data whose charset is to be detected.
- setText(InputStream) - Method in class org.apache.tika.parser.txt.CharsetDetector
-
Set the input text (byte) data whose charset is to be detected.
- setTimeoutSeconds(int) - Method in class org.apache.tika.parser.strings.StringsConfig
-
Sets the maximum time (in seconds) to wait for the "strings" command to
terminate.
- setTimeoutSeconds(int) - Method in class org.apache.tika.parser.strings.StringsParser
-
- StringsConfig - Class in org.apache.tika.parser.strings
-
Configuration for the "strings" (or strings-alternative) command.
- StringsConfig() - Constructor for class org.apache.tika.parser.strings.StringsConfig
-
- StringsEncoding - Enum in org.apache.tika.parser.strings
-
Character encoding of the strings that are to be found using the "strings" command.
- StringsParser - Class in org.apache.tika.parser.strings
-
Parser that uses the "strings" (or strings-alternative) command to find the
printable strings in an object file or other binary file
(application/octet-stream).
- StringsParser() - Constructor for class org.apache.tika.parser.strings.StringsParser
-
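The minimum length and minimum size settings listed above (getMinLength, setMinLength, getMinSize, setMinSize) control how long a run of printable characters must be before it is reported. The JDK-only sketch below illustrates that rule under a simplifying assumption that "printable" means ASCII 0x20 through 0x7E; it does not call the "strings" command or any Tika class, and `PrintableStringsSketch` and `extract` are hypothetical names:

```java
import java.util.ArrayList;
import java.util.List;

public class PrintableStringsSketch {

    // Hypothetical helper: collect runs of printable ASCII bytes
    // that are at least minLength characters long.
    static List<String> extract(byte[] data, int minLength) {
        List<String> found = new ArrayList<>();
        StringBuilder run = new StringBuilder();
        for (byte b : data) {
            int c = b & 0xFF;
            if (c >= 0x20 && c <= 0x7E) {
                run.append((char) c); // extend the current printable run
            } else {
                if (run.length() >= minLength) {
                    found.add(run.toString());
                }
                run.setLength(0); // a non-printable byte ends the run
            }
        }
        if (run.length() >= minLength) {
            found.add(run.toString()); // flush a run that reaches the end of the data
        }
        return found;
    }

    public static void main(String[] args) {
        byte[] data = {0x00, 'a', 'b', 0x01, 'h', 'e', 'l', 'l', 'o', 0x02, 'T', 'i', 'k', 'a'};
        // With a minimum length of 4, the two-character run "ab" is dropped.
        System.out.println(extract(data, 4)); // prints [hello, Tika]
    }
}
```

Raising the minimum filters out more accidental short runs in binary data, while lowering it reports more candidates.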