Package org.apache.tika.extractor
package org.apache.tika.extractor
Extraction of component documents.
-
ClassDescriptionTika container extractor interface.Interface for different document selection strategies for purposes like embedded document extraction by a
ContainerExtractorinstance.Utility class to handle common issues with embedded documents.Tika container extractor callback interface.An implementation ofContainerExtractorpowered by the regularParserAPI.Helper class for parsers of package archives or other compound document formats that support embedded or attached component documents.