package excel
- Alphabetic
- Public
- Protected
Type Members
- class CellRangeAddressDataLocator extends DataLocator
Locating the data in Excel Range Address
- trait DataLocator extends AnyRef
For handling Excel data address and read data from there
- class ExcelDataSource extends TableProvider with DataSourceRegister
Creality Spark Excel data source entry point.
Creality Spark Excel data source entry point.
This class is heavily influenced by datasources.v2.FileDataSourceV2. We can not extends FileDataSourceV2 directly because that needs a fallback implementation with V1 API for writing.
- class ExcelFilters extends OrderedFilters
Wrapping the API change between spark 3.0 vs 3.1
- class ExcelGenerator extends AnyRef
- class ExcelHeaderChecker extends Logging
Checks that column names in a Excel header and field names in the schema are the same by taking into account case sensitivity.
- class ExcelHelper extends AnyRef
- class ExcelInferSchema extends Serializable
- class ExcelOptions extends Serializable
- class ExcelOutputWriter extends OutputWriter with Logging
- class ExcelParser extends Logging
Constructs a parser for a given schema that translates Excel data to an InternalRow.
- case class ExcelPartitionReaderFactory(sqlConf: SQLConf, broadcastedConf: Broadcast[SerializableConfiguration], dataSchema: StructType, readDataSchema: StructType, partitionSchema: StructType, parsedOptions: ExcelOptions, filters: Seq[Filter]) extends FilePartitionReaderFactory with Product with Serializable
A factory used to create Excel readers.
A factory used to create Excel readers.
- sqlConf
SQL configuration.
- broadcastedConf
Broadcasted serializable Hadoop Configuration.
- dataSchema
Schema of Excel files.
- readDataSchema
Required data schema in the batch scan.
- partitionSchema
Schema of partitions.
- parsedOptions
Options for parsing Excel files.
- case class ExcelScan(sparkSession: SparkSession, fileIndex: PartitioningAwareFileIndex, dataSchema: StructType, readDataSchema: StructType, readPartitionSchema: StructType, options: CaseInsensitiveStringMap, pushedFilters: Array[Filter], partitionFilters: Seq[Expression] = Seq.empty, dataFilters: Seq[Expression] = Seq.empty) extends TextBasedFileScan with Product with Serializable
- case class ExcelScanBuilder(sparkSession: SparkSession, fileIndex: PartitioningAwareFileIndex, schema: StructType, dataSchema: StructType, options: CaseInsensitiveStringMap) extends FileScanBuilder with SupportsPushDownFilters with Product with Serializable
- case class ExcelTable(name: String, sparkSession: SparkSession, map: CaseInsensitiveStringMap, paths: Seq[String], userSpecifiedSchema: Option[StructType]) extends FileTable with Product with Serializable
- case class ExcelWriteBuilder(paths: Seq[String], formatName: String, supportsDataType: (DataType) => Boolean, info: LogicalWriteInfo) extends FileWrite with Product with Serializable
- class TableDataLocator extends DataLocator
Locating the data in Excel Table
Value Members
- object DataLocator
- object ExcelDataSource
- object ExcelDateTimeStringUtils
- object ExcelFilters
- object ExcelHelper
- object ExcelInferSchema extends Serializable
- object ExcelParser
- object PlainNumberFormat extends Format
A format that formats a double as a plain string without rounding and scientific notation.
A format that formats a double as a plain string without rounding and scientific notation. All other operations are unsupported.
- See also
org.apache.poi.ss.usermodel.ExcelGeneralNumberFormat and SSNFormat from org.apache.poi.ss.usermodel.DataFormatter from Apache POI.