package excel
Type Members
-
class
CellRangeAddressDataLocator extends DataLocator
Locates data by an Excel range address.
-
trait
DataLocator extends AnyRef
Handles an Excel data address and reads data from it.
-
class
ExcelDataSource extends TableProvider with DataSourceRegister
Crealytics Spark Excel data source entry point.
This class is heavily influenced by datasources.v2.FileDataSourceV2. It cannot extend FileDataSourceV2 directly because that would require a fallback implementation using the V1 API for writing.
-
class
ExcelFilters extends OrderedFilters
Wraps the API change between Spark 3.0 and 3.1.
- class ExcelGenerator extends AnyRef
-
class
ExcelHeaderChecker extends Logging
Checks that the column names in an Excel header and the field names in the schema are the same, taking case sensitivity into account.
- class ExcelHelper extends AnyRef
- class ExcelInferSchema extends Serializable
- class ExcelOptions extends Serializable
- class ExcelOutputWriter extends OutputWriter with Logging
-
class
ExcelParser extends Logging
Constructs a parser for a given schema that translates Excel data to an InternalRow.
-
case class
ExcelPartitionReaderFactory(sqlConf: SQLConf, broadcastedConf: Broadcast[SerializableConfiguration], dataSchema: StructType, readDataSchema: StructType, partitionSchema: StructType, parsedOptions: ExcelOptions, filters: Seq[Filter]) extends FilePartitionReaderFactory with Product with Serializable
A factory used to create Excel readers.
- sqlConf
SQL configuration.
- broadcastedConf
Broadcasted serializable Hadoop Configuration.
- dataSchema
Schema of Excel files.
- readDataSchema
Required data schema in the batch scan.
- partitionSchema
Schema of partitions.
- parsedOptions
Options for parsing Excel files.
- case class ExcelScan(sparkSession: SparkSession, fileIndex: PartitioningAwareFileIndex, dataSchema: StructType, readDataSchema: StructType, readPartitionSchema: StructType, options: CaseInsensitiveStringMap, pushedFilters: Array[Filter], partitionFilters: Seq[Expression] = Seq.empty, dataFilters: Seq[Expression] = Seq.empty) extends TextBasedFileScan with Product with Serializable
- case class ExcelScanBuilder(sparkSession: SparkSession, fileIndex: PartitioningAwareFileIndex, schema: StructType, dataSchema: StructType, options: CaseInsensitiveStringMap) extends FileScanBuilder with SupportsPushDownFilters with Product with Serializable
- case class ExcelTable(name: String, sparkSession: SparkSession, map: CaseInsensitiveStringMap, paths: Seq[String], userSpecifiedSchema: Option[StructType]) extends FileTable with Product with Serializable
- case class ExcelWriteBuilder(paths: Seq[String], formatName: String, supportsDataType: (DataType) ⇒ Boolean, info: LogicalWriteInfo) extends FileWrite with Product with Serializable
-
class
TableDataLocator extends DataLocator
Locates data in an Excel table.
Value Members
- object DataLocator
- object ExcelDataSource
- object ExcelDateTimeStringUtils
- object ExcelFilters
- object ExcelHelper
- object ExcelInferSchema extends Serializable
- object ExcelParser
-
object
PlainNumberFormat extends Format
A format that formats a double as a plain string without rounding and scientific notation. All other operations are unsupported.
- See also
org.apache.poi.ss.usermodel.ExcelGeneralNumberFormat and SSNFormat from org.apache.poi.ss.usermodel.DataFormatter from Apache POI.
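The idea behind PlainNumberFormat, rendering a double without scientific notation, can be illustrated with a minimal sketch. This is not the library's implementation: the object name `PlainNumberSketch` and the method `toPlain` are hypothetical, and the sketch assumes that going through `java.math.BigDecimal` is an acceptable way to obtain a plain decimal string.

```scala
import java.math.BigDecimal

// Hypothetical illustration only; not part of the excel package.
object PlainNumberSketch {

  // Converts a finite double to a plain decimal string.
  // Double.toString yields the shortest decimal text that round-trips,
  // BigDecimal parses that text exactly, and toPlainString prints it
  // without an exponent. NaN and infinities are not handled here and
  // would throw NumberFormatException.
  def toPlain(value: Double): String =
    new BigDecimal(java.lang.Double.toString(value))
      .stripTrailingZeros()
      .toPlainString

  def main(args: Array[String]): Unit = {
    println(toPlain(1.0e10)) // prints 10000000000
    println(toPlain(1.5e-7)) // prints 0.00000015
    println(toPlain(12.5))   // prints 12.5
  }
}
```

Parsing the `Double.toString` text rather than calling `new BigDecimal(value)` avoids printing the long binary expansion of the double, which is one plausible reading of "without rounding" in the description above.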