Packages

package convert

Ordering
  1. Alphabetic
Visibility
  1. Public
  2. All

Type Members

  1. class Parquet2CSV extends SparkJob

    Convert parquet files to CSV.

    Convert parquet files to CSV. The folder hierarchy should be in the form /input_folder/domain/schema/part*.parquet Once converted the csv files is put in the folder /output_folder/domain/schema.csv file When the specified number of parittions is 1 then /output_folder/domain/schema.csv is the file containing the data otherwise, it is a folder containing the part*.csv files. When output_folder is not specified, then the input_folder is used a the base output folder.

  2. case class Parquet2CSVConfig(inputFolder: Path = new Path("/"), outputFolder: Option[Path] = None, domainName: Option[String] = None, schemaName: Option[String] = None, writeMode: Option[WriteMode] = None, deleteSource: Boolean = false, options: List[(String, String)] = Nil, partitions: Int = 1) extends Product with Serializable

Value Members

  1. object Parquet2CSV
  2. object Parquet2CSVConfig extends CliConfig[Parquet2CSVConfig] with Serializable

Ungrouped