trait DataUtils extends Serializable with WithHelpMethods
DataUtils provides utilities for understanding and interpreting datasets. This module is currently in preview and may be unstable.
Abstract Value Members
- abstract def help(moduleOrMethod: String): Unit
- Definition Classes
- WithHelpMethods
- abstract def help(): Unit
- Definition Classes
- WithHelpMethods
- abstract def summarize(df: Any, precise: Boolean = false): Unit
Summarize a Spark DataFrame and visualize the statistics to get quick insights.
Example:
dbutils.data.summarize(df, precise=false)
- df
The dataframe to summarize. Streaming dataframes are not supported.
- precise
If false, percentiles, distinct item counts, and frequent item counts will be computed approximately to reduce the run time. If true, distinct item counts and frequent item counts will be computed exactly, and percentiles will be computed with high precision.
- returns
A visualization of the computed summary statistics.
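The following is a minimal sketch of how summarize and help might be called from a Scala notebook. It assumes a Databricks notebook environment in which `spark` and `dbutils` are already defined; it will not run as a standalone program. The helper `summarizeBatch`, the sample DataFrame, and the column name `bucket` are hypothetical and are not part of DataUtils. The guard relies on `isStreaming` from the Spark Dataset API to reject streaming DataFrames, which summarize does not support.

```scala
import org.apache.spark.sql.DataFrame

// Assumption: this runs in a Databricks notebook, where `spark` and
// `dbutils` are predefined. `summarizeBatch` is a hypothetical helper,
// not part of DataUtils.
def summarizeBatch(df: DataFrame, precise: Boolean = false): Unit = {
  // summarize does not support streaming DataFrames, so reject them up front.
  require(!df.isStreaming, "summarize does not support streaming DataFrames")
  dbutils.data.summarize(df, precise = precise)
}

// Small in-memory DataFrame used only for illustration.
val df = spark.range(0, 1000).selectExpr("id", "id % 7 AS bucket")

// Default: percentiles, distinct counts, and frequent item counts are approximate.
summarizeBatch(df)

// precise = true: exact distinct and frequent item counts, higher-precision percentiles.
summarizeBatch(df, precise = true)

// Print the built-in help text for the summarize method.
dbutils.data.help("summarize")
```

Since precise = false exists to reduce run time, a reasonable pattern is to start with the default while exploring and switch to precise = true only when exact counts matter.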
Concrete Value Members
- final def !=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def ##: Int
- Definition Classes
- AnyRef → Any
- final def ==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- final def apply(): DataUtils.this.type
- Definition Classes
- WithHelpMethods
- final def asInstanceOf[T0]: T0
- Definition Classes
- Any
- def clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.CloneNotSupportedException]) @native()
- final def eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- def equals(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef → Any
- def finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.Throwable])
- final def getClass(): Class[_ <: AnyRef]
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- def hashCode(): Int
- Definition Classes
- AnyRef → Any
- Annotations
- @native()
- final def isInstanceOf[T0]: Boolean
- Definition Classes
- Any
- final def ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
- final def notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native()
- final def synchronized[T0](arg0: => T0): T0
- Definition Classes
- AnyRef
- def toString(): String
- Definition Classes
- AnyRef → Any
- final def wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException])
- final def wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws(classOf[java.lang.InterruptedException]) @native()