case class WhyProfileSession(dataFrame: DataFrame, name: String, timeColumn: String = null, groupByColumns: Seq[String] = List(), modelProfile: ModelProfileSession = null) extends Product with Serializable
A class that enable easy access to the profiling API
- dataFrame
the dataframe to profile
- name
the name of the dataset
- timeColumn
the time column, if the data is to be broken down by time
- groupByColumns
the group by column
- Alphabetic
- By Inheritance
- WhyProfileSession
- Serializable
- Serializable
- Product
- Equals
- AnyRef
- Any
- Hide All
- Show All
- Public
- All
Instance Constructors
-
new
WhyProfileSession(dataFrame: DataFrame, name: String, timeColumn: String = null, groupByColumns: Seq[String] = List(), modelProfile: ModelProfileSession = null)
- dataFrame
the dataframe to profile
- name
the name of the dataset
- timeColumn
the time column, if the data is to be broken down by time
- groupByColumns
the group by column
Value Members
-
final
def
!=(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
-
final
def
##(): Int
- Definition Classes
- AnyRef → Any
-
final
def
==(arg0: Any): Boolean
- Definition Classes
- AnyRef → Any
- val PROFILE_FIELD: String
-
def
aggProfiles(timestamp: Instant = Instant.now()): DataFrame
Run aggregation and build profile based on the specification of this session
Run aggregation and build profile based on the specification of this session
- timestamp
the session timestamp for the whole run (often the current time, or the start of the batch run
- returns
a DataFrame with aggregated profiles under 'why_profile' column
-
def
aggProfiles(timestamp: Long): DataFrame
Run aggregation and build profile based on the specification of this session
Run aggregation and build profile based on the specification of this session
- timestamp
the session timestamp for the whole run
- returns
a DataFrame with aggregated profiles under 'why_profile' column
-
final
def
asInstanceOf[T0]: T0
- Definition Classes
- Any
-
def
clone(): AnyRef
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native() @HotSpotIntrinsicCandidate()
-
final
def
eq(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
getClass(): Class[_]
- Definition Classes
- AnyRef → Any
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
def
groupBy(columns: List[String]): WhyProfileSession
A Java friendly API.
A Java friendly API. This is used by the Py4J gateway to pass data into the JV
- columns
list of columns for grouping
- returns
a new WhyProfileSession object
- def groupBy(col1: String, cols: String*): WhyProfileSession
-
final
def
isInstanceOf[T0]: Boolean
- Definition Classes
- Any
- def log(orgId: String, modelId: String, apiKey: String): Unit
-
final
def
ne(arg0: AnyRef): Boolean
- Definition Classes
- AnyRef
-
final
def
notify(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
final
def
notifyAll(): Unit
- Definition Classes
- AnyRef
- Annotations
- @native() @HotSpotIntrinsicCandidate()
-
final
def
synchronized[T0](arg0: ⇒ T0): T0
- Definition Classes
- AnyRef
-
final
def
wait(arg0: Long, arg1: Int): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
-
final
def
wait(arg0: Long): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... ) @native()
-
final
def
wait(): Unit
- Definition Classes
- AnyRef
- Annotations
- @throws( ... )
- def withModelProfile(predictionField: String, targetField: String, scoreField: String): WhyProfileSession
-
def
withTimeColumn(timeColumn: String): WhyProfileSession
Set the column for grouping by time.
Set the column for grouping by time. This column must be of Timestamp type in Spark SQL.
Note that WhyLogs uses this column to group data together, so please make sure you truncate the data to the appropriate level of precision (i.e. daily, hourly) before calling this. We only accept a column name at the moment. You can alias raw Column into a column name with String, col: Column)
- timeColumn
the column that contains the timestamp.
Deprecated Value Members
-
def
finalize(): Unit
- Attributes
- protected[lang]
- Definition Classes
- AnyRef
- Annotations
- @throws( classOf[java.lang.Throwable] ) @Deprecated @deprecated
- Deprecated
(Since version ) see corresponding Javadoc for more information.