public class GLMParametersV3 extends ModelParametersSchemaV3
| Modifier and Type | Field and Description |
|---|---|
double[] |
alpha
Distribution of regularization between the L1 (Lasso) and L2 (Ridge) penalties.
|
boolean |
balanceClasses
Balance training data class counts via over/under-sampling (for imbalanced data).
|
FrameKeyV3 |
betaConstraints
Beta constraints
|
double |
betaEpsilon
Converge if beta changes less (using L-infinity norm) than beta esilon, ONLY applies to IRLSM solver
|
boolean |
buildNullModel
If set, will build a model with only the intercept.
|
boolean |
calcLike
if true, will return likelihood function value.
|
float[] |
classSamplingFactors
Desired over/under-sampling ratios per class (in lexicographic order).
|
boolean |
coldStart
Only applicable to multiple alpha/lambda values.
|
boolean |
computePValues
Request p-values computation, p-values work only with IRLSM solver and no regularization
|
double |
dispersionEpsilon
If changes in dispersion parameter estimation or loglikelihood value is smaller than dispersion_epsilon, will
break out of the dispersion parameter estimation loop using maximum likelihood.
|
double |
dispersionLearningRate
Dispersion learning rate is only valid for tweedie family dispersion parameter estimation using ml.
|
GLMDispersionMethod |
dispersionParameterMethod
Method used to estimate the dispersion parameter for Tweedie, Gamma and Negative Binomial only.
|
boolean |
earlyStopping
Stop early when there is no more relative improvement on train or validation (if provided)
|
GLMFamily |
family
Family.
|
boolean |
fixDispersionParameter
Only used for Tweedie, Gamma and Negative Binomial GLM.
|
boolean |
fixTweedieVariancePower
If true, will fix tweedie variance power value to the value set in tweedie_variance_power.
|
boolean |
generateScoringHistory
If set to true, will generate scoring history for GLM.
|
boolean |
generateVariableInflationFactors
if true, will generate variable inflation factors for numerical predictors.
|
double |
gradientEpsilon
Converge if objective changes less (using L-infinity norm) than this, ONLY applies to L-BFGS solver.
|
boolean |
hglm
If set to true, will return HGLM model.
|
GLMInfluence |
influence
If set to dfbetas will calculate the difference in beta when a datarow is included and excluded in the dataset.
|
double |
initDispersionParameter
Only used for Tweedie, Gamma and Negative Binomial GLM.
|
StringPairV3[] |
interactionPairs
A list of pairwise (first order) column interactions.
|
java.lang.String[] |
interactions
A list of predictor column indices to interact.
|
boolean |
intercept
Include constant term in the model
|
double[] |
lambda
Regularization strength
|
double |
lambdaMinRatio
Minimum lambda used in lambda search, specified as a ratio of lambda_max (the smallest lambda that drives all
coefficients to zero).
|
boolean |
lambdaSearch
Use lambda search starting at lambda max, given lambda is then interpreted as lambda min
|
GLMLink |
link
Link function.
|
int |
maxActivePredictors
Maximum number of active predictors during computation.
|
float |
maxAfterBalanceSize
Maximum relative size of the training data after balancing class counts (can be less than 1.0).
|
int |
maxConfusionMatrixSize
[Deprecated] Maximum size (# classes) for confusion matrices to be printed in the Logs
|
int |
maxIterations
Maximum number of iterations
|
int |
maxIterationsDispersion
Control the maximum number of iterations in the dispersion parameter estimation loop using maximum likelihood.
|
GLMMissingValuesHandling |
missingValuesHandling
Handling of missing values.
|
int |
nlambdas
Number of lambdas to be used in a search.
|
boolean |
nonNegative
Restrict coefficients (not intercept) to be non-negative
|
double |
objectiveEpsilon
Converge if objective value changes less than this.
|
double |
objReg
Likelihood divider in objective value computation, default (of -1.0) will set it to 1/nobs
|
FrameKeyV3 |
plugValues
Plug Values (a single row frame containing values that will be used to impute missing values of the
training/validation frame, use with conjunction missing_values_handling = PlugValues)
|
double |
prior
Prior probability for y==1.
|
GLMFamily[] |
randFamily
Random Component Family array.
|
GLMLink[] |
randLink
Link function array for random component in HGLM.
|
int[] |
randomColumns
random columns indices for HGLM.
|
boolean |
removeCollinearColumns
In case of linearly dependent columns, remove some of the dependent columns
|
int |
scoreIterationInterval
Perform scoring for every score_iteration_interval iterations
|
long |
seed
Seed for pseudo random number generator (if applicable)
|
GLMSolver |
solver
AUTO will set the solver based on given data and the other parameters.
|
boolean |
standardize
Standardize numeric columns to have zero mean and unit variance
|
double[] |
startval
double array to initialize fixed and random coefficients for HGLM, coefficients for GLM.
|
double |
theta
Theta
|
double |
tweedieEpsilon
In estimating tweedie dispersion parameter using maximum likelihood, this is used to choose the lower and upper
indices in the approximating of the infinite series summation.
|
double |
tweedieLinkPower
Tweedie link power
|
double |
tweedieVariancePower
Tweedie variance power
|
aucType, categoricalEncoding, checkpoint, customDistributionFunc, customMetricFunc, distribution, exportCheckpointsDir, foldAssignment, foldColumn, gainsliftBins, huberAlpha, ignoreConstCols, ignoredColumns, keepCrossValidationFoldAssignment, keepCrossValidationModels, keepCrossValidationPredictions, maxCategoricalLevels, maxRuntimeSecs, modelId, nfolds, offsetColumn, parallelizeCrossValidation, quantileAlpha, responseColumn, scoreEachIteration, stoppingMetric, stoppingRounds, stoppingTolerance, trainingFrame, tweediePower, validationFrame, weightsColumn| Constructor and Description |
|---|
GLMParametersV3()
Public constructor
|
| Modifier and Type | Method and Description |
|---|---|
java.lang.String |
toString()
Return the contents of this object as a JSON String.
|
public long seed
public GLMFamily family
@SerializedName(value="rand_family") public GLMFamily[] randFamily
@SerializedName(value="tweedie_variance_power") public double tweedieVariancePower
@SerializedName(value="dispersion_learning_rate") public double dispersionLearningRate
@SerializedName(value="tweedie_link_power") public double tweedieLinkPower
public double theta
public GLMSolver solver
public double[] alpha
public double[] lambda
@SerializedName(value="lambda_search") public boolean lambdaSearch
@SerializedName(value="early_stopping") public boolean earlyStopping
public int nlambdas
@SerializedName(value="score_iteration_interval") public int scoreIterationInterval
public boolean standardize
@SerializedName(value="cold_start") public boolean coldStart
@SerializedName(value="missing_values_handling") public GLMMissingValuesHandling missingValuesHandling
public GLMInfluence influence
@SerializedName(value="plug_values") public FrameKeyV3 plugValues
@SerializedName(value="non_negative") public boolean nonNegative
@SerializedName(value="max_iterations") public int maxIterations
@SerializedName(value="beta_epsilon") public double betaEpsilon
@SerializedName(value="objective_epsilon") public double objectiveEpsilon
@SerializedName(value="gradient_epsilon") public double gradientEpsilon
@SerializedName(value="obj_reg") public double objReg
public GLMLink link
@SerializedName(value="dispersion_parameter_method") public GLMDispersionMethod dispersionParameterMethod
@SerializedName(value="rand_link") public GLMLink[] randLink
public double[] startval
@SerializedName(value="random_columns") public int[] randomColumns
@SerializedName(value="calc_like") public boolean calcLike
@SerializedName(value="generate_variable_inflation_factors") public boolean generateVariableInflationFactors
public boolean intercept
@SerializedName(value="build_null_model") public boolean buildNullModel
@SerializedName(value="fix_dispersion_parameter") public boolean fixDispersionParameter
@SerializedName(value="init_dispersion_parameter") public double initDispersionParameter
@SerializedName(value="HGLM") public boolean hglm
public double prior
@SerializedName(value="lambda_min_ratio") public double lambdaMinRatio
@SerializedName(value="beta_constraints") public FrameKeyV3 betaConstraints
@SerializedName(value="max_active_predictors") public int maxActivePredictors
public java.lang.String[] interactions
@SerializedName(value="interaction_pairs") public StringPairV3[] interactionPairs
@SerializedName(value="balance_classes") public boolean balanceClasses
@SerializedName(value="class_sampling_factors") public float[] classSamplingFactors
@SerializedName(value="max_after_balance_size") public float maxAfterBalanceSize
@SerializedName(value="max_confusion_matrix_size") public int maxConfusionMatrixSize
@SerializedName(value="compute_p_values") public boolean computePValues
@SerializedName(value="fix_tweedie_variance_power") public boolean fixTweedieVariancePower
@SerializedName(value="remove_collinear_columns") public boolean removeCollinearColumns
@SerializedName(value="dispersion_epsilon") public double dispersionEpsilon
@SerializedName(value="tweedie_epsilon") public double tweedieEpsilon
@SerializedName(value="max_iterations_dispersion") public int maxIterationsDispersion
@SerializedName(value="generate_scoring_history") public boolean generateScoringHistory
public java.lang.String toString()
toString in class ModelParametersSchemaV3