@Generated(value="com.amazonaws:aws-java-sdk-code-generator") public class TabularJobConfig extends Object implements Serializable, Cloneable, StructuredPojo
The collection of settings used by an AutoML job V2 for the tabular problem type.
Constructor and Description |
---|
TabularJobConfig() |
Modifier and Type | Method and Description |
---|---|
TabularJobConfig |
clone() |
boolean |
equals(Object obj) |
CandidateGenerationConfig |
getCandidateGenerationConfig()
The configuration information of how model candidates are generated.
|
AutoMLJobCompletionCriteria |
getCompletionCriteria() |
String |
getFeatureSpecificationS3Uri()
A URL to the Amazon S3 data source containing selected features from the input data source to run an Autopilot
job V2.
|
Boolean |
getGenerateCandidateDefinitionsOnly()
Generates possible candidates without training the models.
|
String |
getMode()
The method that Autopilot uses to train the data.
|
String |
getProblemType()
The type of supervised learning problem available for the model candidates of the AutoML job V2.
|
String |
getSampleWeightAttributeName()
If specified, this column name indicates which column of the dataset should be treated as sample weights for use
by the objective metric during the training, evaluation, and the selection of the best model.
|
String |
getTargetAttributeName()
The name of the target variable in supervised learning, usually represented by 'y'.
|
int |
hashCode() |
Boolean |
isGenerateCandidateDefinitionsOnly()
Generates possible candidates without training the models.
|
void |
marshall(ProtocolMarshaller protocolMarshaller)
Marshalls this structured data using the given
ProtocolMarshaller . |
void |
setCandidateGenerationConfig(CandidateGenerationConfig candidateGenerationConfig)
The configuration information of how model candidates are generated.
|
void |
setCompletionCriteria(AutoMLJobCompletionCriteria completionCriteria) |
void |
setFeatureSpecificationS3Uri(String featureSpecificationS3Uri)
A URL to the Amazon S3 data source containing selected features from the input data source to run an Autopilot
job V2.
|
void |
setGenerateCandidateDefinitionsOnly(Boolean generateCandidateDefinitionsOnly)
Generates possible candidates without training the models.
|
void |
setMode(String mode)
The method that Autopilot uses to train the data.
|
void |
setProblemType(String problemType)
The type of supervised learning problem available for the model candidates of the AutoML job V2.
|
void |
setSampleWeightAttributeName(String sampleWeightAttributeName)
If specified, this column name indicates which column of the dataset should be treated as sample weights for use
by the objective metric during the training, evaluation, and the selection of the best model.
|
void |
setTargetAttributeName(String targetAttributeName)
The name of the target variable in supervised learning, usually represented by 'y'.
|
String |
toString()
Returns a string representation of this object.
|
TabularJobConfig |
withCandidateGenerationConfig(CandidateGenerationConfig candidateGenerationConfig)
The configuration information of how model candidates are generated.
|
TabularJobConfig |
withCompletionCriteria(AutoMLJobCompletionCriteria completionCriteria) |
TabularJobConfig |
withFeatureSpecificationS3Uri(String featureSpecificationS3Uri)
A URL to the Amazon S3 data source containing selected features from the input data source to run an Autopilot
job V2.
|
TabularJobConfig |
withGenerateCandidateDefinitionsOnly(Boolean generateCandidateDefinitionsOnly)
Generates possible candidates without training the models.
|
TabularJobConfig |
withMode(AutoMLMode mode)
The method that Autopilot uses to train the data.
|
TabularJobConfig |
withMode(String mode)
The method that Autopilot uses to train the data.
|
TabularJobConfig |
withProblemType(ProblemType problemType)
The type of supervised learning problem available for the model candidates of the AutoML job V2.
|
TabularJobConfig |
withProblemType(String problemType)
The type of supervised learning problem available for the model candidates of the AutoML job V2.
|
TabularJobConfig |
withSampleWeightAttributeName(String sampleWeightAttributeName)
If specified, this column name indicates which column of the dataset should be treated as sample weights for use
by the objective metric during the training, evaluation, and the selection of the best model.
|
TabularJobConfig |
withTargetAttributeName(String targetAttributeName)
The name of the target variable in supervised learning, usually represented by 'y'.
|
public void setCandidateGenerationConfig(CandidateGenerationConfig candidateGenerationConfig)
The configuration information of how model candidates are generated.
candidateGenerationConfig
- The configuration information of how model candidates are generated.public CandidateGenerationConfig getCandidateGenerationConfig()
The configuration information of how model candidates are generated.
public TabularJobConfig withCandidateGenerationConfig(CandidateGenerationConfig candidateGenerationConfig)
The configuration information of how model candidates are generated.
candidateGenerationConfig
- The configuration information of how model candidates are generated.public void setCompletionCriteria(AutoMLJobCompletionCriteria completionCriteria)
completionCriteria
- public AutoMLJobCompletionCriteria getCompletionCriteria()
public TabularJobConfig withCompletionCriteria(AutoMLJobCompletionCriteria completionCriteria)
completionCriteria
- public void setFeatureSpecificationS3Uri(String featureSpecificationS3Uri)
A URL to the Amazon S3 data source containing selected features from the input data source to run an Autopilot
job V2. You can input FeatureAttributeNames
(optional) in JSON format as shown below:
{ "FeatureAttributeNames":["col1", "col2", ...] }
.
You can also specify the data type of the feature (optional) in the format shown below:
{ "FeatureDataTypes":{"col1":"numeric", "col2":"categorical" ... } }
These column keys may not include the target column.
In ensembling mode, Autopilot only supports the following data types: numeric
,
categorical
, text
, and datetime
. In HPO mode, Autopilot can support
numeric
, categorical
, text
, datetime
, and
sequence
.
If only FeatureDataTypes
is provided, the column keys (col1
, col2
,..)
should be a subset of the column names in the input data.
If both FeatureDataTypes
and FeatureAttributeNames
are provided, then the column keys
should be a subset of the column names provided in FeatureAttributeNames
.
The key name FeatureAttributeNames
is fixed. The values listed in ["col1", "col2", ...]
are case sensitive and should be a list of strings containing unique values that are a subset of the column names
in the input data. The list of columns provided must not include the target column.
featureSpecificationS3Uri
- A URL to the Amazon S3 data source containing selected features from the input data source to run an
Autopilot job V2. You can input FeatureAttributeNames
(optional) in JSON format as shown
below:
{ "FeatureAttributeNames":["col1", "col2", ...] }
.
You can also specify the data type of the feature (optional) in the format shown below:
{ "FeatureDataTypes":{"col1":"numeric", "col2":"categorical" ... } }
These column keys may not include the target column.
In ensembling mode, Autopilot only supports the following data types: numeric
,
categorical
, text
, and datetime
. In HPO mode, Autopilot can support
numeric
, categorical
, text
, datetime
, and
sequence
.
If only FeatureDataTypes
is provided, the column keys (col1
, col2
,..) should be a subset of the column names in the input data.
If both FeatureDataTypes
and FeatureAttributeNames
are provided, then the column
keys should be a subset of the column names provided in FeatureAttributeNames
.
The key name FeatureAttributeNames
is fixed. The values listed in
["col1", "col2", ...]
are case sensitive and should be a list of strings containing unique
values that are a subset of the column names in the input data. The list of columns provided must not
include the target column.
public String getFeatureSpecificationS3Uri()
A URL to the Amazon S3 data source containing selected features from the input data source to run an Autopilot
job V2. You can input FeatureAttributeNames
(optional) in JSON format as shown below:
{ "FeatureAttributeNames":["col1", "col2", ...] }
.
You can also specify the data type of the feature (optional) in the format shown below:
{ "FeatureDataTypes":{"col1":"numeric", "col2":"categorical" ... } }
These column keys may not include the target column.
In ensembling mode, Autopilot only supports the following data types: numeric
,
categorical
, text
, and datetime
. In HPO mode, Autopilot can support
numeric
, categorical
, text
, datetime
, and
sequence
.
If only FeatureDataTypes
is provided, the column keys (col1
, col2
,..)
should be a subset of the column names in the input data.
If both FeatureDataTypes
and FeatureAttributeNames
are provided, then the column keys
should be a subset of the column names provided in FeatureAttributeNames
.
The key name FeatureAttributeNames
is fixed. The values listed in ["col1", "col2", ...]
are case sensitive and should be a list of strings containing unique values that are a subset of the column names
in the input data. The list of columns provided must not include the target column.
FeatureAttributeNames
(optional) in JSON format as shown
below:
{ "FeatureAttributeNames":["col1", "col2", ...] }
.
You can also specify the data type of the feature (optional) in the format shown below:
{ "FeatureDataTypes":{"col1":"numeric", "col2":"categorical" ... } }
These column keys may not include the target column.
In ensembling mode, Autopilot only supports the following data types: numeric
,
categorical
, text
, and datetime
. In HPO mode, Autopilot can
support numeric
, categorical
, text
, datetime
, and
sequence
.
If only FeatureDataTypes
is provided, the column keys (col1
, col2
,..) should be a subset of the column names in the input data.
If both FeatureDataTypes
and FeatureAttributeNames
are provided, then the
column keys should be a subset of the column names provided in FeatureAttributeNames
.
The key name FeatureAttributeNames
is fixed. The values listed in
["col1", "col2", ...]
are case sensitive and should be a list of strings containing unique
values that are a subset of the column names in the input data. The list of columns provided must not
include the target column.
public TabularJobConfig withFeatureSpecificationS3Uri(String featureSpecificationS3Uri)
A URL to the Amazon S3 data source containing selected features from the input data source to run an Autopilot
job V2. You can input FeatureAttributeNames
(optional) in JSON format as shown below:
{ "FeatureAttributeNames":["col1", "col2", ...] }
.
You can also specify the data type of the feature (optional) in the format shown below:
{ "FeatureDataTypes":{"col1":"numeric", "col2":"categorical" ... } }
These column keys may not include the target column.
In ensembling mode, Autopilot only supports the following data types: numeric
,
categorical
, text
, and datetime
. In HPO mode, Autopilot can support
numeric
, categorical
, text
, datetime
, and
sequence
.
If only FeatureDataTypes
is provided, the column keys (col1
, col2
,..)
should be a subset of the column names in the input data.
If both FeatureDataTypes
and FeatureAttributeNames
are provided, then the column keys
should be a subset of the column names provided in FeatureAttributeNames
.
The key name FeatureAttributeNames
is fixed. The values listed in ["col1", "col2", ...]
are case sensitive and should be a list of strings containing unique values that are a subset of the column names
in the input data. The list of columns provided must not include the target column.
featureSpecificationS3Uri
- A URL to the Amazon S3 data source containing selected features from the input data source to run an
Autopilot job V2. You can input FeatureAttributeNames
(optional) in JSON format as shown
below:
{ "FeatureAttributeNames":["col1", "col2", ...] }
.
You can also specify the data type of the feature (optional) in the format shown below:
{ "FeatureDataTypes":{"col1":"numeric", "col2":"categorical" ... } }
These column keys may not include the target column.
In ensembling mode, Autopilot only supports the following data types: numeric
,
categorical
, text
, and datetime
. In HPO mode, Autopilot can support
numeric
, categorical
, text
, datetime
, and
sequence
.
If only FeatureDataTypes
is provided, the column keys (col1
, col2
,..) should be a subset of the column names in the input data.
If both FeatureDataTypes
and FeatureAttributeNames
are provided, then the column
keys should be a subset of the column names provided in FeatureAttributeNames
.
The key name FeatureAttributeNames
is fixed. The values listed in
["col1", "col2", ...]
are case sensitive and should be a list of strings containing unique
values that are a subset of the column names in the input data. The list of columns provided must not
include the target column.
public void setMode(String mode)
The method that Autopilot uses to train the data. You can either specify the mode manually or let Autopilot
choose for you based on the dataset size by selecting AUTO
. In AUTO
mode, Autopilot
chooses ENSEMBLING
for datasets smaller than 100 MB, and HYPERPARAMETER_TUNING
for
larger ones.
The ENSEMBLING
mode uses a multi-stack ensemble model to predict classification and regression tasks
directly from your dataset. This machine learning mode combines several base models to produce an optimal
predictive model. It then uses a stacking ensemble method to combine predictions from contributing members. A
multi-stack ensemble model can provide better performance over a single model by combining the predictive
capabilities of multiple models. See Autopilot algorithm support for a list of algorithms supported by ENSEMBLING
mode.
The HYPERPARAMETER_TUNING
(HPO) mode uses the best hyperparameters to train the best version of a
model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO finds the best
hyperparameters according to your objective metric. See Autopilot algorithm support for a list of algorithms supported by HYPERPARAMETER_TUNING
mode.
mode
- The method that Autopilot uses to train the data. You can either specify the mode manually or let
Autopilot choose for you based on the dataset size by selecting AUTO
. In AUTO
mode, Autopilot chooses ENSEMBLING
for datasets smaller than 100 MB, and
HYPERPARAMETER_TUNING
for larger ones.
The ENSEMBLING
mode uses a multi-stack ensemble model to predict classification and
regression tasks directly from your dataset. This machine learning mode combines several base models to
produce an optimal predictive model. It then uses a stacking ensemble method to combine predictions from
contributing members. A multi-stack ensemble model can provide better performance over a single model by
combining the predictive capabilities of multiple models. See Autopilot algorithm support for a list of algorithms supported by ENSEMBLING
mode.
The HYPERPARAMETER_TUNING
(HPO) mode uses the best hyperparameters to train the best version
of a model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO
finds the best hyperparameters according to your objective metric. See Autopilot algorithm support for a list of algorithms supported by HYPERPARAMETER_TUNING
mode.
AutoMLMode
public String getMode()
The method that Autopilot uses to train the data. You can either specify the mode manually or let Autopilot
choose for you based on the dataset size by selecting AUTO
. In AUTO
mode, Autopilot
chooses ENSEMBLING
for datasets smaller than 100 MB, and HYPERPARAMETER_TUNING
for
larger ones.
The ENSEMBLING
mode uses a multi-stack ensemble model to predict classification and regression tasks
directly from your dataset. This machine learning mode combines several base models to produce an optimal
predictive model. It then uses a stacking ensemble method to combine predictions from contributing members. A
multi-stack ensemble model can provide better performance over a single model by combining the predictive
capabilities of multiple models. See Autopilot algorithm support for a list of algorithms supported by ENSEMBLING
mode.
The HYPERPARAMETER_TUNING
(HPO) mode uses the best hyperparameters to train the best version of a
model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO finds the best
hyperparameters according to your objective metric. See Autopilot algorithm support for a list of algorithms supported by HYPERPARAMETER_TUNING
mode.
AUTO
. In AUTO
mode, Autopilot chooses ENSEMBLING
for datasets smaller than 100 MB, and
HYPERPARAMETER_TUNING
for larger ones.
The ENSEMBLING
mode uses a multi-stack ensemble model to predict classification and
regression tasks directly from your dataset. This machine learning mode combines several base models to
produce an optimal predictive model. It then uses a stacking ensemble method to combine predictions from
contributing members. A multi-stack ensemble model can provide better performance over a single model by
combining the predictive capabilities of multiple models. See Autopilot algorithm support for a list of algorithms supported by ENSEMBLING
mode.
The HYPERPARAMETER_TUNING
(HPO) mode uses the best hyperparameters to train the best version
of a model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO
finds the best hyperparameters according to your objective metric. See Autopilot algorithm support for a list of algorithms supported by HYPERPARAMETER_TUNING
mode.
AutoMLMode
public TabularJobConfig withMode(String mode)
The method that Autopilot uses to train the data. You can either specify the mode manually or let Autopilot
choose for you based on the dataset size by selecting AUTO
. In AUTO
mode, Autopilot
chooses ENSEMBLING
for datasets smaller than 100 MB, and HYPERPARAMETER_TUNING
for
larger ones.
The ENSEMBLING
mode uses a multi-stack ensemble model to predict classification and regression tasks
directly from your dataset. This machine learning mode combines several base models to produce an optimal
predictive model. It then uses a stacking ensemble method to combine predictions from contributing members. A
multi-stack ensemble model can provide better performance over a single model by combining the predictive
capabilities of multiple models. See Autopilot algorithm support for a list of algorithms supported by ENSEMBLING
mode.
The HYPERPARAMETER_TUNING
(HPO) mode uses the best hyperparameters to train the best version of a
model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO finds the best
hyperparameters according to your objective metric. See Autopilot algorithm support for a list of algorithms supported by HYPERPARAMETER_TUNING
mode.
mode
- The method that Autopilot uses to train the data. You can either specify the mode manually or let
Autopilot choose for you based on the dataset size by selecting AUTO
. In AUTO
mode, Autopilot chooses ENSEMBLING
for datasets smaller than 100 MB, and
HYPERPARAMETER_TUNING
for larger ones.
The ENSEMBLING
mode uses a multi-stack ensemble model to predict classification and
regression tasks directly from your dataset. This machine learning mode combines several base models to
produce an optimal predictive model. It then uses a stacking ensemble method to combine predictions from
contributing members. A multi-stack ensemble model can provide better performance over a single model by
combining the predictive capabilities of multiple models. See Autopilot algorithm support for a list of algorithms supported by ENSEMBLING
mode.
The HYPERPARAMETER_TUNING
(HPO) mode uses the best hyperparameters to train the best version
of a model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO
finds the best hyperparameters according to your objective metric. See Autopilot algorithm support for a list of algorithms supported by HYPERPARAMETER_TUNING
mode.
AutoMLMode
public TabularJobConfig withMode(AutoMLMode mode)
The method that Autopilot uses to train the data. You can either specify the mode manually or let Autopilot
choose for you based on the dataset size by selecting AUTO
. In AUTO
mode, Autopilot
chooses ENSEMBLING
for datasets smaller than 100 MB, and HYPERPARAMETER_TUNING
for
larger ones.
The ENSEMBLING
mode uses a multi-stack ensemble model to predict classification and regression tasks
directly from your dataset. This machine learning mode combines several base models to produce an optimal
predictive model. It then uses a stacking ensemble method to combine predictions from contributing members. A
multi-stack ensemble model can provide better performance over a single model by combining the predictive
capabilities of multiple models. See Autopilot algorithm support for a list of algorithms supported by ENSEMBLING
mode.
The HYPERPARAMETER_TUNING
(HPO) mode uses the best hyperparameters to train the best version of a
model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO finds the best
hyperparameters according to your objective metric. See Autopilot algorithm support for a list of algorithms supported by HYPERPARAMETER_TUNING
mode.
mode
- The method that Autopilot uses to train the data. You can either specify the mode manually or let
Autopilot choose for you based on the dataset size by selecting AUTO
. In AUTO
mode, Autopilot chooses ENSEMBLING
for datasets smaller than 100 MB, and
HYPERPARAMETER_TUNING
for larger ones.
The ENSEMBLING
mode uses a multi-stack ensemble model to predict classification and
regression tasks directly from your dataset. This machine learning mode combines several base models to
produce an optimal predictive model. It then uses a stacking ensemble method to combine predictions from
contributing members. A multi-stack ensemble model can provide better performance over a single model by
combining the predictive capabilities of multiple models. See Autopilot algorithm support for a list of algorithms supported by ENSEMBLING
mode.
The HYPERPARAMETER_TUNING
(HPO) mode uses the best hyperparameters to train the best version
of a model. HPO automatically selects an algorithm for the type of problem you want to solve. Then HPO
finds the best hyperparameters according to your objective metric. See Autopilot algorithm support for a list of algorithms supported by HYPERPARAMETER_TUNING
mode.
AutoMLMode
public void setGenerateCandidateDefinitionsOnly(Boolean generateCandidateDefinitionsOnly)
Generates possible candidates without training the models. A model candidate is a combination of data preprocessors, algorithms, and algorithm parameter settings.
generateCandidateDefinitionsOnly
- Generates possible candidates without training the models. A model candidate is a combination of data
preprocessors, algorithms, and algorithm parameter settings.public Boolean getGenerateCandidateDefinitionsOnly()
Generates possible candidates without training the models. A model candidate is a combination of data preprocessors, algorithms, and algorithm parameter settings.
public TabularJobConfig withGenerateCandidateDefinitionsOnly(Boolean generateCandidateDefinitionsOnly)
Generates possible candidates without training the models. A model candidate is a combination of data preprocessors, algorithms, and algorithm parameter settings.
generateCandidateDefinitionsOnly
- Generates possible candidates without training the models. A model candidate is a combination of data
preprocessors, algorithms, and algorithm parameter settings.public Boolean isGenerateCandidateDefinitionsOnly()
Generates possible candidates without training the models. A model candidate is a combination of data preprocessors, algorithms, and algorithm parameter settings.
public void setProblemType(String problemType)
The type of supervised learning problem available for the model candidates of the AutoML job V2. For more information, see SageMaker Autopilot problem types.
You must either specify the type of supervised learning problem in ProblemType
and provide the AutoMLJobObjective metric, or none at all.
problemType
- The type of supervised learning problem available for the model candidates of the AutoML job V2. For more
information, see SageMaker Autopilot problem types.
You must either specify the type of supervised learning problem in ProblemType
and provide
the AutoMLJobObjective metric, or none at all.
ProblemType
public String getProblemType()
The type of supervised learning problem available for the model candidates of the AutoML job V2. For more information, see SageMaker Autopilot problem types.
You must either specify the type of supervised learning problem in ProblemType
and provide the AutoMLJobObjective metric, or none at all.
You must either specify the type of supervised learning problem in ProblemType
and provide
the AutoMLJobObjective metric, or none at all.
ProblemType
public TabularJobConfig withProblemType(String problemType)
The type of supervised learning problem available for the model candidates of the AutoML job V2. For more information, see SageMaker Autopilot problem types.
You must either specify the type of supervised learning problem in ProblemType
and provide the AutoMLJobObjective metric, or none at all.
problemType
- The type of supervised learning problem available for the model candidates of the AutoML job V2. For more
information, see SageMaker Autopilot problem types.
You must either specify the type of supervised learning problem in ProblemType
and provide
the AutoMLJobObjective metric, or none at all.
ProblemType
public TabularJobConfig withProblemType(ProblemType problemType)
The type of supervised learning problem available for the model candidates of the AutoML job V2. For more information, see SageMaker Autopilot problem types.
You must either specify the type of supervised learning problem in ProblemType
and provide the AutoMLJobObjective metric, or none at all.
problemType
- The type of supervised learning problem available for the model candidates of the AutoML job V2. For more
information, see SageMaker Autopilot problem types.
You must either specify the type of supervised learning problem in ProblemType
and provide
the AutoMLJobObjective metric, or none at all.
ProblemType
public void setTargetAttributeName(String targetAttributeName)
The name of the target variable in supervised learning, usually represented by 'y'.
targetAttributeName
- The name of the target variable in supervised learning, usually represented by 'y'.public String getTargetAttributeName()
The name of the target variable in supervised learning, usually represented by 'y'.
public TabularJobConfig withTargetAttributeName(String targetAttributeName)
The name of the target variable in supervised learning, usually represented by 'y'.
targetAttributeName
- The name of the target variable in supervised learning, usually represented by 'y'.public void setSampleWeightAttributeName(String sampleWeightAttributeName)
If specified, this column name indicates which column of the dataset should be treated as sample weights for use by the objective metric during the training, evaluation, and the selection of the best model. This column is not considered as a predictive feature. For more information on Autopilot metrics, see Metrics and validation.
Sample weights should be numeric, non-negative, with larger values indicating which rows are more important than others. Data points that have invalid or no weight value are excluded.
Support for sample weights is available in Ensembling mode only.
sampleWeightAttributeName
- If specified, this column name indicates which column of the dataset should be treated as sample weights
for use by the objective metric during the training, evaluation, and the selection of the best model. This
column is not considered as a predictive feature. For more information on Autopilot metrics, see Metrics and
validation.
Sample weights should be numeric, non-negative, with larger values indicating which rows are more important than others. Data points that have invalid or no weight value are excluded.
Support for sample weights is available in Ensembling mode only.
public String getSampleWeightAttributeName()
If specified, this column name indicates which column of the dataset should be treated as sample weights for use by the objective metric during the training, evaluation, and the selection of the best model. This column is not considered as a predictive feature. For more information on Autopilot metrics, see Metrics and validation.
Sample weights should be numeric, non-negative, with larger values indicating which rows are more important than others. Data points that have invalid or no weight value are excluded.
Support for sample weights is available in Ensembling mode only.
Sample weights should be numeric, non-negative, with larger values indicating which rows are more important than others. Data points that have invalid or no weight value are excluded.
Support for sample weights is available in Ensembling mode only.
public TabularJobConfig withSampleWeightAttributeName(String sampleWeightAttributeName)
If specified, this column name indicates which column of the dataset should be treated as sample weights for use by the objective metric during the training, evaluation, and the selection of the best model. This column is not considered as a predictive feature. For more information on Autopilot metrics, see Metrics and validation.
Sample weights should be numeric, non-negative, with larger values indicating which rows are more important than others. Data points that have invalid or no weight value are excluded.
Support for sample weights is available in Ensembling mode only.
sampleWeightAttributeName
- If specified, this column name indicates which column of the dataset should be treated as sample weights
for use by the objective metric during the training, evaluation, and the selection of the best model. This
column is not considered as a predictive feature. For more information on Autopilot metrics, see Metrics and
validation.
Sample weights should be numeric, non-negative, with larger values indicating which rows are more important than others. Data points that have invalid or no weight value are excluded.
Support for sample weights is available in Ensembling mode only.
public String toString()
toString
in class Object
Object.toString()
public TabularJobConfig clone()
public void marshall(ProtocolMarshaller protocolMarshaller)
StructuredPojo
ProtocolMarshaller
.marshall
in interface StructuredPojo
protocolMarshaller
- Implementation of ProtocolMarshaller
used to marshall this object's data.