public final class DecisionTreeClassifier extends ProbabilisticClassifier<Vector,DecisionTreeClassifier,DecisionTreeClassificationModel>
Decision tree learning algorithm
for classification.
It supports both binary and multiclass labels, as well as both continuous and categorical
features.| Constructor and Description |
|---|
DecisionTreeClassifier() |
DecisionTreeClassifier(java.lang.String uid) |
| Modifier and Type | Method and Description |
|---|---|
DecisionTreeClassifier |
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params.
|
DecisionTreeClassifier |
setCacheNodeIds(boolean value) |
DecisionTreeClassifier |
setCheckpointInterval(int value) |
DecisionTreeClassifier |
setImpurity(java.lang.String value) |
DecisionTreeClassifier |
setMaxBins(int value) |
DecisionTreeClassifier |
setMaxDepth(int value) |
DecisionTreeClassifier |
setMaxMemoryInMB(int value) |
DecisionTreeClassifier |
setMinInfoGain(double value) |
DecisionTreeClassifier |
setMinInstancesPerNode(int value) |
static java.lang.String[] |
supportedImpurities()
Accessor for supported impurities: entropy, gini
|
protected DecisionTreeClassificationModel |
train(DataFrame dataset)
Train a model using the given dataset and parameters.
|
java.lang.String |
uid()
An immutable unique ID for the object and its derivatives.
|
StructType |
validateAndTransformSchema(StructType schema,
boolean fitting,
DataType featuresDataType) |
StructType |
validateAndTransformSchema(StructType schema,
boolean fitting,
DataType featuresDataType)
Validates and transforms the input schema with the provided param map.
|
setProbabilityCol, setThresholdssetRawPredictionColextractLabeledPoints, fit, setFeaturesCol, setLabelCol, setPredictionCol, transformSchematransformSchemaclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitclear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn, validateParamstoStringinitializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarningpublic DecisionTreeClassifier(java.lang.String uid)
public DecisionTreeClassifier()
public static final java.lang.String[] supportedImpurities()
public java.lang.String uid()
Identifiablepublic DecisionTreeClassifier setMaxDepth(int value)
public DecisionTreeClassifier setMaxBins(int value)
public DecisionTreeClassifier setMinInstancesPerNode(int value)
public DecisionTreeClassifier setMinInfoGain(double value)
public DecisionTreeClassifier setMaxMemoryInMB(int value)
public DecisionTreeClassifier setCacheNodeIds(boolean value)
public DecisionTreeClassifier setCheckpointInterval(int value)
public DecisionTreeClassifier setImpurity(java.lang.String value)
protected DecisionTreeClassificationModel train(DataFrame dataset)
Predictorfit() to avoid dealing with schema validation
and copying parameters into the model.
train in class Predictor<Vector,DecisionTreeClassifier,DecisionTreeClassificationModel>dataset - Training datasetpublic DecisionTreeClassifier copy(ParamMap extra)
Paramscopy in interface Paramscopy in class Predictor<Vector,DecisionTreeClassifier,DecisionTreeClassificationModel>extra - (undocumented)defaultCopy()public StructType validateAndTransformSchema(StructType schema, boolean fitting, DataType featuresDataType)
public StructType validateAndTransformSchema(StructType schema, boolean fitting, DataType featuresDataType)
schema - input schemafitting - whether this is in fittingfeaturesDataType - SQL DataType for FeaturesType.
E.g., VectorUDT for vector features.