Prefer batch computations operating on datasets as a whole for efficiency
Prefer batch computations operating on datasets as a whole for efficiency
Prefer batch computations operating on datasets as a whole for efficiency
Prefer batch computations operating on datasets as a whole for efficiency
Prefer batch computations operating on datasets as a whole for efficiency
Prefer batch computations operating on datasets as a whole for efficiency
Classification model trained by CatBoost. Use CatBoostClassifier to train it
Serialization
Supports standard Spark MLLib serialization. Data can be saved to distributed filesystem like HDFS or local files. When saved to
pathtwo files are created: -<path>/metadatawhich contains Spark-specific metadata in JSON format -<path>/modelwhich contains model in usual CatBoost format which can be read using other local CatBoost APIs (if stored in a distributed filesystem it has to be copied to the local filesystem first).Load model
Save model