sparklightautoml.transformers
Basic feature generation steps and helper utils.
Base Classes
Base class for estimators from sparklightautoml.transformers.
Base class for transformers from sparklightautoml.transformers.
Transformer that changes roles for input columns.
Entity that represents a sequence of transformers in the preprocessing pipeline.
Entity that represents parallel layers (transformers) in the preprocessing pipeline.
Helper and base class for
Mixin for param inputCols: input column names.
Mixin for param inputCols: input column names.
Transformer that drops columns from the input dataframe.
Converts prediction column values from ONNX model format to LGBMCBooster format.
Converts probability column values from ONNX model format to LGBMCBooster format.
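
The difference between the sequential and the parallel entities above can be illustrated with a minimal sketch. It uses plain Python functions over dict rows rather than Spark dataframes, and the names `sequential`, `union`, and `drop_columns` are hypothetical helpers for illustration only, not the library's API.

```python
def sequential(*steps):
    """Chain steps: the output of each step is the input of the next."""
    def run(row):
        for step in steps:
            row = step(row)
        return row
    return run


def union(*branches):
    """Run every branch on the same input and merge their output columns."""
    def run(row):
        merged = {}
        for branch in branches:
            merged.update(branch(row))
        return merged
    return run


def drop_columns(*names):
    """Return a step that removes the given columns from a row."""
    return lambda row: {k: v for k, v in row.items() if k not in names}


# Two toy feature generators (hypothetical), each producing one new column.
double = lambda row: {"x2": row["x"] * 2}
square = lambda row: {"x_sq": row["x"] ** 2}

# Parallel layer first, then a sequential clean-up step.
pipeline = sequential(union(double, square), drop_columns("x_sq"))
result = pipeline({"x": 3})
```

In this sketch the parallel layer sees the same input row in every branch, while the sequential wrapper feeds each step the previous step's output.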
Numeric
Fillna with median.
Estimator that calculates nan rate for input columns and builds
Discretization of numeric features by quantiles.
Classic StandardScaler.
Transformer that replaces inf values with np.nan values in input columns.
Fillna with median.
Convert probs to logodds.
Adds columns with nan flags (0 or 1) for input columns.
Adds column with quantile bin number of input columns.
Classic StandardScaler.
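
The arithmetic behind a few of the numeric steps above (median fill, nan flags, log-odds) can be sketched in plain Python. This is not the Spark implementation: the function names are hypothetical, and the clipping constant `eps` and the fallback for an all-NaN column are assumptions made so the example is well defined.

```python
import math
from statistics import median


def fill_with_median(values):
    """Replace NaN entries with the median of the non-NaN entries."""
    observed = [v for v in values if not math.isnan(v)]
    # Assumption: fall back to 0.0 when the column has no observed values.
    fill = median(observed) if observed else 0.0
    return [fill if math.isnan(v) else v for v in values]


def nan_flags(values):
    """Return 1 where the value is NaN and 0 elsewhere."""
    return [1 if math.isnan(v) else 0 for v in values]


def to_logodds(p, eps=1e-7):
    """Map a probability to log(p / (1 - p)).

    Assumption: p is clipped to [eps, 1 - eps] so that 0 and 1 stay finite.
    """
    clipped = min(max(p, eps), 1.0 - eps)
    return math.log(clipped / (1.0 - clipped))


column = [1.0, float("nan"), 3.0, 10.0]
filled = fill_with_median(column)  # NaN replaced by the median of [1, 3, 10]
flags = nan_flags(column)          # marks the position that was NaN
```

Note that the flags are computed from the original column, before filling, since the fill step erases the information about which values were missing.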
Categorical
Spark label encoder estimator.
Spark ordinal encoder estimator.
Calculates frequency in train data and produces
Combines categorical features and fits
Spark target encoder estimator.
Spark multiclass target encoder estimator.
Simple OneHotEncoder over label encoded categories.
Simple Spark version of LabelEncoder.
Spark version of
Labels are encoded with frequency in train data.
Combines category columns and encodes them with a label encoder.
Spark multiclass target encoder transformer.
Helper class for
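
As a rough illustration of the fit/transform split used by the label and frequency encoders above, the following sketch fits a mapping on training values and applies it to new values. It is plain Python, not the Spark estimators; the helper names are hypothetical, and both the code ordering (most frequent first) and the reserved code `0` for unseen categories are assumptions.

```python
from collections import Counter


def fit_label_encoder(train_values):
    """Assign integer codes 1..K to categories seen in the training data.

    Assumption: codes are ordered by descending frequency (ties broken
    alphabetically) and 0 is reserved for categories not seen during fit.
    """
    counts = Counter(train_values)
    ordered = sorted(counts, key=lambda c: (-counts[c], c))
    return {cat: code for code, cat in enumerate(ordered, start=1)}


def fit_freq_encoder(train_values):
    """Map each category to its number of occurrences in the training data."""
    return dict(Counter(train_values))


def encode(values, mapping, default=0):
    """Apply a fitted mapping; unseen categories get the default code."""
    return [mapping.get(v, default) for v in values]


train = ["a", "b", "a", "c", "a", "b"]
labels = fit_label_encoder(train)
freqs = fit_freq_encoder(train)

encoded_labels = encode(["a", "c", "z"], labels)  # "z" was not seen in train
encoded_freqs = encode(["a", "c", "z"], freqs)
```

Keeping the fitted mapping separate from the function that applies it mirrors the estimator/transformer pairing in the list above: the estimator learns the mapping, the transformer only looks values up.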
Categorical (Scala)
Custom implementation of PySpark StringIndexer wrapper
Model fitted by
Datetime
Transforms datetime column values to numeric values.
Basic conversion strategy, used in selection one-to-one transformers.
Extracts unit of time from Datetime values and marks holiday dates.
Helper class for
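
The two datetime ideas above (conversion to a numeric value, and extraction of time units with a holiday mark) can be sketched with the standard library. This is only an illustration: `BASE_DATE`, `HOLIDAYS`, and both function names are hypothetical, and the choice of seconds as the numeric unit is an assumption.

```python
from datetime import datetime

# Assumption: numeric values are seconds elapsed since a fixed base date.
BASE_DATE = datetime(2020, 1, 1)

# Hypothetical holiday calendar, stored as (month, day) pairs.
HOLIDAYS = {(1, 1), (12, 25)}


def to_numeric(ts, base=BASE_DATE):
    """Convert a datetime to the number of seconds since the base date."""
    return (ts - base).total_seconds()


def date_parts(ts):
    """Extract units of time and a 0/1 holiday flag from a datetime."""
    return {
        "year": ts.year,
        "month": ts.month,
        "weekday": ts.weekday(),  # Monday is 0, Sunday is 6
        "hour": ts.hour,
        "is_holiday": int((ts.month, ts.day) in HOLIDAYS),
    }


ts = datetime(2020, 12, 25, 8, 30)
parts = date_parts(ts)
elapsed = to_numeric(datetime(2020, 1, 2))  # exactly one day after the base
```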