Aleksey Zinoviev created IGNITE-12079:
-----------------------------------------

             Summary: [ML][Umbrella] Add advanced preprocessing techniques
                 Key: IGNITE-12079
                 URL: https://issues.apache.org/jira/browse/IGNITE-12079
             Project: Ignite
          Issue Type: New Feature
          Components: ml
    Affects Versions: 2.8
            Reporter: Aleksey Zinoviev
            Assignee: Aleksey Zinoviev
             Fix For: 2.8


*Main goal:*

Reduce the gap between Apache Spark and Apache Ignite in preprocessing 
operations. Closing this gap would make it easier to load Spark ML Pipelines 
into Ignite ML.

 

Next steps:
 # Add Frequency Encoder
 # Add additional Imputing Strategies (MIN, MAX, COUNT, MOST_FREQUENT, LEAST_FREQUENT)
 # Add RobustScaler (will be added in Spark 3.0)
 # Add CountVectorizer
 # Add FeatureHasher
 # Add QuantileDiscretizer
 # Add Locality Sensitive Hashing (LSH)
 # Add LabelEncoder
 # Add RevertStringIndexing
 # Add multi-column preprocessor
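
To illustrate the first item in the list above, here is a minimal sketch of what frequency encoding computes: each categorical value is replaced by the share of rows in which it occurs. This is plain Java and does not use the Ignite ML API; the class name FrequencyEncoderSketch, the fit/transform method names, and the choice to map unseen values to 0.0 are assumptions made for illustration, not the planned design.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not part of Ignite ML.
class FrequencyEncoderSketch {
    /** Learns the relative frequency of each distinct value in a column. */
    static Map<String, Double> fit(List<String> column) {
        Map<String, Double> freq = new HashMap<>();

        // First pass: count occurrences of each category.
        for (String v : column)
            freq.merge(v, 1.0, Double::sum);

        // Second pass: turn counts into shares of the total row count.
        int n = column.size();
        freq.replaceAll((k, cnt) -> cnt / n);

        return freq;
    }

    /** Replaces each value with its learned frequency (assumption: unseen values map to 0.0). */
    static double[] transform(Map<String, Double> freq, List<String> column) {
        double[] out = new double[column.size()];

        for (int i = 0; i < out.length; i++)
            out[i] = freq.getOrDefault(column.get(i), 0.0);

        return out;
    }

    public static void main(String[] args) {
        List<String> city = List.of("a", "b", "a", "c");
        Map<String, Double> model = fit(city);

        System.out.println(Arrays.toString(transform(model, city)));
    }
}
```

A real preprocessor would need to compute the counts over distributed partitions and merge them, which this single-JVM sketch leaves out.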



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)
