[ https://issues.apache.org/jira/browse/IGNITE-9145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16698800#comment-16698800 ]
ASF GitHub Bot commented on IGNITE-9145: ---------------------------------------- Github user asfgit closed the pull request at: https://github.com/apache/ignite/pull/5481 > [ML] Add different strategies to index labels in StringEncoderTrainer > --------------------------------------------------------------------- > > Key: IGNITE-9145 > URL: https://issues.apache.org/jira/browse/IGNITE-9145 > Project: Ignite > Issue Type: Improvement > Components: ml > Reporter: Aleksey Zinoviev > Assignee: Aleksey Zinoviev > Priority: Major > Fix For: 2.8 > > > The main idea to add a few strategies of indexing: sorting and so on. > Currently it supports only one strategy (most popular with zero and less > popular with the max index size). > There are can be a few options > * 'frequencyDesc': descending order by label frequency (most frequent label > assigned 0) > * 'frequencyAsc': ascending order by label frequency (least frequent label > assigned 0) > > Please, update the method **transformFrequenciesToEncodingValues and add the > strategy as a parameter of trainer. > -- This message was sent by Atlassian JIRA (v7.6.3#76005)