[
https://issues.apache.org/jira/browse/IGNITE-9145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16696042#comment-16696042
]
ASF GitHub Bot commented on IGNITE-9145:
----------------------------------------
GitHub user zaleslaw opened a pull request:
https://github.com/apache/ignite/pull/5481
IGNITE-9145: Added EncodingSortingStrategy
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/gridgain/apache-ignite ignite-9145
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/ignite/pull/5481.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #5481
----
commit b90b1de2129ffec9741386cc9a1e9c23e199461e
Author: Zinoviev Alexey <zaleslaw.sin@...>
Date: 2018-11-21T17:22:08Z
IGNITE-9145: Added EncodingSortingStrategy
----
> [ML] Add different strategies to index labels in StringEncoderTrainer
> ---------------------------------------------------------------------
>
> Key: IGNITE-9145
> URL: https://issues.apache.org/jira/browse/IGNITE-9145
> Project: Ignite
> Issue Type: Improvement
> Components: ml
> Reporter: Aleksey Zinoviev
> Assignee: Aleksey Zinoviev
> Priority: Major
>
> The main idea to add a few strategies of indexing: sorting and so on.
> Currently it supports only one strategy (most popular with zero and less
> popular with the max index size).
> There are can be a few options
> * 'frequencyDesc': descending order by label frequency (most frequent label
> assigned 0)
> * 'frequencyAsc': ascending order by label frequency (least frequent label
> assigned 0)
>
> Please, update the method **transformFrequenciesToEncodingValues and add the
> strategy as a parameter of trainer.
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)