[jira] [Updated] (IGNITE-11655) [ML]: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Zinoviev updated IGNITE-11655: - Summary: [ML]: OneHotEncoder returns more columns than expected (was: ML: OneHotEncoder returns more columns than expected) > [ML]: OneHotEncoder returns more columns than expected > -- > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Assignee: Alexey Zinoviev >Priority: Critical > Fix For: 2.8 > > Time Spent: 20m > Remaining Estimate: 0h > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (IGNITE-11655) [ML]: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Zinoviev updated IGNITE-11655: - Ignite Flags: Release Notes Required Release Note: OneHotEncoder returns more columns than expected > [ML]: OneHotEncoder returns more columns than expected > -- > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Assignee: Alexey Zinoviev >Priority: Critical > Fix For: 2.8 > > Time Spent: 20m > Remaining Estimate: 0h > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Muzafarov updated IGNITE-11655: - Fix Version/s: 2.8 > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Assignee: Alexey Zinoviev >Priority: Critical > Fix For: 2.8 > > Time Spent: 20m > Remaining Estimate: 0h > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aleksey Zinoviev updated IGNITE-11655: -- Priority: Critical (was: Major) > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Assignee: Aleksey Zinoviev >Priority: Critical > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Dmitriev updated IGNITE-11655: Description: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} was: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Priority: Major > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Dmitriev updated IGNITE-11655: Description: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} was: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Priority: Major > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Dmitriev updated IGNITE-11655: Description: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} was: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Priority: Major > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Dmitriev updated IGNITE-11655: Description: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} was: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Priority: Major > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Dmitriev updated IGNITE-11655: Description: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} was: OneHotEncoder returns more columns than expected (two values that might be encoded using two columns encoded using 3 columns). The following example demonstrates the problem: {code:java} Map training = new HashMap<>(); training.put(0, new Object[]{42.0}); training.put(1, new Object[]{43.0}); training.put(2, new Object[]{42.0}); EncoderTrainer trainer = new EncoderTrainer() .withEncoderType(EncoderType.ONE_HOT_ENCODER) .withEncodedFeature(0); IgniteBiFunction processor = trainer.fit(training, 1, (k, v) -> v); Vector res = processor.apply(1, new Object[]{42.0}); System.out.println(Arrays.toString(res.asArray())); >>> [0.0, 1.0, 0.0] {code} > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Priority: Major > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > {code:java} > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new EncoderTrainer Object[]>() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = trainer.fit(training, > 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] > {code} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Dmitriev updated IGNITE-11655: Ignite Flags: (was: Docs Required) > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Reporter: Anton Dmitriev >Priority: Major > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new > EncoderTrainer() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = > trainer.fit(training, 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Dmitriev updated IGNITE-11655: Affects Version/s: 2.7 > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Affects Versions: 2.7 >Reporter: Anton Dmitriev >Priority: Major > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new > EncoderTrainer() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = > trainer.fit(training, 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (IGNITE-11655) ML: OneHotEncoder returns more columns than expected
[ https://issues.apache.org/jira/browse/IGNITE-11655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Dmitriev updated IGNITE-11655: Component/s: ml > ML: OneHotEncoder returns more columns than expected > > > Key: IGNITE-11655 > URL: https://issues.apache.org/jira/browse/IGNITE-11655 > Project: Ignite > Issue Type: Bug > Components: ml >Reporter: Anton Dmitriev >Priority: Major > > OneHotEncoder returns more columns than expected (two values that might be > encoded using two columns encoded using 3 columns). The following example > demonstrates the problem: > Map training = new HashMap<>(); > training.put(0, new Object[]{42.0}); > training.put(1, new Object[]{43.0}); > training.put(2, new Object[]{42.0}); > EncoderTrainer trainer = new > EncoderTrainer() > .withEncoderType(EncoderType.ONE_HOT_ENCODER) > .withEncodedFeature(0); > IgniteBiFunction processor = > trainer.fit(training, 1, (k, v) -> v); > Vector res = processor.apply(1, new Object[]{42.0}); > System.out.println(Arrays.toString(res.asArray())); > >>> [0.0, 1.0, 0.0] -- This message was sent by Atlassian JIRA (v7.6.3#76005)