Makoto Yui created HIVEMALL-259:
-----------------------------------
Summary: [BUG] feature_binning does not work properly under
certain conditions
Key: HIVEMALL-259
URL: https://issues.apache.org/jira/browse/HIVEMALL-259
Project: Hivemall
Issue Type: Bug
Affects Versions: 0.5.2
Reporter: Makoto Yui
Assignee: Makoto Yui
Fix For: 0.6.0
feature_binning does not properly work in certain condition.
It might be a bug in quantiles lookup by a different key type object at [this
line|[https://github.com/apache/incubator-hivemall/blob/master/core/src/main/java/hivemall/ftvec/binning/FeatureBinningUDF.java#L133]].
{code:java}
WITH extracted as (
select
extract_feature(feature) as index,
extract_weight(feature) as value
from
input l
LATERAL VIEW explode(features) r as feature
),
bins as (
select
map(index, build_bins(value, 5, true)) as quantiles -- 5 bins with auto
bin shrinking
from
extracted
group by
index
)
select
l.features as original,
feature_binning(l.features, r.quantiles) as features
from
input l
cross join bins r
;
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)