[
https://issues.apache.org/jira/browse/HIVE-259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jerome Boulon updated HIVE-259:
-------------------------------
Attachment: Percentile.xlsx
jb2.txt
Percentile test file + validation using Excep Percentile function:
CREATE TABLE JB2
(
duration bigint,
code string
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ' ' LINES TERMINATED BY '\n'
STORED AS TEXTFILE;
LOAD DATA LOCAL INPATH '/jb2.txt' INTO TABLE JB2;
Result:
hive> select percentile(duration,"25,50,99") from JB2;
Ended Job = job_201002201654_0006
OK
[14.0,33.0,416.4000000000001]
Time taken: 36.261 seconds
hive> select code,percentile(duration,"25,50,99") from JB2 group by code;
Ended Job = job_201002201654_0007
OK
a [2.0,17.5,427.2299999999999]
b [22.75,44.5,345.84999999999997]
c [18.0,29.0,58.760000000000005]
Time taken: 23.419 seconds
hive> quit;
> Add PERCENTILE aggregate function
> ---------------------------------
>
> Key: HIVE-259
> URL: https://issues.apache.org/jira/browse/HIVE-259
> Project: Hadoop Hive
> Issue Type: New Feature
> Components: Query Processor
> Reporter: Venky Iyer
> Assignee: Jerome Boulon
> Attachments: HIVE-259-2.patch, HIVE-259.1.patch, HIVE-259.patch,
> jb2.txt, Percentile.xlsx
>
>
> Compute atleast 25, 50, 75th percentiles
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.