Re: A New GroupBy Syntax

田原 Mon, 18 Nov 2019 20:57:37 -0800

Hi,

I've completed the work of new groupie syntax which also supports the inner 
interval(the corresponding JIRA link is 
https://issues.apache.org/jira/projects/IOTDB/issues/IOTDB-217) and also update 
the outdated group by documents.


The PR link is https://github.com/apache/incubator-iotdb/pull/570.


> -----原始邮件-----
> 发件人: "田原" <[email protected]>
> 发送时间: 2019-11-12 16:16:48 (星期二)
> 收件人: [email protected]
> 抄送: 
> 主题: A New GroupBy Syntax
> 
> Hi,
> 
> 
> Recently, I'm working on the "IOTDB-217: A new predicate of time", the JIRA 
> link is https://issues.apache.org/jira/projects/IOTDB/issues/IOTDB-217.
> 
> 
> I think we need a new GroupByClause to support it.
> 
> 
> The new GroupByClause grammar is shown as the following:
> 
> 
> GroupByClause : LPAREN <TimeInterval> COMMA <TimeUnit> (COMMA <TimeUnit>)? 
> RPAREN
> TimeUnit : Integer <DurationUnit>
> DurationUnit : "ms" | "s" | "m" | "h" | "d" | "w"
> TimeInterval: '[' TimeValue ',' TimeValue ']'
> 
> 
> group by ([startTime, endTime], timeInterval, slidingStep(optional，default 
> value is equal to timeInterval，it must be greater than or equal to 
> timeInterval if specified))
> 
> 
> If we want to do aggregate query for the data of 8:00 a.m. to 11:00 a.m. 
> every day in July and August of 2019, the query sql will be like this:
> 
> 
> select count(status), max_value(temperature) from root.ln.wf01.wt01 
> group by ([2019-7-01T08:00:00, 2019-8-31T23:59:59], 3h, 1d);
> 
> 
> Part of the query results are as follows:
> 
> 
> | Time   | count(root.ln.wf01.wt01.status) | 
> max_value(root.ln.wf01.wt01.temperature)
> | ---------------------| ----- | ---------- |
> | 2019-7-01T08:00:00   | 1440  | 25.996933  |
> | 2019-7-02T08:00:00   | 1440  | 26.102223  |
> | 2019-7-03T08:00:00   | 1440  | 26.211028  |
> | 2019-7-04T08:00:00   | 1440  | 25.999288  |
> | 2019-7-05T08:00:00   | 1440  | 25.958802  |
> | 2019-7-06T08:00:00   | 1430  | 30.222344  |
> 
> 
> How does the original group by statement correspond to the new grammar? 
> Simply omit the optional parameter of the second parameter 'sliding step'.
> 
> 
> Original group by sql
> select count(status), max_value(temperature) from root.ln.wf01.wt01 
> group by (1d, [2017-11-01T00:00:00, 2017-11-07T23:00:00]);
> 
> 
> Transformed to new group by grammar
> select count(status), max_value(temperature) from root.ln.wf01.wt01 
> group by ([2017-11-01T00:00:00, 2017-11-07T23:00:00], 1d);
> 
> 
> It can be seen that the new groupby sql is almost identical to the original 
> one when the original groupbysql does not specify a starting point, and the 
> second optional parameter given by the original groupby syntax - the starting 
> point of the axis, I think, to some extent, it coincides with the starting 
> point of the latter, which increases the complexity of the group by syntax, 
> so in the new groupby syntax, this optional parameter is directly discarded.
> 
> 
>

Re: A New GroupBy Syntax

Reply via email to