[ https://issues.apache.org/jira/browse/CARBONDATA-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ajantha Bhat resolved CARBONDATA-4079.
--------------------------------------
    Resolution: Fixed

The timestamp filter is slow when carbon.push.rowfilters.for.vector = false, because an extra cast is added during Spark filtering. The suggested fix is to set this property to true.

> Queries with Date range are taking time
> ---------------------------------------
>
>                 Key: CARBONDATA-4079
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-4079
>             Project: CarbonData
>          Issue Type: Improvement
>          Components: data-query
>    Affects Versions: 2.1.0
>            Reporter: suyash yadav
>            Assignee: Ajantha Bhat
>            Priority: Major
>
> Hi Team,
> We are doing a POC to understand how we can improve the performance of queries
> fired against a table created in Apache CarbonData.
> Below is the sample query:
>
> *spark.sql("select ts,resource,metric,value from fact_timestamp_global left
> join tags_10_Days_test on fact_timestamp_global.tags_id= tags_10_Days_test.id
> where metric in ('Outbound Utilization (percent)','Inbound Utilization
> (percent)') and resource='10.212.7.98_if:<0001>' and ts between '2020-09-21
> 00:00:00' and '2020-09-21 12:55:55' group by
> ts,resource,metric,value").show(10000,false)*
> As you can see, the above query contains a date range filter. We have noticed
> that because of this date range filter the query takes around 15 seconds,
> which is too slow for us, as we have to bring the query execution time down
> to 3 to 4 seconds.
> Could you please review the above query and suggest a better way of framing
> it, especially the date range filter, that would help us reach the
> desired query execution time?
>
> In case you need more details, please do let me know.
>
> Regards
> Suyash Yadav
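
For reference, the resolution note suggests setting carbon.push.rowfilters.for.vector to true. A minimal sketch of one way to do that from the same spark-shell session that runs the query is shown here. It assumes the CarbonData jars are on the classpath and that setting the property through CarbonProperties in the driver is enough for the deployment in question. Neither assumption is confirmed by the issue.

```scala
// Sketch only: assumes a spark-shell session with CarbonData on the classpath.
import org.apache.carbondata.core.util.CarbonProperties

// Let CarbonData apply row-level filters itself in the vector reader,
// rather than leaving them to Spark (the slow path named in the resolution).
CarbonProperties.getInstance().addProperty("carbon.push.rowfilters.for.vector", "true")

// Then re-run the original spark.sql(...) query and compare timings.
```

The same key can instead be placed in the carbon.properties file as `carbon.push.rowfilters.for.vector=true`. Which route takes effect on the executors depends on how the cluster is configured, so it is worth confirming the timing change on the actual query.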