[ 
https://issues.apache.org/jira/browse/CARBONDATA-2204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

xuchuanyin reassigned CARBONDATA-2204:
--------------------------------------

       Assignee:     (was: xuchuanyin)
    Description: 
* Problems

Currently in CarbonData, a single query reads the tablestatus file 7 times. 
This slows the query down, especially when the file is on a remote cluster, 
because reading it is purely a client-side operation.

 
* Steps to reproduce

1. Add a logger in `AtomicFileOperationsImpl.openForRead` and print the name of 
the file being read.

2. Run a query on a CarbonData table. Here I ran 
`TestLoadDataGeneral.test("test data loading CSV file without extension name")`.

3. Inspect the output log and search for the keyword 'tablestatus'.
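
Step 3 can be automated with a small log filter. The following is a minimal sketch, assuming the logger from step 1 writes one line per `openForRead` call and that each line contains the path of the file being read. `TablestatusReadCounter` and `countReads` are hypothetical names used for illustration and are not part of CarbonData.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;

// Hypothetical helper, not part of CarbonData: counts the log lines that
// mention the tablestatus file.
final class TablestatusReadCounter {
    private TablestatusReadCounter() {
    }

    // Assumption: one log line is printed per openForRead call, and the line
    // contains the file path, so a substring match is enough.
    static long countReads(List<String> logLines) {
        return logLines.stream()
                .filter(line -> line.contains("tablestatus"))
                .count();
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java TablestatusReadCounter <log-file>");
            return;
        }
        // Read the captured test output and report the number of matches.
        List<String> lines = Files.readAllLines(Paths.get(args[0]));
        System.out.println("tablestatus reads: " + countReads(lines));
    }
}
```

Running it with the path of the captured log as the only argument prints the number of matching lines, which can be compared with the 7 reads reported above.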

 

  was:
# Problems

Currently in CarbonData, a single query reads the tablestatus file 7 times. 
This slows the query down, especially when the file is on a remote cluster, 
because reading it is purely a client-side operation.

 

# Steps to reproduce

1. Add a logger in `AtomicFileOperationsImpl.openForRead` and print the name of 
the file being read.

2. Run a query on a CarbonData table. Here I ran 
`TestLoadDataGeneral.test("test data loading CSV file without extension name")`.

3. Inspect the output log and search for the keyword 'tablestatus'.

 


> Access tablestatus file too many times during query
> ---------------------------------------------------
>
>                 Key: CARBONDATA-2204
>                 URL: https://issues.apache.org/jira/browse/CARBONDATA-2204
>             Project: CarbonData
>          Issue Type: Improvement
>          Components: data-query
>    Affects Versions: 1.3.0
>            Reporter: xuchuanyin
>            Priority: Major
>
> * Problems
> Currently in CarbonData, a single query reads the tablestatus file 7 times. 
> This slows the query down, especially when the file is on a remote cluster, 
> because reading it is purely a client-side operation.
>  
> * Steps to reproduce
> 1. Add a logger in `AtomicFileOperationsImpl.openForRead` and print the name 
> of the file being read.
> 2. Run a query on a CarbonData table. Here I ran 
> `TestLoadDataGeneral.test("test data loading CSV file without extension name")`.
> 3. Inspect the output log and search for the keyword 'tablestatus'.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
