[
https://issues.apache.org/jira/browse/CARBONDATA-3056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kunal Kapoor resolved CARBONDATA-3056.
--------------------------------------
Resolution: Fixed
Assignee: Naman Rastogi
Fix Version/s: 1.5.1
> Implement concurrent reading through CarbonReader
> -------------------------------------------------
>
> Key: CARBONDATA-3056
> URL: https://issues.apache.org/jira/browse/CARBONDATA-3056
> Project: CarbonData
> Issue Type: Sub-task
> Reporter: Naman Rastogi
> Assignee: Naman Rastogi
> Priority: Minor
> Fix For: 1.5.1
>
> Time Spent: 2h 10m
> Remaining Estimate: 0h
>
> The current reading through SDK is slow as in CarbonReader, we are reading
> the carbondata files sequentially, even though we have individual
> CarbonRecordReader for each file. We can parallelize this by adding an API in
> CarbonReader class
> *List<CarbonReader> readers = CarbonReader.split(numSplits)*
> which returns a list of CarbonReaders, which can be used to read parallelly,
> as reading each file is independent of other files.
>
> This enables the SDK user to read the files as it is, or in a multithreaded
> environment.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)