GitHub user manishgupta88 opened a pull request:
https://github.com/apache/incubator-carbondata/pull/703
[CARBONDATA-780] Alter table support for compaction through sort step
Alter table need to support compaction process where complete data need to
be sorted again and then written to file.
Currently in compaction process data is directly given to writer step where
it is splitted into columns and written. But as columns are sorted from left to
right, on dropping a column data will again become unorganized as dropped
column data will not be considered during compaction. In these scenarios
complete data need to be sorted again and then submitted to writer step.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/manishgupta88/incubator-carbondata
compaction_restructure_support
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/incubator-carbondata/pull/703.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #703
----
commit b108c22024f6381385f0c394ea6ebe515a2e96b4
Author: ravikiran <[email protected]>
Date: 2017-03-15T15:07:26Z
Added class to handle sorting of data for compaction scenario
commit 11f80e3f22f68332ced85ae8da3a122d0a52447e
Author: manishgupta88 <[email protected]>
Date: 2017-03-15T13:54:05Z
Handling for compaction for restructure case. Handled to completely sort
the data again if any restructured block is selected for compaction
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---