Hi An Lan,
Data is already distributed, in this case may be one
blocklet is returning more number of rows and other returning less because
of this some task will take more time.
In driver log block distribution log is not present, so it is not clear
whether it is going for block di
hi Vinod,
It is an expected feature for many people as Jacky mentioned. I think
Update/Delete should be basic module for CarbonData, meanwhile it is
complex question for distributed storage system. The solution you proposed
is based on traditional 'Base + Delta' approach, which is applied on
bigta
Github user Zhangshunyu closed the pull request at:
https://github.com/apache/incubator-carbondata/pull/319
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if
GitHub user Jay357089 opened a pull request:
https://github.com/apache/incubator-carbondata/pull/320
[CARBONDATA-412][WIP]Fix load bug when table name has '_'
https://issues.apache.org/jira/browse/CARBONDATA-412
## Reason:
this is because in windows, file separator is
Jay created CARBONDATA-412:
--
Summary: in windows, when load into table whose name has "_", the
old segment will be deleted.
Key: CARBONDATA-412
URL: https://issues.apache.org/jira/browse/CARBONDATA-412
Proje
Hi Kumar Vishal,
1. I found the quantity of rows filtered out by invert index is not uniform
between different tasks and the difference is large. Some task may be 3~4k
row after filtered, but the longer tasks may be 3~4w. When most longer task
on same node, time cost will be more longer than other
Github user ravipesala closed the pull request at:
https://github.com/apache/incubator-carbondata/pull/318
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if t
GitHub user Zhangshunyu opened a pull request:
https://github.com/apache/incubator-carbondata/pull/319
[CARBONDATA-411] Test
test
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/Zhangshunyu/incubator-carbondata a
Alternatively y
zhangshunyu created CARBONDATA-411:
--
Summary: test
Key: CARBONDATA-411
URL: https://issues.apache.org/jira/browse/CARBONDATA-411
Project: CarbonData
Issue Type: Improvement
Compone
Github user asfgit closed the pull request at:
https://github.com/apache/incubator-carbondata/pull/311
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the f
Github user asfgit closed the pull request at:
https://github.com/apache/incubator-carbondata/pull/267
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the f
GitHub user ravipesala opened a pull request:
https://github.com/apache/incubator-carbondata/pull/318
[WIP] Dictionary server implementation for single pass data load
It is work under progress, we can review the design of this PR
You can merge this pull request into a Git repository
Hi Vinod,
It is great to have this feature, as there were many people asking for data
update during the CarbonData meetup earlier. I believe it will be useful for
many big data applications.
For the solution you proposed, I have following doubts:
1. Data update is complex as if transaction is
13 matches
Mail list logo