[
https://issues.apache.org/jira/browse/PHOENIX-4872?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16665774#comment-16665774
]
ASF GitHub Bot commented on PHOENIX-4872:
-----------------------------------------
GitHub user swaroopak opened a pull request:
https://github.com/apache/phoenix/pull/395
PHOENIX-4872: BulkLoad has bug when loading on
single-cell-array-with-offsets table.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/swaroopak/phoenix PHOENIX-4872
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/phoenix/pull/395.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #395
----
commit ceebebb0ed58a628723b885f28cff19f2dad3185
Author: s.kadam <s.kadam@...>
Date: 2018-10-26T23:54:49Z
PHOENIX-4872: BulkLoad has bug when loading on
single-cell-array-with-offsets table.
----
> BulkLoad has bug when loading on single-cell-array-with-offsets table.
> ----------------------------------------------------------------------
>
> Key: PHOENIX-4872
> URL: https://issues.apache.org/jira/browse/PHOENIX-4872
> Project: Phoenix
> Issue Type: Bug
> Affects Versions: 4.11.0, 4.12.0, 4.13.0, 4.14.0
> Reporter: JeongMin Ju
> Assignee: Swaroopa Kadam
> Priority: Critical
>
> CsvBulkLoadTool creates incorrect data for the
> SCAWO(SingleCellArrayWithOffsets) table.
> Every phoenix table needs a marker (empty) column, but CsvBulkLoadTool does
> not create that column for SCAWO tables.
> If you check the data through HBase Shell, you can see that there is no
> corresponding column.
> If created by Upsert Query, it is created normally.
> {code:java}
> column=0:\x00\x00\x00\x00, timestamp=1535420036372, value=x
> {code}
> Since there is no upper column, the result of all Group By queries is zero.
> This is because "families":
> {"0": ["\\ x00 \\ x00 \\ x00 \\ x00"]}
> is added to the column of the Scan object.
> Because the CsvBulkLoadTool has not created the column, the result of the
> scan is empty.
>
> This problem applies only to tables with multiple column families. The
> single-column family table works luckily.
> "Families": \{"0": ["ALL"]} is added to the column of the Scan object in the
> single column family table.
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)