[ 
https://issues.apache.org/jira/browse/TAJO-1796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14730840#comment-14730840
 ] 

ASF GitHub Bot commented on TAJO-1796:
--------------------------------------

GitHub user eminency opened a pull request:

    https://github.com/apache/tajo/pull/729

    TAJO-1796: Count all query for Parquet is crashed

    New PhysicalExec is added for simple count query for Parquet.
    
    It is just a workaround.
    It should be improved later to enable for other types supporting statistics.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/eminency/tajo TAJO-1796

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/tajo/pull/729.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #729
    
----
commit 51e97f7f8012365c7996287a05bc1fb784b31d75
Author: Jongyoung Park <[email protected]>
Date:   2015-08-26T03:23:53Z

    workaround

commit f41305c68b06706e3d3861038415855b39a1cbc9
Author: Jongyoung Park <[email protected]>
Date:   2015-09-04T12:38:32Z

    Parquet format property added

commit 76680d51180b4d063079b4c686ee1fc19c183c61
Author: Jongyoung Park <[email protected]>
Date:   2015-09-04T12:39:56Z

    isCountQuery is added in SimScanProto

commit 8e6ad2a49c0ab0b68ecea03541515dfd18276325
Author: Jongyoung Park <[email protected]>
Date:   2015-09-04T12:43:59Z

    Logical/physical plan implementation logic is changed

commit dd5cd57f753e1a8386db38d82691af67e88b997c
Author: Jongyoung Park <[email protected]>
Date:   2015-09-04T12:44:11Z

    Test added

commit 77611d420e4d8cbae175c653c11c1096620939ac
Author: Jongyoung Park <[email protected]>
Date:   2015-09-04T13:40:13Z

    Fix compilation error after rebasing

----


> Count all query for Parquet is crashed
> --------------------------------------
>
>                 Key: TAJO-1796
>                 URL: https://issues.apache.org/jira/browse/TAJO-1796
>             Project: Tajo
>          Issue Type: Bug
>          Components: Storage
>            Reporter: Jongyoung Park
>            Assignee: Jongyoung Park
>            Priority: Critical
>             Fix For: 0.11.0
>
>
> When 'select count( * )' is excuted, it fails with following error log:
> {quote}
> 2015-08-21 17:17:31,371 ERROR 
> org.apache.tajo.engine.planner.physical.HashShuffleFileWriteExec: A group 
> type can not be empty. Parquet does not support empty group without leaves. 
> Empty group: table_schema
> org.apache.parquet.schema.InvalidSchemaException: A group type can not be 
> empty. Parquet does not support empty group without leaves. Empty group: 
> table_schema
>   at org.apache.parquet.schema.GroupType.<init>(GroupType.java:92)
>   at org.apache.parquet.schema.GroupType.<init>(GroupType.java:48)
>   at org.apache.parquet.schema.MessageType.<init>(MessageType.java:50)
>   at 
> org.apache.tajo.storage.parquet.TajoSchemaConverter.convert(TajoSchemaConverter.java:152)
>   at 
> org.apache.tajo.storage.parquet.TajoReadSupport.init(TajoReadSupport.java:76)
>   at 
> org.apache.parquet.hadoop.InternalParquetRecordReader.initialize(InternalParquetRecordReader.java:172)
>   at 
> org.apache.parquet.hadoop.ParquetReader.initReader(ParquetReader.java:152)
>   at org.apache.parquet.hadoop.ParquetReader.read(ParquetReader.java:128)
>   at 
> org.apache.tajo.storage.parquet.ParquetScanner.next(ParquetScanner.java:73)
> ...
> {quote}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to