[
https://issues.apache.org/jira/browse/BEAM-7929?focusedWorklogId=559667&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-559667
]
ASF GitHub Bot logged work on BEAM-7929:
----------------------------------------
Author: ASF GitHub Bot
Created on: 02/Mar/21 00:25
Start Date: 02/Mar/21 00:25
Worklog Time Spent: 10m
Work Description: TheNeuralBit commented on a change in pull request
#14117:
URL: https://github.com/apache/beam/pull/14117#discussion_r585146136
##########
File path:
sdks/java/extensions/sql/src/main/java/org/apache/beam/sdk/extensions/sql/meta/provider/parquet/ParquetTableProvider.java
##########
@@ -39,19 +36,18 @@
* favorite_numbers ARRAY<INTEGER>
* )
* TYPE 'parquet'
- * LOCATION '/home/admin/users.parquet'
+ * LOCATION '/home/admin/orders/'
Review comment:
Thanks for updating this, I guess the original version won't work since
we always add a `/*` to the end for reads. This should probably just padd
through the location directly instead, so the user can specify a glob if they
want. Another follow-on jira I suppose.
##########
File path: sdks/java/extensions/sql/build.gradle
##########
@@ -79,6 +79,7 @@ dependencies {
provided project(":sdks:java:io:kafka")
provided project(":sdks:java:io:google-cloud-platform")
compile project(":sdks:java:io:mongodb")
+ compile library.java.avro
Review comment:
I suppose this is necessary because of the direct references to
`org.apache.avro`? I think if we can push all that complexity into ParquetIO it
won't be necessary, right?
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 559667)
Time Spent: 1h (was: 50m)
> ParquetTable.buildIOReader should support column projection and filter
> predicate
> --------------------------------------------------------------------------------
>
> Key: BEAM-7929
> URL: https://issues.apache.org/jira/browse/BEAM-7929
> Project: Beam
> Issue Type: New Feature
> Components: dsl-sql, io-java-parquet
> Affects Versions: 2.14.0
> Reporter: Neville Li
> Priority: P3
> Time Spent: 1h
> Remaining Estimate: 0h
>
> To leverage the performance improvements in Parquet.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)