[jira] [Updated] (HUDI-2494) Fix usage of different key generators with metadata enabled

sivabalan narayanan (Jira) Mon, 04 Oct 2021 11:28:05 -0700


     [ 
https://issues.apache.org/jira/browse/HUDI-2494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


sivabalan narayanan updated HUDI-2494:
--------------------------------------
    Description: 
With [sync metadata patch|https://github.com/apache/hudi/pull/3590/], when 
metadata is enabled by default, some spark datasource tests failed which were 
using timestamp based key gen and custom key gen. Metadata table's records are 
getting picked up when we do 

 
{code:java}
spark.read.format(hudi).load(basePath + "/*/*")
{code}
 

For now, I have disabled metadata for these tests. 

testSparkPartitonByWithTimestampBasedKeyGenerator

testSparkPartitonByWithCustomKeyGenerator

 

I was looking at 
[options|https://spark.apache.org/docs/latest/sql-data-sources-generic-options.html]
 to ignore certain path, but looks like there is none. 

 

 

  was:
With [sync metadata patch|https://github.com/apache/hudi/pull/3590/], when 
metadata is enabled by default, some spark datasource tests failed which were 
using timestamp based key gen and custom key gen. Metadata table's records are 
getting picked up when we do 

 
{code:java}
spark.read.format(hudi).load(basePath + "/*/*")
{code}
 

For now, I have disabled metadata for these tests. 

testSparkPartitonByWithTimestampBasedKeyGenerator

testSparkPartitonByWithCustomKeyGenerator

 


> Fix usage of different key generators with metadata enabled
> -----------------------------------------------------------
>
>                 Key: HUDI-2494
>                 URL: https://issues.apache.org/jira/browse/HUDI-2494
>             Project: Apache Hudi
>          Issue Type: Sub-task
>          Components: Spark Integration
>            Reporter: sivabalan narayanan
>            Assignee: sivabalan narayanan
>            Priority: Major
>              Labels: sev:critical
>             Fix For: 0.10.0
>
>
> With [sync metadata patch|https://github.com/apache/hudi/pull/3590/], when 
> metadata is enabled by default, some spark datasource tests failed which were 
> using timestamp based key gen and custom key gen. Metadata table's records 
> are getting picked up when we do 
>  
> {code:java}
> spark.read.format(hudi).load(basePath + "/*/*")
> {code}
>  
> For now, I have disabled metadata for these tests. 
> testSparkPartitonByWithTimestampBasedKeyGenerator
> testSparkPartitonByWithCustomKeyGenerator
>  
> I was looking at 
> [options|https://spark.apache.org/docs/latest/sql-data-sources-generic-options.html]
>  to ignore certain path, but looks like there is none. 
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Updated] (HUDI-2494) Fix usage of different key generators with metadata enabled

Reply via email to