[jira] [Updated] (HUDI-2964) Fix AWSLockConfiguration to inherit from HoodieConfig

2021-12-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2964: -- Fix Version/s: 0.10.1 > Fix AWSLockConfiguration to inherit from HoodieConfig >

[jira] [Updated] (HUDI-2951) Disable remote view storage config for flink

2021-12-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2951: -- Fix Version/s: (was: 0.10.0) > Disable remote view storage config for flink >

[jira] [Updated] (HUDI-2951) Disable remote view storage config for flink

2021-12-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2951?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2951: -- Fix Version/s: 0.10.1 > Disable remote view storage config for flink >

[jira] [Updated] (HUDI-2299) The log format DELETE block lose the info orderingVal

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2299: - Story Points: 4 > The log format DELETE block lose the info orderingVal >

[jira] [Updated] (HUDI-2876) hudi should remove the temp file which create by HoodieMergedLogRecordScanner, when we use presto

2021-12-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2876: -- Fix Version/s: 0.10.1 > hudi should remove the temp file which create by >

[jira] [Updated] (HUDI-2900) Fix corrupt block end position

2021-12-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2900: -- Fix Version/s: 0.10.1 > Fix corrupt block end position > --

[GitHub] [hudi] singaretti commented on issue #4177: [SUPPORT] org.apache.hudi.exception.HoodieException: Unknown versionCode:2

2021-12-29 Thread GitBox
singaretti commented on issue #4177: URL: https://github.com/apache/hudi/issues/4177#issuecomment-1002748963 Hi, I was facing this error last weeks, and now, I guess I fixed data pipeline just adjusting AWS EMR versions that it was using Hudi to write to s3 buckets. Data Pipeline

[jira] [Updated] (HUDI-2644) Integrate existing curves with stats from the metadata table

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2644: - Priority: Blocker (was: Major) > Integrate existing curves with stats from the metadata table >

[jira] [Updated] (HUDI-2644) Integrate existing curves with stats from the metadata table

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2644: - Fix Version/s: 0.11.0 > Integrate existing curves with stats from the metadata table >

[jira] [Updated] (HUDI-2644) Integrate existing curves with stats from the metadata table

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2644: - Story Points: 4 > Integrate existing curves with stats from the metadata table >

[jira] [Assigned] (HUDI-2644) Integrate existing curves with stats from the metadata table

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar reassigned HUDI-2644: Assignee: Manoj Govindassamy > Integrate existing curves with stats from the metadata

[jira] [Updated] (HUDI-2100) [UMBRELLA] Support Space curve for hudi

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2100: - Fix Version/s: (was: 0.11.0) > [UMBRELLA] Support Space curve for hudi >

[jira] [Updated] (HUDI-2675) Not an Avro data file

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2675: - Issue Type: Improvement (was: Bug) > Not an Avro data file > - > >

[jira] [Updated] (HUDI-2675) Not an Avro data file

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2675: - Fix Version/s: 0.10.1 > Not an Avro data file > - > > Key:

[jira] [Updated] (HUDI-2777) Data import performance deteriorates because multiple Spark jobs are started when data is written to disks.

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2777: - Issue Type: Improvement (was: Bug) > Data import performance deteriorates because multiple Spark

[jira] [Updated] (HUDI-2774) Async Clustering via deltstreamer fails with IllegalStateException: Duplicate key [==>20211116123724586__replacecommit__INFLIGHT]

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-2774: - Issue Type: Improvement (was: Bug) > Async Clustering via deltstreamer fails with

[jira] [Updated] (HUDI-1380) Async cleaning does not work with Timeline Server

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1380: - Story Points: 4 > Async cleaning does not work with Timeline Server >

[jira] [Updated] (HUDI-1380) Async cleaning does not work with Timeline Server

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1380: - Issue Type: Improvement (was: Bug) > Async cleaning does not work with Timeline Server >

[jira] [Updated] (HUDI-3127) Add a new HoodieHFileReader for Trino with Java 11

2021-12-29 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3127: Status: In Progress (was: Open) > Add a new HoodieHFileReader for Trino with Java 11 >

[jira] [Updated] (HUDI-3126) Address whackamoles during testing of Hudi Trino connector

2021-12-29 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3126: Status: In Progress (was: Open) > Address whackamoles during testing of Hudi Trino connector >

[jira] [Updated] (HUDI-3107) Fix HiveSyncTool drop partitions using JDBC

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3107: - Sprint: Hudi-Sprint-0.10.1 > Fix HiveSyncTool drop partitions using JDBC >

[jira] [Updated] (HUDI-1185) KeyGenerator class/interfaces need to be decoupled from Spark

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1185: - Story Points: 4 > KeyGenerator class/interfaces need to be decoupled from Spark >

[jira] [Updated] (HUDI-1275) Incremental TImeline Syncing causes compaction to fail with FileNotFound exception

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-1275: - Story Points: 4 > Incremental TImeline Syncing causes compaction to fail with FileNotFound >

[jira] [Created] (HUDI-3127) Add a new HoodieHFileReader for Trino with Java 11

2021-12-29 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-3127: --- Summary: Add a new HoodieHFileReader for Trino with Java 11 Key: HUDI-3127 URL: https://issues.apache.org/jira/browse/HUDI-3127 Project: Apache Hudi Issue Type:

[jira] [Updated] (HUDI-3127) Add a new HoodieHFileReader for Trino with Java 11

2021-12-29 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-3127: Story Points: 2 > Add a new HoodieHFileReader for Trino with Java 11 >

[jira] [Updated] (HUDI-431) Support Parquet in MOR log files

2021-12-29 Thread Vinoth Chandar (Jira)
[ https://issues.apache.org/jira/browse/HUDI-431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vinoth Chandar updated HUDI-431: Sprint: Hudi-Sprint-Jan-3 > Support Parquet in MOR log files > > >

[jira] [Created] (HUDI-3126) Address whackamoles during testing of Hudi Trino connector

2021-12-29 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-3126: --- Summary: Address whackamoles during testing of Hudi Trino connector Key: HUDI-3126 URL: https://issues.apache.org/jira/browse/HUDI-3126 Project: Apache Hudi Issue

[GitHub] [hudi] hudi-bot commented on pull request #4471: [HUDI-3125] spark-sql write timestamp directly

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4471: URL: https://github.com/apache/hudi/pull/4471#issuecomment-1002683042 ## CI report: * 190b5c5b04dabbf3aa28a0a43634a7f358104ce1 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4471: [HUDI-3125] spark-sql write timestamp directly

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4471: URL: https://github.com/apache/hudi/pull/4471#issuecomment-1002660297 ## CI report: * 190b5c5b04dabbf3aa28a0a43634a7f358104ce1 Azure:

[jira] [Resolved] (HUDI-3092) Implement partition filtering based on predicates in split manager/source

2021-12-29 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo resolved HUDI-3092. - > Implement partition filtering based on predicates in split manager/source >

[jira] [Closed] (HUDI-2807) Failing to acquire lock with async clustering if clustering gets delayed due to lack of resources

2021-12-29 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan closed HUDI-2807. - Resolution: Cannot Reproduce > Failing to acquire lock with async clustering if

[GitHub] [hudi] nsivabalan commented on issue #4027: [SUPPORT] Structured streaming Async clustering IndexOutOfBoundsException

2021-12-29 Thread GitBox
nsivabalan commented on issue #4027: URL: https://github.com/apache/hudi/issues/4027#issuecomment-1002678785 Hey, I tried to reproduce locally and could not. https://gist.github.com/nsivabalan/7d6ea90ebfa76f9a53abedfa562562b7 can you confirm few things: 1. is your table MOR?

[GitHub] [hudi] YannByron commented on issue #4429: [SUPPORT] Spark SQL CTAS command doesn't work with 0.10.0 version and Spark 3.1.1

2021-12-29 Thread GitBox
YannByron commented on issue #4429: URL: https://github.com/apache/hudi/issues/4429#issuecomment-1002676758 @vingov I can't reproduce this with Hudi 0.10.0 and Spark 3.1.2.

[GitHub] [hudi] hudi-bot commented on pull request #4471: [HUDI-3125] spark-sql write timestamp directly

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4471: URL: https://github.com/apache/hudi/pull/4471#issuecomment-1002660297 ## CI report: * 190b5c5b04dabbf3aa28a0a43634a7f358104ce1 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002631794 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4471: [HUDI-3125] spark-sql write timestamp directly

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4471: URL: https://github.com/apache/hudi/pull/4471#issuecomment-1002658858 ## CI report: * 190b5c5b04dabbf3aa28a0a43634a7f358104ce1 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002660277 ## CI report: * f718cea8b001eaeb132d97b2ae0e49f1bc101c0f Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4471: [HUDI-3125] spark-sql write timestamp directly

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4471: URL: https://github.com/apache/hudi/pull/4471#issuecomment-1002658858 ## CI report: * 190b5c5b04dabbf3aa28a0a43634a7f358104ce1 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[jira] [Updated] (HUDI-3125) Spark SQL writing timestamp type don't need to disable `spark.sql.datetime.java8API.enabled` manually

2021-12-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3125: - Labels: pull-request-available (was: ) > Spark SQL writing timestamp type don't need to disable

[GitHub] [hudi] YannByron opened a new pull request #4471: [HUDI-3125] spark-sql write timestamp directly

2021-12-29 Thread GitBox
YannByron opened a new pull request #4471: URL: https://github.com/apache/hudi/pull/4471 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[jira] [Updated] (HUDI-3125) Spark SQL writing timestamp type don't need to disable `spark.sql.datetime.java8API.enabled` manually

2021-12-29 Thread Yann Byron (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yann Byron updated HUDI-3125: - Summary: Spark SQL writing timestamp type don't need to disable `spark.sql.datetime.java8API.enabled`

[jira] [Created] (HUDI-3125) Spark SQL write timestamp type without disable `spark.sql.datetime.java8API.enabled` manually

2021-12-29 Thread Yann Byron (Jira)
Yann Byron created HUDI-3125: Summary: Spark SQL write timestamp type without disable `spark.sql.datetime.java8API.enabled` manually Key: HUDI-3125 URL: https://issues.apache.org/jira/browse/HUDI-3125

[GitHub] [hudi] hudi-bot removed a comment on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-1002609920 ## CI report: * ee973637958fb9c1496cfb45f78346e2f01ffa02 UNKNOWN * c435898754ec3ce579eb2b67c473443c9ee70e46 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-12-29 Thread GitBox
hudi-bot commented on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-1002638207 ## CI report: * ee973637958fb9c1496cfb45f78346e2f01ffa02 UNKNOWN * c435898754ec3ce579eb2b67c473443c9ee70e46 UNKNOWN * 2de552412d4967d5af69c1c64a8796eb23d9149f

[GitHub] [hudi] vinothchandar commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2021-12-29 Thread GitBox
vinothchandar commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1002634513 you may want to rebase the PR. seems like a lot went in :) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [hudi] vinothchandar commented on pull request #4350: [HUDI-3047] Basic Implementation of Spark Datasource V2

2021-12-29 Thread GitBox
vinothchandar commented on pull request #4350: URL: https://github.com/apache/hudi/pull/4350#issuecomment-1002634398 @leesf so this is just preparing the code and moving things around? no functional changes? Will review this more closely -- This is an automated message from the Apache

[GitHub] [hudi] hudi-bot removed a comment on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002630414 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002631794 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002630414 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002610463 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 Azure:

[GitHub] [hudi] danny0405 commented on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
danny0405 commented on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002611981 Thanks for the contribution, can we fix the checkstyle issue and add nested rows test case. -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] hudi-bot removed a comment on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002581386 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002610463 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-1002275040 ## CI report: * ee973637958fb9c1496cfb45f78346e2f01ffa02 UNKNOWN * c435898754ec3ce579eb2b67c473443c9ee70e46 UNKNOWN *

[GitHub] [hudi] hudi-bot commented on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-12-29 Thread GitBox
hudi-bot commented on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-1002609920 ## CI report: * ee973637958fb9c1496cfb45f78346e2f01ffa02 UNKNOWN * c435898754ec3ce579eb2b67c473443c9ee70e46 UNKNOWN * 2de552412d4967d5af69c1c64a8796eb23d9149f

[GitHub] [hudi] minihippo commented on pull request #3173: [HUDI-1951] Add bucket hash index, compatible with the hive bucket

2021-12-29 Thread GitBox
minihippo commented on pull request #3173: URL: https://github.com/apache/hudi/pull/3173#issuecomment-1002609673 @hudi-bot run azure -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] hudi-bot commented on pull request #4468: [Issue: #2802] Fixing Hive getSchema for RT tables

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4468: URL: https://github.com/apache/hudi/pull/4468#issuecomment-1002606537 ## CI report: * 433fa28702fd7628c7b48c18385ec8c4700ec2b8 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4468: [Issue: #2802] Fixing Hive getSchema for RT tables

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4468: URL: https://github.com/apache/hudi/pull/4468#issuecomment-1002575901 ## CI report: * 433fa28702fd7628c7b48c18385ec8c4700ec2b8 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002580234 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002581386 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002580234 ## CI report: * ad9143cc014c85e6282c880325627505833076e4 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[jira] [Updated] (HUDI-3083) Support component data types for flink bulk_insert

2021-12-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated HUDI-3083: - Labels: pull-request-available (was: ) > Support component data types for flink bulk_insert >

[GitHub] [hudi] lsyldliu commented on pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
lsyldliu commented on pull request #4470: URL: https://github.com/apache/hudi/pull/4470#issuecomment-1002579411 cc @danny0405 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific

[GitHub] [hudi] lsyldliu opened a new pull request #4470: [HUDI-3083] Support component data types for flink bulk_insert

2021-12-29 Thread GitBox
lsyldliu opened a new pull request #4470: URL: https://github.com/apache/hudi/pull/4470 ## *Tips* - *Thank you very much for contributing to Apache Hudi.* - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.* ## What is the

[GitHub] [hudi] todd5167 opened a new issue #4469: [SUPPORT] bulk_insert cannot write final metadata information

2021-12-29 Thread GitBox
todd5167 opened a new issue #4469: URL: https://github.com/apache/hudi/issues/4469 **Environment Description** Hudi version : 0.10.0 Flink version : 1.13.3 Storage (HDFS/S3/GCS..) : AWS s3 Running on Docker? (yes/no) : yes, k8s

[GitHub] [hudi] hudi-bot removed a comment on pull request #4468: [Issue: #2802] Fixing Hive getSchema for RT tables

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4468: URL: https://github.com/apache/hudi/pull/4468#issuecomment-1002574877 ## CI report: * 433fa28702fd7628c7b48c18385ec8c4700ec2b8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot

[GitHub] [hudi] hudi-bot commented on pull request #4468: [Issue: #2802] Fixing Hive getSchema for RT tables

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4468: URL: https://github.com/apache/hudi/pull/4468#issuecomment-1002575901 ## CI report: * 433fa28702fd7628c7b48c18385ec8c4700ec2b8 Azure:

[GitHub] [hudi] aditiwari01 commented on issue #2802: Hive read issues when different partition have different schemas.

2021-12-29 Thread GitBox
aditiwari01 commented on issue #2802: URL: https://github.com/apache/hudi/issues/2802#issuecomment-1002574909 @nsivabalan I had to setup my env with latest hudi code. I have raised the PR with minor patch for the same. Kindly go through the change. I have tested it on local and on cluster

[GitHub] [hudi] hudi-bot commented on pull request #4468: [Issue: #2802] Fixing Hive getSchema for RT tables

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4468: URL: https://github.com/apache/hudi/pull/4468#issuecomment-1002574877 ## CI report: * 433fa28702fd7628c7b48c18385ec8c4700ec2b8 UNKNOWN Bot commands @hudi-bot supports the following commands: - `@hudi-bot run azure`

[GitHub] [hudi] aditiwari01 opened a new pull request #4468: [Issue: #2802] Fixing Hive getSchema for RT tables

2021-12-29 Thread GitBox
aditiwari01 opened a new pull request #4468: URL: https://github.com/apache/hudi/pull/4468 ## What is the purpose of the pull request *Fixing Hive getSchema for RT tables* Refer issue for more details: https://github.com/apache/hudi/issues/2802 ## Verify this pull

[hudi] branch master updated (a29b27c -> 504747e)

2021-12-29 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from a29b27c [MINOR] HoodieInstantTimeGenerator improve method used (#4462) add 504747e [HUDI-3108] Fix Purge Drop

[GitHub] [hudi] leesf merged pull request #4455: [HUDI-3108] Fix Purge Drop MOR Table Cause error

2021-12-29 Thread GitBox
leesf merged pull request #4455: URL: https://github.com/apache/hudi/pull/4455 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] sunilknataraj commented on issue #4311: Duplicate Records in Merge on Read [SUPPORT]

2021-12-29 Thread GitBox
sunilknataraj commented on issue #4311: URL: https://github.com/apache/hudi/issues/4311#issuecomment-1002566382 Myself and John belong to same company and talking about same issue. As I mentioned in my post, John already provided the hudiOptions in the original issue description and I was

[GitHub] [hudi] hudi-bot removed a comment on pull request #4459: [HUDI-3116]Add a new HoodieDropPartitionsTool to let users drop table partitions through a standalone job.

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4459: URL: https://github.com/apache/hudi/pull/4459#issuecomment-1002533544 ## CI report: * d9182c1661e37f29622caafd9eaa23de73b26331 UNKNOWN * 270eee7ef88fc59339675b1443b8918e63015fed Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4459: [HUDI-3116]Add a new HoodieDropPartitionsTool to let users drop table partitions through a standalone job.

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4459: URL: https://github.com/apache/hudi/pull/4459#issuecomment-1002556138 ## CI report: * d9182c1661e37f29622caafd9eaa23de73b26331 UNKNOWN * 0bf10eb8008082305ae4abe87add403207bb6262 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4459: [HUDI-3116]Add a new HoodieDropPartitionsTool to let users drop table partitions through a standalone job.

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4459: URL: https://github.com/apache/hudi/pull/4459#issuecomment-1002527922 ## CI report: * d9182c1661e37f29622caafd9eaa23de73b26331 UNKNOWN * 270eee7ef88fc59339675b1443b8918e63015fed Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4459: [HUDI-3116]Add a new HoodieDropPartitionsTool to let users drop table partitions through a standalone job.

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4459: URL: https://github.com/apache/hudi/pull/4459#issuecomment-1002533544 ## CI report: * d9182c1661e37f29622caafd9eaa23de73b26331 UNKNOWN * 270eee7ef88fc59339675b1443b8918e63015fed Azure:

[hudi] branch master updated (9412281 -> a29b27c)

2021-12-29 Thread leesf
This is an automated email from the ASF dual-hosted git repository. leesf pushed a change to branch master in repository https://gitbox.apache.org/repos/asf/hudi.git. from 9412281 [HUDI-2983] Remove Log4j2 transitive dependencies (#4281) add a29b27c [MINOR] HoodieInstantTimeGenerator

[GitHub] [hudi] leesf merged pull request #4462: [MINOR] HoodieInstantTimeGenerator improve method used

2021-12-29 Thread GitBox
leesf merged pull request #4462: URL: https://github.com/apache/hudi/pull/4462 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4459: [HUDI-3116]Add a new HoodieDropPartitionsTool to let users drop table partitions through a standalone job.

2021-12-29 Thread GitBox
hudi-bot removed a comment on pull request #4459: URL: https://github.com/apache/hudi/pull/4459#issuecomment-1002412658 ## CI report: * d9182c1661e37f29622caafd9eaa23de73b26331 UNKNOWN * 270eee7ef88fc59339675b1443b8918e63015fed Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4459: [HUDI-3116]Add a new HoodieDropPartitionsTool to let users drop table partitions through a standalone job.

2021-12-29 Thread GitBox
hudi-bot commented on pull request #4459: URL: https://github.com/apache/hudi/pull/4459#issuecomment-1002527922 ## CI report: * d9182c1661e37f29622caafd9eaa23de73b26331 UNKNOWN * 270eee7ef88fc59339675b1443b8918e63015fed Azure:

[jira] [Updated] (HUDI-3052) Flaky TestJsonKafkaSource in CI runs

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3052?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3052: - Fix Version/s: 0.11.0 > Flaky TestJsonKafkaSource in CI runs > - > >

[jira] [Updated] (HUDI-3106) Fix HiveSyncTool not sync schema

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3106: - Fix Version/s: 0.11.0 0.10.1 > Fix HiveSyncTool not sync schema >

[jira] [Updated] (HUDI-3106) Fix HiveSyncTool not sync schema

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3106: - Reporter: Raymond Xu (was: Forward Xu) > Fix HiveSyncTool not sync schema >

[jira] [Updated] (HUDI-3099) Purge drop partition for spark sql

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-3099: - Fix Version/s: 0.11.0 > Purge drop partition for spark sql > -- > >

[GitHub] [hudi] xushiyan commented on issue #4461: [SUPPORT]Hudi(0.10.0) write to Aliyun oss using metadata table warning

2021-12-29 Thread GitBox
xushiyan commented on issue #4461: URL: https://github.com/apache/hudi/issues/4461#issuecomment-1002462150 @nikenfls pls let us know if it gets resolved. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[jira] [Updated] (HUDI-2837) The original hoodie.table.name should be maintained in Spark SQL

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2837: - Reviewers: Raymond Xu, Yann Byron (was: Raymond Xu) > The original hoodie.table.name should be

[jira] [Updated] (HUDI-2426) spark sql extensions breaks read.table from metastore

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2426: - Fix Version/s: 0.10.1 > spark sql extensions breaks read.table from metastore >

[jira] [Updated] (HUDI-2611) `create table if not exists` should print message instead of throwing error

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2611: - Fix Version/s: 0.10.1 > `create table if not exists` should print message instead of throwing error >

[jira] [Updated] (HUDI-2661) java.lang.NoSuchMethodError: org.apache.spark.sql.catalyst.catalog.CatalogTable.copy

2021-12-29 Thread Raymond Xu (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Raymond Xu updated HUDI-2661: - Fix Version/s: 0.10.1 > java.lang.NoSuchMethodError: >

<    1   2   3