[GitHub] [hudi] yihua opened a new pull request #4300: [HUDI-2785] Add Trino setup in Docker Demo

2021-12-13 Thread GitBox
yihua opened a new pull request #4300: URL: https://github.com/apache/hudi/pull/4300 ## What is the purpose of the pull request *(For example: This pull request adds quick-start document.)* ## Brief change log *(for example:)* - *Modify AnnotationLocation

[jira] [Updated] (HUDI-349) Make cleaner retention based on time period to account for higher deviations in ingestion runs

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-349: - Labels: core-flow-ds pull-request-available sev:high (was: pull-request-available) >

[GitHub] [hudi] nochimow opened a new issue #4299: [SUPPORT] Upsert performance severed decreased after 3 years of data loading

2021-12-13 Thread GitBox
nochimow opened a new issue #4299: URL: https://github.com/apache/hudi/issues/4299 Hi there, I'm currently facing some performance gaps in one specific table after we load 3 years of data. Our typical scenario is the following: Ingestion of 57 avro files (stored on S3) with

[jira] [Commented] (HUDI-1071) Upgrade dropwizard metrics to make use of SettableGauge

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458793#comment-17458793 ] sivabalan narayanan commented on HUDI-1071: --- [~rxu] : in master we have 4.1.0. Are we good to

[GitHub] [hudi] hudi-bot commented on pull request #4286: [WIP][HUDI-2955] Upgrade Hadoop to 3.3.1, Hive to 3.1.2, HBase to 2.4.8

2021-12-13 Thread GitBox
hudi-bot commented on pull request #4286: URL: https://github.com/apache/hudi/pull/4286#issuecomment-992996624 ## CI report: * 4e607fb282b9098d297c56a8fa1f367dd8294201 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4286: [WIP][HUDI-2955] Upgrade Hadoop to 3.3.1, Hive to 3.1.2, HBase to 2.4.8

2021-12-13 Thread GitBox
hudi-bot removed a comment on pull request #4286: URL: https://github.com/apache/hudi/pull/4286#issuecomment-992958415 ## CI report: * 4a459976c56d12c1beb46284862113b866bba284 Azure:

[jira] [Updated] (HUDI-1420) HoodieTableMetaClient.getMarkerFolderPath works incorrectly on windows client with hdfs server for wrong file seperator

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1420: -- Labels: core-flow-ds pull-request-available sev:critical (was: pull-request-available)

[jira] [Commented] (HUDI-2955) Upgrade Hadoop to 3.3.x

2021-12-13 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458782#comment-17458782 ] Alexey Kudinkin commented on HUDI-2955: --- Example Hive 3 setup:

[jira] [Updated] (HUDI-1936) Introduce a optional property for conditional upsert

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1936: -- Labels: features pull-request-available sev:high (was: features

[jira] [Commented] (HUDI-2059) When log exists in mor table, clustering is triggered. The query result shows that the update record in log is lost

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458751#comment-17458751 ] sivabalan narayanan commented on HUDI-2059: --- May I know what's the latest on this ticket? Is it

[jira] [Updated] (HUDI-2059) When log exists in mor table, clustering is triggered. The query result shows that the update record in log is lost

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2059: -- Labels: pull-request-available sev:high (was: pull-request-available) > When log

[jira] [Updated] (HUDI-2083) Hudi CLI does not work with S3

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2083: -- Labels: pull-request-available query-eng sev:high (was: pull-request-available) >

[jira] [Updated] (HUDI-2183) HiveSyncTool java.lang.NoClassDefFoundError: org/json/JSONException

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2183: -- Labels: HiveSyncTool query-eng user-support-issues (was: HiveSyncTool) > HiveSyncTool

[jira] [Updated] (HUDI-2270) Remove corrupted clean action

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2270: -- Labels: core-flow-ds pull-request-available sev:critical (was: pull-request-available)

[jira] [Resolved] (HUDI-2297) Estimate available memory size for spillable map accurately

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2297. --- > Estimate available memory size for spillable map accurately >

[jira] [Updated] (HUDI-2297) Estimate available memory size for spillable map accurately

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2297: -- Fix Version/s: 0.10.0 > Estimate available memory size for spillable map accurately >

[GitHub] [hudi] hudi-bot commented on pull request #4286: [WIP][HUDI-2955] Upgrade Hadoop to 3.3.1, Hive to 3.1.2, HBase to 2.4.8

2021-12-13 Thread GitBox
hudi-bot commented on pull request #4286: URL: https://github.com/apache/hudi/pull/4286#issuecomment-992958415 ## CI report: * 4a459976c56d12c1beb46284862113b866bba284 Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4286: [WIP][HUDI-2955] Upgrade Hadoop to 3.3.1, Hive to 3.1.2, HBase to 2.4.8

2021-12-13 Thread GitBox
hudi-bot removed a comment on pull request #4286: URL: https://github.com/apache/hudi/pull/4286#issuecomment-992956442 ## CI report: * 4a459976c56d12c1beb46284862113b866bba284 Azure:

[jira] [Updated] (HUDI-2413) Sql source in delta streamer does not work

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2413: -- Labels: core-flow-ds pull-request-available sev:high (was: pull-request-available) >

[jira] [Updated] (HUDI-2364) Run compaction without user schema file provided

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2364: -- Labels: core-flow-ds pull-request-available sev:normal (was: pull-request-available)

[jira] [Updated] (HUDI-2402) Hive Sync supports Kerberos authentication

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2402: -- Labels: pull-request-available query-eng sev:high (was: pull-request-available) >

[jira] [Updated] (HUDI-2417) Add support allowDuplicateInserts in HoodieJavaClient

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2417: -- Labels: pull-request-available sev:normal (was: pull-request-available) > Add support

[GitHub] [hudi] alexeykudinkin commented on pull request #4020: [WIP][HUDI-2783] Upgrade HBase to 2.x

2021-12-13 Thread GitBox
alexeykudinkin commented on pull request #4020: URL: https://github.com/apache/hudi/pull/4020#issuecomment-992957507 I've been trying to fix the IT tests after the HBase upgrade and kept hitting HBase class conflicts between our HBase deps and the Hadoop 2.x deps (there are non-BWC changes).

[jira] [Resolved] (HUDI-2539) Update the config keys of 0.8.0 version in the docs to 0.9.0

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2539. --- > Update the config keys of 0.8.0 version in the docs to 0.9.0 >

[jira] [Updated] (HUDI-2527) Flaky test: TestHoodieClientMultiWriter.testMultiWriterWithAsyncTableServicesWithConflict

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2527: -- Status: Resolved (was: Patch Available) > Flaky test: >

[GitHub] [hudi] hudi-bot removed a comment on pull request #4286: [WIP][HUDI-2955] Upgrade Hadoop to 3.3.1, Hive to 3.1.2, HBase to 2.4.8

2021-12-13 Thread GitBox
hudi-bot removed a comment on pull request #4286: URL: https://github.com/apache/hudi/pull/4286#issuecomment-991552638 ## CI report: * 4a459976c56d12c1beb46284862113b866bba284 Azure:

[GitHub] [hudi] hudi-bot commented on pull request #4286: [WIP][HUDI-2955] Upgrade Hadoop to 3.3.1, Hive to 3.1.2, HBase to 2.4.8

2021-12-13 Thread GitBox
hudi-bot commented on pull request #4286: URL: https://github.com/apache/hudi/pull/4286#issuecomment-992956442 ## CI report: * 4a459976c56d12c1beb46284862113b866bba284 Azure:

[GitHub] [hudi] nsivabalan commented on pull request #3813: [HUDI-2563][hudi-client] Refactor CompactionTriggerStrategy.

2021-12-13 Thread GitBox
nsivabalan commented on pull request #3813: URL: https://github.com/apache/hudi/pull/3813#issuecomment-992956175 Ccn @yihua who worked on lot of client refactoring code

[jira] [Updated] (HUDI-2665) Overflow of DataOutputStream may lead to corrupted log block

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2665: -- Fix Version/s: 0.10.0 > Overflow of DataOutputStream may lead to corrupted log block >

[jira] [Updated] (HUDI-2658) When disable auto clean, do not check if MIN_COMMITS_TO_KEEP was larger CLEANER_COMMITS_RETAINED

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2658: -- Labels: pull-request-available sev:normal (was: pull-request-available) > When disable

[jira] [Resolved] (HUDI-2665) Overflow of DataOutputStream may lead to corrupted log block

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2665. --- > Overflow of DataOutputStream may lead to corrupted log block >

[jira] [Resolved] (HUDI-2679) TestMergeIntoLogOnlyTable typo

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2679. --- > TestMergeIntoLogOnlyTable typo > -- > >

[jira] [Updated] (HUDI-2683) Parallelize deleting archived hoodie commits

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2683: -- Labels: pull-request-available sev:high (was: pull-request-available) > Parallelize

[jira] [Updated] (HUDI-2675) Not an Avro data file

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2675: -- Labels: core-flow-ds pull-request-available sev:critical (was: pull-request-available)

[jira] [Updated] (HUDI-2711) Fallback to full table scan for IncrementalRelation and HoodieIncrSource when data file is missing.

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2711: -- Labels: pull-request-available query-eng sev:high (was: pull-request-available) >

[jira] [Updated] (HUDI-2774) Async Clustering via deltstreamer fails with IllegalStateException: Duplicate key [==>20211116123724586__replacecommit__INFLIGHT]

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2774: -- Labels: core-flow-ds pull-request-available sev:high (was: pull-request-available) >

[jira] [Updated] (HUDI-2775) Add documentation for external configuration support

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2775: -- Fix Version/s: 0.10.0 > Add documentation for external configuration support >

[jira] [Resolved] (HUDI-2775) Add documentation for external configuration support

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2775. --- > Add documentation for external configuration support >

[jira] [Updated] (HUDI-2779) Cache BaseDir if HudiTableNotFound Exception thrown

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2779: -- Fix Version/s: 0.10.0 > Cache BaseDir if HudiTableNotFound Exception thrown >

[jira] [Resolved] (HUDI-2779) Cache BaseDir if HudiTableNotFound Exception thrown

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2779. --- > Cache BaseDir if HudiTableNotFound Exception thrown >

[jira] [Updated] (HUDI-2777) Data import performance deteriorates because multiple Spark jobs are started when data is written to disks.

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2777: -- Labels: pull-request-available query-eng sev:high (was: pull-request-available) >

[jira] [Updated] (HUDI-2780) Mor reads the log file and skips the complete block as a bad block, resulting in data loss

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2780: -- Labels: core-flow-ds pull-request-available sev:critical (was: pull-request-available)

[jira] [Updated] (HUDI-2833) Clean up unused archive files instead of expanding indefinitely

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2833: -- Labels: core-flow-ds pull-request-available sev:high (was: pull-request-available) >

[jira] [Updated] (HUDI-2848) When I run hudi-cli.sh using hadoop 3.2.1, there is a error about class conflict

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2848: -- Fix Version/s: 0.10.0 > When I run hudi-cli.sh using hadoop 3.2.1, there is a error

[jira] [Resolved] (HUDI-2848) When I run hudi-cli.sh using hadoop 3.2.1, there is a error about class conflict

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2848. --- > When I run hudi-cli.sh using hadoop 3.2.1, there is a error about class > conflict >

[jira] [Updated] (HUDI-2857) HoodieTableMetaClient.TEMPFOLDER_NAME causes IllegalArgumentException in windows environment

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2857: -- Labels: core-flow-ds easyfix sev:high (was: easyfix) >

[jira] [Updated] (HUDI-2876) hudi should remove the temp file which create by HoodieMergedLogRecordScanner, when we use presto

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2876: -- Labels: pull-request-available query-eng sev:high (was: pull-request-available) >

[jira] [Resolved] (HUDI-2885) Add call for voting for latest release on hudi homepage

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2885. --- > Add call for voting for latest release on hudi homepage >

[jira] [Updated] (HUDI-2884) Allow loading external configs while querying Hudi tables with Spark

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2884: -- Fix Version/s: 0.11.0 > Allow loading external configs while querying Hudi tables with

[jira] [Commented] (HUDI-2884) Allow loading external configs while querying Hudi tables with Spark

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458742#comment-17458742 ] sivabalan narayanan commented on HUDI-2884: --- [~wenningd] : are there any pending items here or

[jira] [Updated] (HUDI-2885) Add call for voting for latest release on hudi homepage

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2885: -- Fix Version/s: 0.10.0 > Add call for voting for latest release on hudi homepage >

[jira] [Assigned] (HUDI-2892) Pending Clustering may stain the ActiveTimeLine and lead to incomplete query results

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2892: - Assignee: Yue Zhang > Pending Clustering may stain the ActiveTimeLine and lead

[jira] [Updated] (HUDI-2892) Pending Clustering may stain the ActiveTimeLine and lead to incomplete query results

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2892: -- Fix Version/s: 0.11.0 > Pending Clustering may stain the ActiveTimeLine and lead to

[jira] [Resolved] (HUDI-2892) Pending Clustering may stain the ActiveTimeLine and lead to incomplete query results

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2892. --- > Pending Clustering may stain the ActiveTimeLine and lead to incomplete query > results

[jira] [Commented] (HUDI-2901) Fixed the bug clustering jobs are not running in parallel

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458740#comment-17458740 ] sivabalan narayanan commented on HUDI-2901: --- Please "resolve" the ticket if the patch has been

[jira] [Updated] (HUDI-2903) get table schema from the last commit with data written

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2903: -- Labels: pull-request-available sev:high (was: pull-request-available) > get table

[jira] [Updated] (HUDI-2915) Fix field not found in record error for spark-sql

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2915: -- Summary: Fix field not found in record error for spark-sql (was: Fix field not found

[jira] [Updated] (HUDI-2909) KeyGenerator is broken in 0.10.0

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2909: -- Labels: core-flow-ds pull-request-available sev:critical (was: pull-request-available)

[jira] [Assigned] (HUDI-2955) Upgrade Hadoop to 3.3.x

2021-12-13 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin reassigned HUDI-2955: - Assignee: Alexey Kudinkin > Upgrade Hadoop to 3.3.x > --- > >

[jira] [Updated] (HUDI-2925) Cleaner may attempt to delete the same file twice when metadata table is enabled

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2925: -- Labels: core-flow-ds pull-request-available sev:high (was: pull-request-available) >

[jira] [Commented] (HUDI-2946) Upgrade maven plugin to make Hudi be compatible with higher Java versions

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458738#comment-17458738 ] sivabalan narayanan commented on HUDI-2946: --- CCn [~alexey.kudinkin] who is looking into jdk 11

[jira] [Updated] (HUDI-2958) Automatically set spark.sql.parquet.writelegacyformat; When using bulkinsert to insert data which contains decimal Type.

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2958: -- Labels: pull-request-available query-eng sev:high (was: pull-request-available) >

[jira] [Commented] (HUDI-2966) Add TaskCompletionListener for HoodieMergeOnReadRDD to close logScanner

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458736#comment-17458736 ] sivabalan narayanan commented on HUDI-2966: --- [~xiaotaotao] : good job on the fix. Did you check

[jira] [Updated] (HUDI-2966) Add TaskCompletionListener for HoodieMergeOnReadRDD to close logScanner

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2966: -- Labels: core-flow-ds pull-request-available sev:high (was: pull-request-available) >

[jira] [Updated] (HUDI-2962) Support JVM based local process lock provider implementation

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2962: -- Labels: pull-request-available sev:high (was: pull-request-available) > Support JVM

[jira] [Assigned] (HUDI-2966) Add TaskCompletionListener for HoodieMergeOnReadRDD to close logScanner

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2966: - Assignee: tao meng > Add TaskCompletionListener for HoodieMergeOnReadRDD to

[jira] [Updated] (HUDI-2978) Change default index type to Simple

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2978: -- Labels: release-notes sev:high (was: release-notes) > Change default index type to

[jira] [Resolved] (HUDI-2975) Update website docs for 0.10.0 release

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2975. --- > Update website docs for 0.10.0 release > -- > >

[jira] [Assigned] (HUDI-2974) Make the prefix for metrics name configurable

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-2974: - Assignee: Rajesh Mahindra > Make the prefix for metrics name configurable >

[jira] [Resolved] (HUDI-2974) Make the prefix for metrics name configurable

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan resolved HUDI-2974. --- > Make the prefix for metrics name configurable >

[jira] [Updated] (HUDI-2994) Add judgement to existed partitionPath in the catch code block for HUDI-2743

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2994: -- Labels: core-flow-ds pull-request-available sev:critical (was: pull-request-available)

[jira] [Updated] (HUDI-2983) Remove all Log4j2 transitive dependencies

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2983: -- Labels: pull-request-available sev:high (was: pull-request-available) > Remove all

[jira] [Updated] (HUDI-2990) Sync to HMS when deleting partitions

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2990: -- Labels: pull-request-available sev:normal (was: pull-request-available sev:high) >

[jira] [Updated] (HUDI-2990) Sync to HMS when deleting partitions

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2990: -- Labels: pull-request-available sev:high (was: pull-request-available) > Sync to HMS

[jira] [Updated] (HUDI-2997) Skip the corrupt meta file for pending rollback action

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-2997: -- Labels: core-flow-ds pull-request-available sev:critical (was: pull-request-available)

[GitHub] [hudi] hudi-bot commented on pull request #4293: [HUDI-2981][HUDI-2982] Metadata table - enabling virtual keys and key deduplication by default

2021-12-13 Thread GitBox
hudi-bot commented on pull request #4293: URL: https://github.com/apache/hudi/pull/4293#issuecomment-992929848 ## CI report: * 574e3e740029210c50f163fa88995cc41495bb8b Azure:

[GitHub] [hudi] hudi-bot removed a comment on pull request #4293: [HUDI-2981][HUDI-2982] Metadata table - enabling virtual keys and key deduplication by default

2021-12-13 Thread GitBox
hudi-bot removed a comment on pull request #4293: URL: https://github.com/apache/hudi/pull/4293#issuecomment-992883217 ## CI report: * 3c75b5af15f36dcc8ea7b0dbb9408134cca13c82 Azure:

[jira] [Updated] (HUDI-258) Hive Query engine not supporting join queries between RT and RO tables

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-258: - Labels: bug-bash-0.6.0 help-requested query-eng user-support-issues (was: bug-bash-0.6.0

[jira] [Updated] (HUDI-281) HiveSync failure through Spark when useJdbc is set to false

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-281: - Labels: query-eng user-support-issues (was: user-support-issues) > HiveSync failure

[jira] [Commented] (HUDI-281) HiveSync failure through Spark when useJdbc is set to false

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-281?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458722#comment-17458722 ] sivabalan narayanan commented on HUDI-281: -- [~uditme] : with support for "hms" mode, is this Jira

[jira] [Commented] (HUDI-465) Make Hive Sync via Spark painless

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458719#comment-17458719 ] sivabalan narayanan commented on HUDI-465: -- [~309637554] : with "hms" mode support, and default

[jira] [Updated] (HUDI-465) Make Hive Sync via Spark painless

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-465: - Labels: help-wanted query-eng sev:normal starter user-support-issues (was: help-wanted

[jira] [Commented] (HUDI-691) hoodie.*.consume.* should be set whitelist in hive-site.xml

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458718#comment-17458718 ] sivabalan narayanan commented on HUDI-691: -- CCn [~KWeller]  > hoodie.*.consume.* should be set

[jira] [Updated] (HUDI-691) hoodie.*.consume.* should be set whitelist in hive-site.xml

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-691: - Labels: query-eng sev:high user-support-issues (was: query-eng user-support-issues) >

[jira] [Updated] (HUDI-691) hoodie.*.consume.* should be set whitelist in hive-site.xml

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-691: - Labels: query-eng user-support-issues (was: user-support-issues) > hoodie.*.consume.*

[jira] [Commented] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458717#comment-17458717 ] sivabalan narayanan commented on HUDI-735: -- [~nicholasjiang] : Sorry, is it just the error msg is

[jira] [Updated] (HUDI-735) Improve deltastreamer error message when case mismatch of commandline arguments.

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-735: - Labels: core-flow-ds sev:normal user-support-issues (was: sev:normal user-support-issues)

[jira] [Comment Edited] (HUDI-1022) Document examples for Spark structured streaming writing into Hudi

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458708#comment-17458708 ] sivabalan narayanan edited comment on HUDI-1022 at 12/13/21, 8:51 PM: --

[jira] [Updated] (HUDI-1022) Document examples for Spark structured streaming writing into Hudi

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1022: -- Labels: docs sev:normal user-support-issues (was: sev:normal user-support-issues) >

[jira] [Updated] (HUDI-851) Add Documentation on partitioning data with examples and details on how to sync to Hive

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-851: - Labels: query-eng user-support-issues (was: user-support-issues) > Add Documentation on

[jira] [Commented] (HUDI-1022) Document examples for Spark structured streaming writing into Hudi

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458708#comment-17458708 ] sivabalan narayanan commented on HUDI-1022: --- [~codope] : please go ahead. Felix can chime in or

[jira] [Updated] (HUDI-1036) HoodieCombineHiveInputFormat not picking up HoodieRealtimeFileSplit

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1036: -- Labels: query-eng sev:high user-support-issues (was: sev:normal user-support-issues)

[jira] [Assigned] (HUDI-1022) Document examples for Spark structured streaming writing into Hudi

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan reassigned HUDI-1022: - Assignee: Sagar Sumit (was: Felix Kizhakkel Jose) > Document examples for Spark

[jira] [Updated] (HUDI-1210) Update doc to clarify that start timestamp is exclusive for incremental queries

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1210: -- Labels: docs query-eng sev:normal user-support-issues (was: user-support-issues) >

[jira] [Commented] (HUDI-1210) Update doc to clarify that start timestamp is exclusive for incremental queries

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458707#comment-17458707 ] sivabalan narayanan commented on HUDI-1210: --- CC [~KWeller] > Update doc to clarify that start

[jira] [Updated] (HUDI-1221) Ensure docker demo page reflects the latest support on all query engines

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1221: -- Labels: documentation sev:high user-support-issues (was: documentation

[jira] [Commented] (HUDI-1221) Ensure docker demo page reflects the latest support on all query engines

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17458705#comment-17458705 ] sivabalan narayanan commented on HUDI-1221: --- [~GaryLi] / [~bhavanisudha] : Can either of you

[jira] [Updated] (HUDI-1221) Ensure docker demo page reflects the latest support on all query engines

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1221: -- Labels: documentation query-eng sev:high user-support-issues (was: documentation

[jira] [Updated] (HUDI-1278) Need a generic payload class which can skip late arriving data based on specific fields

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1278: -- Labels: core-flow-ds sev:normal user-support-issues (was: sev:normal

[jira] [Updated] (HUDI-1549) Programmatic way to fetch earliest commit retained

2021-12-13 Thread sivabalan narayanan (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] sivabalan narayanan updated HUDI-1549: -- Labels: query-eng sev:normal user-support-issues (was: sev:normal user-support-issues)
