[jira] [Commented] (HUDI-320) Keep docs on master instead of asf-site branch

2019-11-01 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16965070#comment-16965070 ] Ethan Guo commented on HUDI-320: I can take this up.  Found the other ticket for versioning the docs:

[jira] [Comment Edited] (HUDI-320) Keep docs on master instead of asf-site branch

2019-11-01 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16965070#comment-16965070 ] Ethan Guo edited comment on HUDI-320 at 11/1/19 8:18 PM: - I can take this up. 

[jira] [Commented] (HUDI-226) Hudi Website - Provide links to documentation corresponding to older release versions

2019-11-02 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16965478#comment-16965478 ] Ethan Guo commented on HUDI-226: [~yanghua], are you actively working on this ticket?  I can take this up

[jira] [Closed] (HUDI-320) Keep docs on master instead of asf-site branch

2019-11-02 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo closed HUDI-320. -- Resolution: Duplicate > Keep docs on master instead of asf-site branch >

[jira] [Commented] (HUDI-226) Hudi Website - Provide links to documentation corresponding to older release versions

2019-11-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16967004#comment-16967004 ] Ethan Guo commented on HUDI-226: [~yanghua] Sg.  Vinoth was my manager and mentor at Uber before :) > Hudi

[jira] [Created] (HUDI-320) Keep docs on master instead of asf-site branch

2019-10-31 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-320: -- Summary: Keep docs on master instead of asf-site branch Key: HUDI-320 URL: https://issues.apache.org/jira/browse/HUDI-320 Project: Apache Hudi (incubating) Issue Type:

[jira] [Commented] (HUDI-320) Keep docs on master instead of asf-site branch

2019-10-31 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16964472#comment-16964472 ] Ethan Guo commented on HUDI-320: It happened to me that my colleague followed the latest docs to use the

[jira] [Updated] (HUDI-320) Keep docs on master instead of asf-site branch

2019-10-31 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-320: --- Description: Given that each version has new features and improvements, some involving configuration and

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-10-31 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16964456#comment-16964456 ] Ethan Guo commented on HUDI-76: --- After some exploration, here are my initial thoughts on how to implement this

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-10-31 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16964457#comment-16964457 ] Ethan Guo commented on HUDI-76: --- I'll write a PoC in the next few days and create WIP PR. > CSV Source

[jira] [Comment Edited] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-10-31 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16964457#comment-16964457 ] Ethan Guo edited comment on HUDI-76 at 10/31/19 11:50 PM: -- I'll write a PoC in the

[jira] [Created] (HUDI-319) Create online javadocs based on the jar

2019-10-31 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-319: -- Summary: Create online javadocs based on the jar Key: HUDI-319 URL: https://issues.apache.org/jira/browse/HUDI-319 Project: Apache Hudi (incubating) Issue Type: New

[jira] [Assigned] (HUDI-319) Create online javadocs based on the jar

2019-10-31 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-319: -- Assignee: Ethan Guo > Create online javadocs based on the jar > ---

[jira] [Assigned] (HUDI-132) [Good to do] Automate doc update/deploy process

2019-11-13 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-132: -- Assignee: Ethan Guo > [Good to do] Automate doc update/deploy process >

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-11-14 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16974562#comment-16974562 ] Ethan Guo commented on HUDI-76: --- [~vinoth]  CSV to Row conversion would make the change much simpler.  I can

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-11-14 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16974563#comment-16974563 ] Ethan Guo commented on HUDI-76: --- And the performance problem will be resolved once the row-to-avro conversion

[jira] [Closed] (HUDI-276) Translate Documentation -> Configurations page

2019-11-11 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo closed HUDI-276. -- Resolution: Fixed This PR is merged: [https://github.com/apache/incubator-hudi/pull/1006] > Translate

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-11-12 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16973025#comment-16973025 ] Ethan Guo commented on HUDI-76: --- [~vinoth] [~xleesf] > CSV Source support for Hudi Delta Streamer >

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-11-15 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16975480#comment-16975480 ] Ethan Guo commented on HUDI-76: --- [~vinoth] [~vbalaji] [~xleesf]   Sounds good.  I updated RFC-1 based on the

[jira] [Created] (HUDI-336) Improve landing page related content on Hudi website

2019-11-15 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-336: -- Summary: Improve landing page related content on Hudi website Key: HUDI-336 URL: https://issues.apache.org/jira/browse/HUDI-336 Project: Apache Hudi (incubating) Issue

[jira] [Updated] (HUDI-336) Improve landing page related content on Hudi website

2019-11-15 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-336: --- Description: The landing page of [hudi.apache.org|https://hudi.apache.org/] shows detailed information about

[jira] [Commented] (HUDI-336) Improve landing page related content on Hudi website

2019-11-15 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-336?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16975516#comment-16975516 ] Ethan Guo commented on HUDI-336: [~xleesf] [~vinoth] Moving previous comments on RFC-10 around this here:  

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-11-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16976901#comment-16976901 ] Ethan Guo commented on HUDI-76: --- [~vinoth]  I'm okay with the existing lastModifiedTimes approach.  Just

[jira] [Created] (HUDI-407) Implement a join-based index

2019-12-13 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-407: -- Summary: Implement a join-based index Key: HUDI-407 URL: https://issues.apache.org/jira/browse/HUDI-407 Project: Apache Hudi (incubating) Issue Type: Improvement

[jira] [Created] (HUDI-411) Quantify the benefit of sizing files using benchmarks

2019-12-13 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-411: -- Summary: Quantify the benefit of sizing files using benchmarks Key: HUDI-411 URL: https://issues.apache.org/jira/browse/HUDI-411 Project: Apache Hudi (incubating) Issue

[jira] [Created] (HUDI-412) Push file ranges to timeline server

2019-12-13 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-412: -- Summary: Push file ranges to timeline server Key: HUDI-412 URL: https://issues.apache.org/jira/browse/HUDI-412 Project: Apache Hudi (incubating) Issue Type: Improvement

[jira] [Assigned] (HUDI-412) Push file ranges to timeline server

2019-12-13 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-412: -- Assignee: Ethan Guo > Push file ranges to timeline server > --- > >

[jira] [Assigned] (HUDI-411) Quantify the benefit of sizing files using benchmarks

2019-12-13 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-411: -- Assignee: Ethan Guo > Quantify the benefit of sizing files using benchmarks >

[jira] [Created] (HUDI-413) Use ColumnIndex in parquet to speed up scans

2019-12-13 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-413: -- Summary: Use ColumnIndex in parquet to speed up scans Key: HUDI-413 URL: https://issues.apache.org/jira/browse/HUDI-413 Project: Apache Hudi (incubating) Issue Type:

[jira] [Created] (HUDI-273) Translate Documentation -> Writing Data page

2019-09-24 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-273: -- Summary: Translate Documentation -> Writing Data page Key: HUDI-273 URL: https://issues.apache.org/jira/browse/HUDI-273 Project: Apache Hudi (incubating) Issue Type:

[jira] [Updated] (HUDI-275) Translate Documentation -> Querying Data page

2019-09-24 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-275: --- Component/s: docs-chinese > Translate Documentation -> Querying Data page >

[jira] [Updated] (HUDI-273) Translate Documentation -> Writing Data page

2019-09-24 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-273: --- Component/s: docs-chinese > Translate Documentation -> Writing Data page >

[jira] [Assigned] (HUDI-275) Translate Documentation -> Querying Data page

2019-09-24 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-275: -- Assignee: Ethan Guo > Translate Documentation -> Querying Data page >

[jira] [Created] (HUDI-275) Translate Documentation -> Querying Data page

2019-09-24 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-275: -- Summary: Translate Documentation -> Querying Data page Key: HUDI-275 URL: https://issues.apache.org/jira/browse/HUDI-275 Project: Apache Hudi (incubating) Issue Type:

[jira] [Created] (HUDI-276) Translate Documentation -> Configurations page

2019-09-24 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-276: -- Summary: Translate Documentation -> Configurations page Key: HUDI-276 URL: https://issues.apache.org/jira/browse/HUDI-276 Project: Apache Hudi (incubating) Issue Type:

[jira] [Updated] (HUDI-273) Translate Documentation -> Writing Data page

2019-09-24 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-273: --- Fix Version/s: 0.6.0 > Translate Documentation -> Writing Data page >

[jira] [Updated] (HUDI-275) Translate Documentation -> Querying Data page

2019-09-24 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-275: --- Fix Version/s: 0.6.0 > Translate Documentation -> Querying Data page >

[jira] [Created] (HUDI-277) Translate Documentation -> Performance page

2019-09-24 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-277: -- Summary: Translate Documentation -> Performance page Key: HUDI-277 URL: https://issues.apache.org/jira/browse/HUDI-277 Project: Apache Hudi (incubating) Issue Type:

[jira] [Commented] (HUDI-276) Translate Documentation -> Configurations page

2019-10-04 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16944975#comment-16944975 ] Ethan Guo commented on HUDI-276: 我在美国,已经有一段时间使用 Hudi 了。希望能完善 Hudi 的中文文档来帮助国内的开发者。 > Translate

[jira] [Commented] (HUDI-649) Code cleanup in hudi-client package

2020-02-28 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17048123#comment-17048123 ] Ethan Guo commented on HUDI-649: [~vinoth] ^ > Code cleanup in hudi-client package >

[jira] [Created] (HUDI-649) Code cleanup in hudi-client package

2020-02-28 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-649: -- Summary: Code cleanup in hudi-client package Key: HUDI-649 URL: https://issues.apache.org/jira/browse/HUDI-649 Project: Apache Hudi (incubating) Issue Type: Improvement

[jira] [Updated] (HUDI-649) Address code style issues in hudi-client package

2020-02-28 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-649: --- Summary: Address code style issues in hudi-client package (was: Code cleanup in hudi-client package) >

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2020-01-08 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17011447#comment-17011447 ] Ethan Guo commented on HUDI-76: --- [~liujinhui]  Yes, that's a good question.  I'm also addressing this issue on

[jira] [Updated] (HUDI-551) Abstract a test case class for DFS Source to make it extensible

2020-01-17 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-551: --- Description: * Create a new class {{AbstractDFSSourceTestBase}} based on  {{DFSSourceTestCase}} in the last

[jira] [Created] (HUDI-551) Abstract a test case class for DFS Source to make it extensible

2020-01-17 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-551: -- Summary: Abstract a test case class for DFS Source to make it extensible Key: HUDI-551 URL: https://issues.apache.org/jira/browse/HUDI-551 Project: Apache Hudi (incubating)

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Description: When using the `FilebasedSchemaProvider` to provide the source schema in Avro, while ingesting

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Attachment: Screen Shot 2020-01-18 at 12.15.09 AM.png > Fix the schema mismatch in Row-to-Avro conversion >

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Attachment: Screen Shot 2020-01-18 at 12.12.58 AM.png > Fix the schema mismatch in Row-to-Avro conversion >

[jira] [Created] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-17 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-552: -- Summary: Fix the schema mismatch in Row-to-Avro conversion Key: HUDI-552 URL: https://issues.apache.org/jira/browse/HUDI-552 Project: Apache Hudi (incubating) Issue

[jira] [Updated] (HUDI-551) Abstract a test case class for DFS Source to make it extensible

2020-01-17 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-551: --- Component/s: Testing > Abstract a test case class for DFS Source to make it extensible >

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Description: !Screen Shot 2020-01-18 at 12.12.58 AM.png|width=543,height=392! !Screen Shot 2020-01-18 at

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Attachment: Screen Shot 2020-01-18 at 12.13.08 AM.png > Fix the schema mismatch in Row-to-Avro conversion >

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Description: When using the `FilebasedSchemaProvider` to provide the source schema in Avro, while ingesting

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Attachment: Screen Shot 2020-01-18 at 12.31.23 AM.png > Fix the schema mismatch in Row-to-Avro conversion >

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Description: When using the `FilebasedSchemaProvider` to provide the source schema in Avro, while ingesting

[jira] [Updated] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-552: --- Description: When using the `FilebasedSchemaProvider` to provide the source schema in Avro, while ingesting

[jira] [Resolved] (HUDI-552) Fix the schema mismatch in Row-to-Avro conversion

2020-01-18 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo resolved HUDI-552. Resolution: Fixed > Fix the schema mismatch in Row-to-Avro conversion >

[jira] [Updated] (HUDI-76) CSV Source support for Hudi Delta Streamer

2020-01-20 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-76: -- Fix Version/s: (was: 0.5.1) 0.6.0 > CSV Source support for Hudi Delta Streamer >

[jira] [Updated] (HUDI-76) CSV Source support for Hudi Delta Streamer

2020-01-20 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-76: -- Fix Version/s: (was: 0.6.0) 0.5.2 > CSV Source support for Hudi Delta Streamer >

[jira] [Updated] (HUDI-76) CSV Source support for Hudi Delta Streamer

2020-01-20 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-76: -- Priority: Major (was: Minor) > CSV Source support for Hudi Delta Streamer >

[jira] [Assigned] (HUDI-132) Automate doc update/deploy process

2020-01-21 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-132: -- Assignee: lamber-ken (was: Ethan Guo) > Automate doc update/deploy process >

[jira] [Closed] (HUDI-319) Generate unified javadoc pages using maven

2020-01-08 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo closed HUDI-319. -- Resolution: Implemented > Generate unified javadoc pages using maven >

[jira] [Commented] (HUDI-472) Make sortBy() inside bulkInsertInternal() configurable for bulk_insert

2019-12-30 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17005978#comment-17005978 ] Ethan Guo commented on HUDI-472: [~hzp2000itb]  just noticed that you assigned the ticket to yourself.  Are

[jira] [Commented] (HUDI-472) Make sortBy() inside bulkInsertInternal() configurable for bulk_insert

2019-12-30 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17005976#comment-17005976 ] Ethan Guo commented on HUDI-472: I've put up a WIP PR regarding this. The original design choice to apply

[jira] [Updated] (HUDI-472) Make sortBy() inside bulkInsertInternal() configurable for bulk_insert

2019-12-31 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-472: --- Fix Version/s: 0.5.1 > Make sortBy() inside bulkInsertInternal() configurable for bulk_insert >

[jira] [Updated] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-12-31 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-76: -- Fix Version/s: 0.5.1 > CSV Source support for Hudi Delta Streamer > -- >

[jira] [Created] (HUDI-488) Refactor Source classes in hudi-utilities

2019-12-31 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-488: -- Summary: Refactor Source classes in hudi-utilities Key: HUDI-488 URL: https://issues.apache.org/jira/browse/HUDI-488 Project: Apache Hudi (incubating) Issue Type:

[jira] [Commented] (HUDI-76) CSV Source support for Hudi Delta Streamer

2019-12-24 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-76?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17002939#comment-17002939 ] Ethan Guo commented on HUDI-76: --- [~vinoth] I have a PoC ready.  Haven't added unit tests yet.  I can put up a

[jira] [Created] (HUDI-504) Restructuring and auto-generation of docs

2020-01-06 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-504: -- Summary: Restructuring and auto-generation of docs Key: HUDI-504 URL: https://issues.apache.org/jira/browse/HUDI-504 Project: Apache Hudi (incubating) Issue Type:

[jira] [Updated] (HUDI-319) Generate unified javadoc pages using maven

2020-01-06 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-319: --- Summary: Generate unified javadoc pages using maven (was: Create online javadocs based on the jar) >

[jira] [Created] (HUDI-505) Add unified javadoc to the Hudi Website

2020-01-06 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-505: -- Summary: Add unified javadoc to the Hudi Website Key: HUDI-505 URL: https://issues.apache.org/jira/browse/HUDI-505 Project: Apache Hudi (incubating) Issue Type:

[jira] [Created] (HUDI-472) Make sortBy() inside bulkInsertInternal() configurable for bulk_insert

2019-12-27 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-472: -- Summary: Make sortBy() inside bulkInsertInternal() configurable for bulk_insert Key: HUDI-472 URL: https://issues.apache.org/jira/browse/HUDI-472 Project: Apache Hudi

[jira] [Updated] (HUDI-319) Generate unified javadoc pages using maven

2020-01-08 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-319: --- Fix Version/s: 0.5.1 > Generate unified javadoc pages using maven > --

[jira] [Assigned] (HUDI-472) Make sortBy() inside bulkInsertInternal() configurable for bulk_insert

2020-01-08 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-472: -- Assignee: Ethan Guo (was: He ZongPing) > Make sortBy() inside bulkInsertInternal() configurable for

[jira] [Commented] (HUDI-472) Make sortBy() inside bulkInsertInternal() configurable for bulk_insert

2020-01-08 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010487#comment-17010487 ] Ethan Guo commented on HUDI-472: reassigning this to myself for better tracking.. > Make sortBy() inside

[jira] [Commented] (HUDI-739) HoodieIOException: Could not delete in-flight instant

2020-05-16 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17109118#comment-17109118 ] Ethan Guo commented on HUDI-739: [~shivnarayan] Looks like this issue involves Amazon S3 and EMR.  Do we

[jira] [Commented] (HUDI-1138) Re-implement marker files via timeline server

2021-06-14 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17363053#comment-17363053 ] Ethan Guo commented on HUDI-1138: - We may consider blocking the requests for batching so that the timeline

[jira] [Commented] (HUDI-1888) Fix NPE in `RowKeyGenertorHelper#getNestedFieldVal` when row writer is enabled

2021-05-17 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17345904#comment-17345904 ] Ethan Guo commented on HUDI-1888: - https://github.com/apache/hudi/pull/2957 > Fix NPE in

[jira] [Assigned] (HUDI-1888) Fix NPE in `RowKeyGenertorHelper#getNestedFieldVal` when row writer is enabled

2021-05-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-1888: --- Assignee: Ethan Guo > Fix NPE in `RowKeyGenertorHelper#getNestedFieldVal` when row writer is >

[jira] [Created] (HUDI-1888) Fix NPE in `RowKeyGenertorHelper#getNestedFieldVal` when row writer is enabled

2021-05-10 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-1888: --- Summary: Fix NPE in `RowKeyGenertorHelper#getNestedFieldVal` when row writer is enabled Key: HUDI-1888 URL: https://issues.apache.org/jira/browse/HUDI-1888 Project: Apache

[jira] [Updated] (HUDI-1888) Fix NPE in `RowKeyGenertorHelper#getNestedFieldVal` when row writer is enabled

2021-05-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-1888: Description: When row writer is enabled, NullPointerException is thrown when inserting records with

[jira] [Assigned] (HUDI-1889) Support partition path in a nested field in HoodieFileIndex

2021-05-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-1889: --- Assignee: Ethan Guo > Support partition path in a nested field in HoodieFileIndex >

[jira] [Updated] (HUDI-1889) Support partition path in a nested field in HoodieFileIndex

2021-05-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-1889: Description: Partition path in a nested field is not supported in HoodieFileIndex.  When using a nested

[jira] [Updated] (HUDI-1889) Support partition path in a nested field in HoodieFileIndex

2021-05-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-1889: Description: When using a nested field for the partition path, the following exception is thrown:

[jira] [Updated] (HUDI-1889) Support partition path in a nested field in HoodieFileIndex

2021-05-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-1889: Description: When using a nested field for the partition path, the following exception is thrown:

[jira] [Created] (HUDI-1889) Support partition path in a nested field in HoodieFileIndex

2021-05-10 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-1889: --- Summary: Support partition path in a nested field in HoodieFileIndex Key: HUDI-1889 URL: https://issues.apache.org/jira/browse/HUDI-1889 Project: Apache Hudi Issue

[jira] [Commented] (HUDI-1888) Fix NPE in `RowKeyGenertorHelper#getNestedFieldVal` when row writer is enabled

2021-05-10 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17341715#comment-17341715 ] Ethan Guo commented on HUDI-1888: - I'll send a PR to fix this. > Fix NPE in

[jira] [Commented] (HUDI-1138) Re-implement marker files via timeline server

2021-05-25 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17351505#comment-17351505 ] Ethan Guo commented on HUDI-1138: - Here is my plan for improving the marker file mechanism: * Abstraction

[jira] [Commented] (HUDI-1138) Re-implement marker files via timeline server

2021-06-03 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17356211#comment-17356211 ] Ethan Guo commented on HUDI-1138: - [~nishith29]  echoing Vinoth, we have to rely on timeline server as a

[jira] [Assigned] (HUDI-2271) Follow-up items for timeline-server-based marker files

2021-08-20 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2271: --- Assignee: Ethan Guo > Follow-up items for timeline-server-based marker files >

[jira] [Created] (HUDI-2305) Fix marker-based rollback in 0.9.0

2021-08-12 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-2305: --- Summary: Fix marker-based rollback in 0.9.0 Key: HUDI-2305 URL: https://issues.apache.org/jira/browse/HUDI-2305 Project: Apache Hudi Issue Type: Bug

[jira] [Assigned] (HUDI-2305) Fix marker-based rollback in 0.9.0

2021-08-12 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2305: --- Assignee: Ethan Guo > Fix marker-based rollback in 0.9.0 > -- > >

[jira] [Created] (HUDI-2350) Add insert/delete/update test nodes for Spark datasource/deltastreamer in integration test suite

2021-08-23 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-2350: --- Summary: Add insert/delete/update test nodes for Spark datasource/deltastreamer in integration test suite Key: HUDI-2350 URL: https://issues.apache.org/jira/browse/HUDI-2350

[jira] [Created] (HUDI-2351) Fix `Task not serializable` due to new APIs in FSUtils for marker mechanism

2021-08-24 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-2351: --- Summary: Fix `Task not serializable` due to new APIs in FSUtils for marker mechanism Key: HUDI-2351 URL: https://issues.apache.org/jira/browse/HUDI-2351 Project: Apache Hudi

[jira] [Created] (HUDI-2347) Write a blog for marker mechanisms

2021-08-23 Thread Ethan Guo (Jira)
Ethan Guo created HUDI-2347: --- Summary: Write a blog for marker mechanisms Key: HUDI-2347 URL: https://issues.apache.org/jira/browse/HUDI-2347 Project: Apache Hudi Issue Type: Bug

[jira] [Assigned] (HUDI-2347) Write a blog for marker mechanisms

2021-08-23 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2347: --- Assignee: Ethan Guo > Write a blog for marker mechanisms > -- > >

[jira] [Updated] (HUDI-2388) Add test nodes for Spark SQL in integration test suite

2021-09-01 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-2388: Issue Type: Test (was: Improvement) > Add test nodes for Spark SQL in integration test suite >

[jira] [Assigned] (HUDI-2388) Add test nodes for Spark SQL in integration test suite

2021-09-01 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2388: --- Assignee: Ethan Guo > Add test nodes for Spark SQL in integration test suite >

[jira] [Assigned] (HUDI-2350) Add insert/delete/update test nodes for Spark datasource/deltastreamer in integration test suite

2021-09-01 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo reassigned HUDI-2350: --- Assignee: Ethan Guo > Add insert/delete/update test nodes for Spark datasource/deltastreamer in >

[jira] [Updated] (HUDI-2350) Add insert/delete/update test nodes for Spark datasource/deltastreamer in integration test suite

2021-09-01 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-2350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-2350: Issue Type: Test (was: Bug) > Add insert/delete/update test nodes for Spark datasource/deltastreamer in >

  1   2   3   4   5   6   7   8   9   10   >