[jira] [Commented] (HUDI-1741) Row Level TTL Support for records stored in Hudi

2022-10-27 Thread leesf (Jira)


[ 
https://issues.apache.org/jira/browse/HUDI-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17625407#comment-17625407
 ] 

leesf commented on HUDI-1741:
-

[~nicholasjiang] I agree with the solution.

> Row Level TTL Support for records stored in Hudi
> 
>
> Key: HUDI-1741
> URL: https://issues.apache.org/jira/browse/HUDI-1741
> Project: Apache Hudi
> Issue Type: New Feature
> Components: Utilities
> Reporter: Balaji Varadarajan
> Priority: Major
>
> E.g.: retain only records updated within the last month.
>  
> GH: https://github.com/apache/hudi/issues/2743



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (HUDI-1741) Row Level TTL Support for records stored in Hudi

2022-10-27 Thread Nicholas Jiang (Jira)


[ 
https://issues.apache.org/jira/browse/HUDI-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17625400#comment-17625400
 ] 

Nicholas Jiang commented on HUDI-1741:
--

[~shivnarayan], IMO, every Hudi record carries its commit time. The 
solution is to first enforce the TTL at read time: do not return expired data 
to queries, or even push the filter down to the data source directly. The 
expired records are then physically deleted during operations that need to 
rewrite the data, such as clustering. WDYT?
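
A minimal sketch of the two-phase idea described above, written in plain Python rather than against Hudi's API. It assumes each record is a dict carrying Hudi's `_hoodie_commit_time` meta field as a `yyyyMMddHHmmss` string (trailing digits such as milliseconds are ignored). The function names `read_with_ttl` and `rewrite_with_ttl` are hypothetical and only illustrate the split between hiding expired rows on read and dropping them on rewrite.

```python
from datetime import datetime, timedelta

# Hudi's per-record commit-time meta column.
COMMIT_TIME_FIELD = "_hoodie_commit_time"


def parse_commit_time(instant: str) -> datetime:
    # Assumption: the instant starts with a yyyyMMddHHmmss timestamp;
    # anything after the first 14 characters is ignored.
    return datetime.strptime(instant[:14], "%Y%m%d%H%M%S")


def is_expired(record: dict, now: datetime, ttl: timedelta) -> bool:
    # A record is expired once its commit time is older than the TTL.
    return now - parse_commit_time(record[COMMIT_TIME_FIELD]) > ttl


def read_with_ttl(records, now, ttl):
    """Read path (hypothetical): hide expired records, leave storage untouched."""
    return [r for r in records if not is_expired(r, now, ttl)]


def rewrite_with_ttl(records, now, ttl):
    """Rewrite path (hypothetical, e.g. during clustering): drop expired records."""
    kept = [r for r in records if not is_expired(r, now, ttl)]
    return kept, len(records) - len(kept)
```

With a 30-day TTL, a reader would call `read_with_ttl(records, now, timedelta(days=30))` on every query, while a rewriting job would persist only the `kept` list returned by `rewrite_with_ttl`.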



[jira] [Commented] (HUDI-1741) Row Level TTL Support for records stored in Hudi

2021-04-05 Thread Aditya Tiwari (Jira)


[ 
https://issues.apache.org/jira/browse/HUDI-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17314925#comment-17314925
 ] 

Aditya Tiwari commented on HUDI-1741:
-

[~pratyakshsharma] I guess that with a time-based cleaning policy, we might 
need some modifications in the compactor as well. 

Even a recently updated base file may still contain some older records.


A time-based cleaner, combined with filtering out records with older commit 
times while compacting (in MOR) or rewriting (in COW) the base file, should 
solve the issue.
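
To illustrate why file-level cleaning alone is not enough, here is a rough sketch (plain Python, not Hudi's compactor) of a compaction step that merges log updates into a base file and then filters by record commit time. The `key` field, the dict-based records, and the function name `compact_with_ttl` are assumptions made for the example; only `_hoodie_commit_time` is an actual Hudi meta field.

```python
from datetime import datetime, timedelta


def commit_time(record: dict) -> datetime:
    # Assumption: the commit time starts with a yyyyMMddHHmmss timestamp.
    return datetime.strptime(record["_hoodie_commit_time"][:14], "%Y%m%d%H%M%S")


def compact_with_ttl(base_records, log_records, now, ttl):
    """Merge log updates into the base records by key, then drop expired records."""
    merged = {r["key"]: r for r in base_records}
    for update in log_records:  # log records are assumed to be in commit order
        merged[update["key"]] = update
    cutoff = now - ttl
    # The rewritten base file is recent, but untouched records keep their old
    # commit time and are filtered out here.
    return [r for r in merged.values() if commit_time(r) >= cutoff]
```

In this sketch a base file that just received an update for one key still loses its other, untouched records once their commit time falls behind the cutoff.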




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (HUDI-1741) Row Level TTL Support for records stored in Hudi

2021-04-03 Thread Pratyaksh Sharma (Jira)


[ 
https://issues.apache.org/jira/browse/HUDI-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17314297#comment-17314297
 ] 

Pratyaksh Sharma commented on HUDI-1741:


I guess the same can be handled with this Jira: 
https://issues.apache.org/jira/browse/HUDI-349 ? [~vbalaji] [~shivnarayan]



[jira] [Commented] (HUDI-1741) Row Level TTL Support for records stored in Hudi

2021-03-30 Thread Balaji Varadarajan (Jira)


[ 
https://issues.apache.org/jira/browse/HUDI-1741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17311938#comment-17311938
 ] 

Balaji Varadarajan commented on HUDI-1741:
--

[~shivnarayan] : FYI
