GDPR requirements

2017-11-07 Thread Machiel Groeneveld
Hi, the upcoming EU-wide law GDPR requires companies to remove data collected from consumers on request. I'm exploring our options for Parquet tables. I don't see any support for mutating Parquet files; if it isn't there, would it be possible to add it? I wonder if anyone has any

[jira] [Updated] (PARQUET-1089) A NullPointerException in DictionaryValuesWriter when writing Parquet

2017-11-07 Thread Irina Truong (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Irina Truong updated PARQUET-1089: -- Affects Version/s: 1.8.1 > A NullPointerException in DictionaryValuesWriter when writing

[jira] [Updated] (PARQUET-1156) dev/merge_parquet_pr.py problems

2017-11-07 Thread Zoltan Ivanfi (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zoltan Ivanfi updated PARQUET-1156: --- Description: I have run into several issues while trying to run dev/merge_parquet_pr.py

[jira] [Assigned] (PARQUET-1153) Parquet-thrift doesn't compile with Thrift 0.10.0

2017-11-07 Thread Nandor Kollar (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nandor Kollar reassigned PARQUET-1153: -- Assignee: Nandor Kollar > Parquet-thrift doesn't compile with Thrift 0.10.0 >

Re: Issues using TypedColumnReader::ReadBatchSpaced

2017-11-07 Thread Uwe L. Korn
Hello William, It seems you have hit the problem Felipe mentioned earlier. My response to that was: the parquet::ByteArray instances don't own the data, so their internal pointer might become invalid on the next call to ReadBatchSpaced. This should actually make no difference if you that
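
The ownership issue described above can be illustrated with a minimal, library-free sketch. It assumes a hypothetical `ByteArrayView` struct standing in for parquet::ByteArray (a length plus a non-owning pointer) and a hypothetical helper `CopyOut`; neither name comes from parquet-cpp. The idea is to deep-copy every value into an owning `std::string` before the reader is asked for the next batch, so that later reuse of the reader's buffer cannot invalidate values already read.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for parquet::ByteArray: it only points at bytes
// that live in a buffer owned by someone else (here, the reader).
struct ByteArrayView {
  uint32_t len;
  const uint8_t* ptr;
};

// Hypothetical helper: deep-copies each non-owning view into an owning
// std::string. Call it before requesting the next batch, while the
// pointers are still valid.
std::vector<std::string> CopyOut(const ByteArrayView* values, std::size_t count) {
  std::vector<std::string> owned;
  owned.reserve(count);
  for (std::size_t i = 0; i < count; ++i) {
    // A null pointer is only acceptable for an empty value.
    assert(values[i].ptr != nullptr || values[i].len == 0);
    if (values[i].len == 0) {
      owned.emplace_back();
    } else {
      owned.emplace_back(reinterpret_cast<const char*>(values[i].ptr),
                         values[i].len);
    }
  }
  return owned;
}
```

With the real reader, the same pattern would apply to the value array filled by each ReadBatchSpaced call: copy first, then read the next batch. The copy costs extra memory, but it decouples the lifetime of the values from the reader's internal buffers.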

[jira] [Created] (PARQUET-1156) dev/merge_parquet_pr.py problems

2017-11-07 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1156: -- Summary: dev/merge_parquet_pr.py problems Key: PARQUET-1156 URL: https://issues.apache.org/jira/browse/PARQUET-1156 Project: Parquet Issue Type: Bug

[jira] [Assigned] (PARQUET-1025) Support new min-max statistics in parquet-mr

2017-11-07 Thread Gabor Szadovszky (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gabor Szadovszky reassigned PARQUET-1025: - Assignee: Gabor Szadovszky > Support new min-max statistics in parquet-mr >