[jira] [Created] (ARROW-12635) [RUST] U64::MAX does not roundtrip through parquet

2021-05-03 Thread Marco Neumann (Jira)
Marco Neumann created ARROW-12635: - Summary: [RUST] U64::MAX does not roundtrip through parquet Key: ARROW-12635 URL: https://issues.apache.org/jira/browse/ARROW-12635 Project: Apache Arrow

[jira] [Commented] (ARROW-7712) [CI][Crossbow] Fix or delete fuzzit jobs

2020-01-29 Thread Marco Neumann (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17025739#comment-17025739 ] Marco Neumann commented on ARROW-7712: -- [~apitrou] I think we should focus on a single solution. I

[jira] [Created] (ARROW-6872) [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError

2019-10-14 Thread Marco Neumann (Jira)
Marco Neumann created ARROW-6872: Summary: [C++][Python] Empty table with dictionary-columns raises ArrowNotImplementedError Key: ARROW-6872 URL: https://issues.apache.org/jira/browse/ARROW-6872

[jira] [Commented] (ARROW-5525) [C++][CI] Enable continuous fuzzing

2019-09-17 Thread Marco Neumann (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16931568#comment-16931568 ] Marco Neumann commented on ARROW-5525: -- {quote}[~marco.neumann.by] you are admin in the

[jira] [Commented] (ARROW-5525) [C++][CI] Enable continuous fuzzing

2019-09-17 Thread Marco Neumann (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16931519#comment-16931519 ] Marco Neumann commented on ARROW-5525: -- There's [https://fuzzit.dev/] where you can login via

[jira] [Created] (ARROW-6424) [C++][Fuzzing] Fuzzit nightly is broken

2019-09-03 Thread Marco Neumann (Jira)
Marco Neumann created ARROW-6424: Summary: [C++][Fuzzing] Fuzzit nightly is broken Key: ARROW-6424 URL: https://issues.apache.org/jira/browse/ARROW-6424 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-6273) [C++][Fuzzing] Add fuzzer for parquet->arrow read path

2019-08-16 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-6273: Summary: [C++][Fuzzing] Add fuzzer for parquet->arrow read path Key: ARROW-6273 URL: https://issues.apache.org/jira/browse/ARROW-6273 Project: Apache Arrow

[jira] [Created] (ARROW-6270) [C++][Fuzzing] IPC reads do not check buffer indices

2019-08-16 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-6270: Summary: [C++][Fuzzing] IPC reads do not check buffer indices Key: ARROW-6270 URL: https://issues.apache.org/jira/browse/ARROW-6270 Project: Apache Arrow

[jira] [Created] (ARROW-6269) [C++][Fuzzing] IPC reads do not check decimal precision

2019-08-16 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-6269: Summary: [C++][Fuzzing] IPC reads do not check decimal precision Key: ARROW-6269 URL: https://issues.apache.org/jira/browse/ARROW-6269 Project: Apache Arrow

[jira] [Assigned] (ARROW-5959) [C++][CI] Fuzzit does not know about branch + commit hash

2019-07-26 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann reassigned ARROW-5959: Assignee: Marco Neumann (was: Yevgeny Pats) > [C++][CI] Fuzzit does not know about

[jira] [Created] (ARROW-5990) RowGroupMetaData.column misses bounds check

2019-07-19 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5990: Summary: RowGroupMetaData.column misses bounds check Key: ARROW-5990 URL: https://issues.apache.org/jira/browse/ARROW-5990 Project: Apache Arrow Issue Type:

[jira] [Resolved] (ARROW-5987) [C++][Fuzzing] arrow-ipc-fuzzing-test crash 3c3f1b74f347ec6c8b0905e7126b9074b9dc5564

2019-07-19 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann resolved ARROW-5987. -- Resolution: Cannot Reproduce > [C++][Fuzzing] arrow-ipc-fuzzing-test crash >

[jira] [Commented] (ARROW-5987) [C++][Fuzzing] arrow-ipc-fuzzing-test crash 3c3f1b74f347ec6c8b0905e7126b9074b9dc5564

2019-07-19 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16888688#comment-16888688 ] Marco Neumann commented on ARROW-5987: -- I swear this was an issue earlier and was now magically

[jira] [Created] (ARROW-5987) [C++][Fuzzing] arrow-ipc-fuzzing-test crash 3c3f1b74f347ec6c8b0905e7126b9074b9dc5564

2019-07-19 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5987: Summary: [C++][Fuzzing] arrow-ipc-fuzzing-test crash 3c3f1b74f347ec6c8b0905e7126b9074b9dc5564 Key: ARROW-5987 URL: https://issues.apache.org/jira/browse/ARROW-5987

[jira] [Created] (ARROW-5959) [C++][CI] Fuzzit does not know about branch + commit hash

2019-07-16 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5959: Summary: [C++][CI] Fuzzit does not know about branch + commit hash Key: ARROW-5959 URL: https://issues.apache.org/jira/browse/ARROW-5959 Project: Apache Arrow

[jira] [Created] (ARROW-5921) [C++][Fuzzing] Missing nullptr checks in IPC

2019-07-12 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5921: Summary: [C++][Fuzzing] Missing nullptr checks in IPC Key: ARROW-5921 URL: https://issues.apache.org/jira/browse/ARROW-5921 Project: Apache Arrow Issue

[jira] [Comment Edited] (ARROW-5028) [Python][C++] Arrow to Parquet conversion drops and corrupts values

2019-07-10 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16881852#comment-16881852 ] Marco Neumann edited comment on ARROW-5028 at 7/10/19 8:37 AM: --- *You need a

[jira] [Commented] (ARROW-5028) [Python][C++] Arrow to Parquet conversion drops and corrupts values

2019-07-10 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16881852#comment-16881852 ] Marco Neumann commented on ARROW-5028: -- *You need a massive machine (>10GB RAM) to run this!*

[jira] [Updated] (ARROW-5028) [Python][C++] Arrow to Parquet conversion drops and corrupts values

2019-07-10 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann updated ARROW-5028: - Attachment: dct.json.gz > [Python][C++] Arrow to Parquet conversion drops and corrupts values >

[jira] [Commented] (ARROW-5028) [Python][C++] Arrow to Parquet conversion drops and corrupts values

2019-07-08 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16880106#comment-16880106 ] Marco Neumann commented on ARROW-5028: -- [~emkornfi...@gmail.com] sorry for the late reply. I was

[jira] [Created] (ARROW-5607) [C++][Fuzzing] arrow-ipc-fuzzing-test crash 607e9caa76863a97f2694a769a1ae2fb83c55e02

2019-06-14 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5607: Summary: [C++][Fuzzing] arrow-ipc-fuzzing-test crash 607e9caa76863a97f2694a769a1ae2fb83c55e02 Key: ARROW-5607 URL: https://issues.apache.org/jira/browse/ARROW-5607

[jira] [Assigned] (ARROW-5605) [C++][Fuzzing] arrow-ipc-fuzzing-test crash 74aec871d14bb6b07c72ea8f0e8c9f72cbe6b73c

2019-06-14 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann reassigned ARROW-5605: Assignee: Marco Neumann > [C++][Fuzzing] arrow-ipc-fuzzing-test crash >

[jira] [Created] (ARROW-5605) [C++][Fuzzing] arrow-ipc-fuzzing-test crash 74aec871d14bb6b07c72ea8f0e8c9f72cbe6b73c

2019-06-14 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5605: Summary: [C++][Fuzzing] arrow-ipc-fuzzing-test crash 74aec871d14bb6b07c72ea8f0e8c9f72cbe6b73c Key: ARROW-5605 URL: https://issues.apache.org/jira/browse/ARROW-5605

[jira] [Created] (ARROW-5593) [C++][Fuzzing] Test fuzzers against arrow-testing corpus

2019-06-13 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5593: Summary: [C++][Fuzzing] Test fuzzers against arrow-testing corpus Key: ARROW-5593 URL: https://issues.apache.org/jira/browse/ARROW-5593 Project: Apache Arrow

[jira] [Updated] (ARROW-5589) arrow-ipc-fuzzing-test crash 2354085db0125113f04f7bd23f54b85cca104713

2019-06-13 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann updated ARROW-5589: - Description: {{arrow-ipc-fuzzing-test}} found the attached crash. Reproduce with {code}

[jira] [Updated] (ARROW-5589) arrow-ipc-fuzzing-test crash 2354085db0125113f04f7bd23f54b85cca104713

2019-06-13 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann updated ARROW-5589: - Summary: arrow-ipc-fuzzing-test crash 2354085db0125113f04f7bd23f54b85cca104713 (was:

[jira] [Created] (ARROW-5589) ipc-fuzzing-test crash 2354085db0125113f04f7bd23f54b85cca104713

2019-06-13 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5589: Summary: ipc-fuzzing-test crash 2354085db0125113f04f7bd23f54b85cca104713 Key: ARROW-5589 URL: https://issues.apache.org/jira/browse/ARROW-5589 Project: Apache Arrow

[jira] [Created] (ARROW-5525) Enable continuous fuzzing

2019-06-07 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5525: Summary: Enable continuous fuzzing Key: ARROW-5525 URL: https://issues.apache.org/jira/browse/ARROW-5525 Project: Apache Arrow Issue Type: Test

[jira] [Commented] (ARROW-2256) [C++] Fuzzer builds fail out of the box on Ubuntu 16.04 using LLVM apt repos

2019-06-07 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16858504#comment-16858504 ] Marco Neumann commented on ARROW-2256: -- I can confirm that and have a fix ready to commit. > [C++]

[jira] [Assigned] (ARROW-2256) [C++] Fuzzer builds fail out of the box on Ubuntu 16.04 using LLVM apt repos

2019-06-07 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann reassigned ARROW-2256: Assignee: Marco Neumann > [C++] Fuzzer builds fail out of the box on Ubuntu 16.04 using

[jira] [Commented] (ARROW-5028) [Python][C++] Arrow to Parquet conversion drops and corrupts values

2019-06-03 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16854341#comment-16854341 ] Marco Neumann commented on ARROW-5028: -- Sadly not, since the debugging is quite complicated and I

[jira] [Created] (ARROW-5166) [Python] Statistics for uint64 columns may overflow

2019-04-12 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5166: Summary: [Python] Statistics for uint64 columns may overflow Key: ARROW-5166 URL: https://issues.apache.org/jira/browse/ARROW-5166 Project: Apache Arrow

[jira] [Commented] (ARROW-5028) [Python][C++] Arrow to Parquet conversion drops and corrupts values

2019-04-05 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16810953#comment-16810953 ] Marco Neumann commented on ARROW-5028: -- So the original table seems to be broken because the

[jira] [Commented] (ARROW-5028) [Python][C++] Arrow to Parquet conversion drops and corrupts values

2019-04-05 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16810752#comment-16810752 ] Marco Neumann commented on ARROW-5028: -- More debugging results: * {{def_levels}} and {{rep_levels}}

[jira] [Commented] (ARROW-5028) [Python][C++] Arrow to Parquet conversion drops and corrupts values

2019-03-29 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16804793#comment-16804793 ] Marco Neumann commented on ARROW-5028: -- Short update: The error also occurs when: * Converting the

[jira] [Updated] (ARROW-5028) Arrow->Parquet conversion drops and corrupts values

2019-03-27 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann updated ARROW-5028: - Environment: python 3.6 > Arrow->Parquet conversion drops and corrupts values >

[jira] [Updated] (ARROW-5028) Arrow->Parquet conversion drops and corrupts values

2019-03-27 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann updated ARROW-5028: - Summary: Arrow->Parquet conversion drops and corrupts values (was: Arrow->Parquet store drops

[jira] [Created] (ARROW-5028) Arrow->Parquet store drops and corrupts values

2019-03-27 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-5028: Summary: Arrow->Parquet store drops and corrupts values Key: ARROW-5028 URL: https://issues.apache.org/jira/browse/ARROW-5028 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-2963) [Python] Deadlock during fork-join and use_threads=True

2018-08-02 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16566987#comment-16566987 ] Marco Neumann commented on ARROW-2963: -- The problem is that using threads worked in {{0.9.0}},

[jira] [Created] (ARROW-2963) [Python] Deadlock during fork-join and use_threads=True

2018-08-02 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-2963: Summary: [Python] Deadlock during fork-join and use_threads=True Key: ARROW-2963 URL: https://issues.apache.org/jira/browse/ARROW-2963 Project: Apache Arrow

[jira] [Assigned] (ARROW-2554) pa.array type inference bug when using NS-timestamp

2018-06-08 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann reassigned ARROW-2554: Assignee: Marco Neumann > pa.array type inference bug when using NS-timestamp >

[jira] [Created] (ARROW-2554) pa.array type inference bug when using NS-timestamp

2018-05-08 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-2554: Summary: pa.array type inference bug when using NS-timestamp Key: ARROW-2554 URL: https://issues.apache.org/jira/browse/ARROW-2554 Project: Apache Arrow

[jira] [Assigned] (ARROW-2513) [Python] DictionaryType should give access to index type and dictionary array

2018-04-26 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann reassigned ARROW-2513: Assignee: Marco Neumann > [Python] DictionaryType should give access to index type and

[jira] [Created] (ARROW-2513) [Python] DictionaryType should give access to index type and dictionary array

2018-04-26 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-2513: Summary: [Python] DictionaryType should give access to index type and dictionary array Key: ARROW-2513 URL: https://issues.apache.org/jira/browse/ARROW-2513 Project:

[jira] [Commented] (ARROW-1589) [C++] Fuzzing for certain input formats

2018-01-29 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16343242#comment-16343242 ] Marco Neumann commented on ARROW-1589: -- So the "empty input" is one of them. The fuzzing process is

[jira] [Commented] (ARROW-1589) [C++] Fuzzing for certain input formats

2018-01-08 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16315898#comment-16315898 ] Marco Neumann commented on ARROW-1589: -- I'll open a PR by the end of January, sorry for the delay. The

[jira] [Commented] (ARROW-1589) [C++] Fuzzing for certain input formats

2017-09-25 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16179136#comment-16179136 ] Marco Neumann commented on ARROW-1589: -- {quote}Please understand that this software we are discussing

[jira] [Commented] (ARROW-1589) [C++] Fuzzing for certain input formats

2017-09-25 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178675#comment-16178675 ] Marco Neumann commented on ARROW-1589: -- Currently it is not clearly stated that the message stream is

[jira] [Created] (ARROW-1589) Fuzzing for certain input formats

2017-09-21 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-1589: Summary: Fuzzing for certain input formats Key: ARROW-1589 URL: https://issues.apache.org/jira/browse/ARROW-1589 Project: Apache Arrow Issue Type: Test

[jira] [Assigned] (ARROW-1276) Cannot serializer empty DataFrame to parquet

2017-07-26 Thread Marco Neumann (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marco Neumann reassigned ARROW-1276: Assignee: Marco Neumann > Cannot serializer empty DataFrame to parquet >

[jira] [Created] (ARROW-1276) Cannot serializer empty DataFrame to parquet

2017-07-26 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-1276: Summary: Cannot serializer empty DataFrame to parquet Key: ARROW-1276 URL: https://issues.apache.org/jira/browse/ARROW-1276 Project: Apache Arrow Issue

[jira] [Created] (ARROW-1083) Object categoricals are not serialized when only None is present

2017-06-02 Thread Marco Neumann (JIRA)
Marco Neumann created ARROW-1083: Summary: Object categoricals are not serialized when only None is present Key: ARROW-1083 URL: https://issues.apache.org/jira/browse/ARROW-1083 Project: Apache Arrow