[
https://issues.apache.org/jira/browse/ARROW-8447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8447:
--
Labels: dataset pull-request-available (was: dataset)
> [C++][Dataset] Ensure
[
https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096862#comment-17096862
]
Wes McKinney commented on ARROW-8657:
-
> As a result all parquet files that were created with
[
https://issues.apache.org/jira/browse/ARROW-7759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ben Kietzman resolved ARROW-7759.
-
Resolution: Fixed
Issue resolved by pull request 7033
[
https://issues.apache.org/jira/browse/ARROW-7759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ben Kietzman reassigned ARROW-7759:
---
Assignee: Ben Kietzman (was: Antoine Pitrou)
> [C++][Dataset] Add CsvFileFormat for CSV
[
https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney updated ARROW-8657:
Summary: [Python][C++][Parquet] Forward compatibility issue from 0.16 to
0.17 when using
[
https://issues.apache.org/jira/browse/ARROW-8656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8656:
--
Labels: pull-request-available (was: )
> [Python] Switch to VS2017 in the windows wheel
Krisztian Szucs created ARROW-8656:
--
Summary: [Python] Switch to VS2017 in the windows wheel builds
Key: ARROW-8656
URL: https://issues.apache.org/jira/browse/ARROW-8656
Project: Apache Arrow
[
https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney updated ARROW-8657:
Description:
With the recent release of 0.17, the ParquetVersion is used to define the
logical
[
https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096866#comment-17096866
]
Wes McKinney commented on ARROW-8657:
-
For the record, I think we need to introduce a new flag to
[
https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096866#comment-17096866
]
Wes McKinney edited comment on ARROW-8657 at 4/30/20, 6:37 PM:
---
For the
[
https://issues.apache.org/jira/browse/ARROW-8657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney updated ARROW-8657:
Fix Version/s: 0.17.1
> [Python][C++][Parquet] Forward compatibility issue from 0.16 to 0.17 when
Pierre Belzile created ARROW-8657:
-
Summary: Distinguish parquet version 2 logical type vs DataPageV2
Key: ARROW-8657
URL: https://issues.apache.org/jira/browse/ARROW-8657
Project: Apache Arrow
Ben Kietzman created ARROW-8658:
---
Summary: [C++][Dataset] Implement subtree pruning for
FileSystemDataset::GetFragments
Key: ARROW-8658
URL: https://issues.apache.org/jira/browse/ARROW-8658
Project:
[
https://issues.apache.org/jira/browse/ARROW-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mike Macpherson updated ARROW-8654:
---
Description:
{code:java}
import pandas as pd
import numpy as np
num_rows, num_cols = 1000,
[
https://issues.apache.org/jira/browse/ARROW-8648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Francois Saint-Jacques reassigned ARROW-8648:
-
Assignee: Mark Hildreth
> [Rust] Optimize Rust CI Build Times
>
[
https://issues.apache.org/jira/browse/ARROW-8648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Francois Saint-Jacques resolved ARROW-8648.
---
Fix Version/s: 1.0.0
Resolution: Fixed
Issue resolved by pull request
[
https://issues.apache.org/jira/browse/ARROW-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096858#comment-17096858
]
Wes McKinney commented on ARROW-8654:
-
FWIW, "large" metadata from very wide tables is a problematic
[
https://issues.apache.org/jira/browse/ARROW-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096859#comment-17096859
]
Wes McKinney commented on ARROW-8654:
-
Also, the perf of reading very wide Parquet files won't be
[
https://issues.apache.org/jira/browse/ARROW-8592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Francois Saint-Jacques resolved ARROW-8592.
---
Resolution: Fixed
Issue resolved by pull request 7068
[
https://issues.apache.org/jira/browse/ARROW-8659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8659:
--
Labels: pull-request-available (was: )
> [Rust] ListBuilder and FixedSizeListBuilder capacity
[
https://issues.apache.org/jira/browse/ARROW-8447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ben Kietzman resolved ARROW-8447.
-
Resolution: Fixed
Issue resolved by pull request 7075
[
https://issues.apache.org/jira/browse/ARROW-8447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ben Kietzman reassigned ARROW-8447:
---
Assignee: Francois Saint-Jacques
> [C++][Dataset] Ensure Scanner::ToTable preserve ordering
[
https://issues.apache.org/jira/browse/ARROW-8653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096962#comment-17096962
]
Kouhei Sutou commented on ARROW-8653:
-
We'll be able to implement this by checking {{gflags.pc}}.
We
Raphael Taylor-Davies created ARROW-8659:
Summary: ListBuilder and FixedSizeListBuilder capacity
Key: ARROW-8659
URL: https://issues.apache.org/jira/browse/ARROW-8659
Project: Apache Arrow
[
https://issues.apache.org/jira/browse/ARROW-8659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Raphael Taylor-Davies updated ARROW-8659:
-
Summary: [Rust] ListBuilder and FixedSizeListBuilder capacity (was:
ListBuilder
Wes McKinney created ARROW-8660:
---
Summary: [C++][Gandiva] Reduce dependence on Boost
Key: ARROW-8660
URL: https://issues.apache.org/jira/browse/ARROW-8660
Project: Apache Arrow
Issue Type:
[
https://issues.apache.org/jira/browse/ARROW-8634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Micah Kornfield resolved ARROW-8634.
Resolution: Fixed
Issue resolved by pull request 7066
Wes McKinney created ARROW-8661:
---
Summary: [C++][Gandiva] Reduce number of files and headers
Key: ARROW-8661
URL: https://issues.apache.org/jira/browse/ARROW-8661
Project: Apache Arrow
Issue
[
https://issues.apache.org/jira/browse/ARROW-300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney resolved ARROW-300.
Fix Version/s: 1.0.0
Resolution: Fixed
Issue resolved by pull request 6707
[
https://issues.apache.org/jira/browse/ARROW-8661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney updated ARROW-8661:
Description:
I feel that the Gandiva subpackage is more Java-like in its code organization
than
[
https://issues.apache.org/jira/browse/ARROW-8660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8660:
--
Labels: pull-request-available (was: )
> [C++][Gandiva] Reduce dependence on Boost
>
[
https://issues.apache.org/jira/browse/ARROW-8647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8647:
-
Description:
In the Python ParquetDataset implementation, the partition fields
[
https://issues.apache.org/jira/browse/ARROW-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney closed ARROW-8638.
---
Resolution: Information Provided
Closing since there isn't a bug to fix, further discussion can take
[
https://issues.apache.org/jira/browse/ARROW-8648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mark Hildreth updated ARROW-8648:
-
Component/s: Rust
> [Rust] Optimize Rust CI Build Times
> ---
>
[
https://issues.apache.org/jira/browse/ARROW-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096500#comment-17096500
]
Vibhatha Lakmal Abeykoon commented on ARROW-8638:
-
I tried the LD_LIBRARY_PATH approach
Joris Van den Bossche created ARROW-8647:
Summary: [C++][Dataset] Optionally encode partition field values
as dictionary type
Key: ARROW-8647
URL: https://issues.apache.org/jira/browse/ARROW-8647
Andy Grove created ARROW-8650:
-
Summary: [Rust] [Website] Add documentation to Arrow website
Key: ARROW-8650
URL: https://issues.apache.org/jira/browse/ARROW-8650
Project: Apache Arrow
Issue
[
https://issues.apache.org/jira/browse/ARROW-8649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andy Grove updated ARROW-8649:
--
Component/s: Website
Java
> [Java] [Website] Java documentation on website is hidden
Krisztian Szucs created ARROW-8653:
--
Summary: [C++] Add support for gflags version detection
Key: ARROW-8653
URL: https://issues.apache.org/jira/browse/ARROW-8653
Project: Apache Arrow
[
https://issues.apache.org/jira/browse/ARROW-8651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8651:
-
Labels: dataset (was: )
> [Python][Dataset] Support pickling of Dataset objects
Joris Van den Bossche created ARROW-8651:
Summary: [Python][Dataset] Support pickling of Dataset objects
Key: ARROW-8651
URL: https://issues.apache.org/jira/browse/ARROW-8651
Project: Apache
Joris Van den Bossche created ARROW-8655:
Summary: [C++][Dataset][Python][R] Preserve partitioning
information for a discovered Dataset
Key: ARROW-8655
URL:
[
https://issues.apache.org/jira/browse/ARROW-8622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Paddy Horan resolved ARROW-8622.
Fix Version/s: 1.0.0
Resolution: Fixed
Issue resolved by pull request 7059
Andy Grove created ARROW-8649:
-
Summary: [Java] [Website] Java documentation on website is hidden
Key: ARROW-8649
URL: https://issues.apache.org/jira/browse/ARROW-8649
Project: Apache Arrow
[
https://issues.apache.org/jira/browse/ARROW-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096567#comment-17096567
]
Wes McKinney commented on ARROW-8642:
-
[~trickarcher] if you have questions it's better to use the
Joris Van den Bossche created ARROW-8652:
Summary: [Python] Test error message when discovering dataset with
invalid files
Key: ARROW-8652
URL: https://issues.apache.org/jira/browse/ARROW-8652
[
https://issues.apache.org/jira/browse/ARROW-8652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8652:
-
Labels: dataset (was: )
> [Python] Test error message when discovering dataset
Mike Macpherson created ARROW-8654:
--
Summary: [Python] pyarrow 0.17.0 fails reading "wide" parquet files
Key: ARROW-8654
URL: https://issues.apache.org/jira/browse/ARROW-8654
Project: Apache Arrow
[
https://issues.apache.org/jira/browse/ARROW-8638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096493#comment-17096493
]
Uwe Korn commented on ARROW-8638:
-
You either need to extend the environment variable `LD_LIBRARY_PATH`
[
https://issues.apache.org/jira/browse/ARROW-8318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8318:
--
Labels: dataset pull-request-available (was: dataset)
> [C++][Dataset] Dataset should
[
https://issues.apache.org/jira/browse/ARROW-8654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Mike Macpherson updated ARROW-8654:
---
Description:
{code:java}
import pandas as pd
num_rows, num_cols = 1000, 45000
df =
[
https://issues.apache.org/jira/browse/ARROW-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wes McKinney updated ARROW-8641:
Fix Version/s: 1.0.0
> [Python] Regression in feather: no longer supports permutation in column
>
[
https://issues.apache.org/jira/browse/ARROW-8641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096566#comment-17096566
]
Wes McKinney commented on ARROW-8641:
-
Too bad this was not tested
> [Python] Regression in feather:
Mark Hildreth created ARROW-8648:
Summary: [Rust] Optimize Rust CI Build Times
Key: ARROW-8648
URL: https://issues.apache.org/jira/browse/ARROW-8648
Project: Apache Arrow
Issue Type:
[
https://issues.apache.org/jira/browse/ARROW-8648?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8648:
--
Labels: pull-request-available (was: )
> [Rust] Optimize Rust CI Build Times
>
[
https://issues.apache.org/jira/browse/ARROW-8647?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8647:
-
Labels: dataset (was: )
> [C++][Dataset] Optionally encode partition field
[
https://issues.apache.org/jira/browse/ARROW-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096569#comment-17096569
]
Anish Biswas commented on ARROW-8642:
-
Okay, I will do that from now on.
> Is there a good way to
[
https://issues.apache.org/jira/browse/ARROW-8639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Krisztian Szucs resolved ARROW-8639.
Fix Version/s: 1.0.0
Resolution: Fixed
Issue resolved by pull request 7067
[
https://issues.apache.org/jira/browse/ARROW-8640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anish Biswas closed ARROW-8640.
---
> pyarrow.UnionArray.from_buffers() expected number of buffers (1) did not
> match the passed number
[
https://issues.apache.org/jira/browse/ARROW-8640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096164#comment-17096164
]
Anish Biswas commented on ARROW-8640:
-
Ah, I see. Yes, that makes more sense. Thanks for the help!
[
https://issues.apache.org/jira/browse/ARROW-8592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8592:
--
Labels: pull-request-available (was: )
> [C++] Docs still list LLVM 7 as compiler used
>
Joris Van den Bossche created ARROW-8641:
Summary: [Python] Regression in feather: no longer supports
permutation in column selection
Key: ARROW-8641
URL: https://issues.apache.org/jira/browse/ARROW-8641
[
https://issues.apache.org/jira/browse/ARROW-8504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Micah Kornfield reassigned ARROW-8504:
--
Assignee: Micah Kornfield
> [C++] Add a method that takes an RLE visitor for a
Joris Van den Bossche created ARROW-8643:
Summary: [Python] Tests with pandas master failing due to freq
assertion
Key: ARROW-8643
URL: https://issues.apache.org/jira/browse/ARROW-8643
Joris Van den Bossche created ARROW-8644:
Summary: [Python] Dask integration tests failing due to change in
not including partition columns
Key: ARROW-8644
URL:
[
https://issues.apache.org/jira/browse/ARROW-8645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8645:
--
Labels: pull-request-available (was: )
> [C++] Missing gflags dependency for plasma
>
Krisztian Szucs created ARROW-8645:
--
Summary: [C++] Missing gflags dependency for plasma
Key: ARROW-8645
URL: https://issues.apache.org/jira/browse/ARROW-8645
Project: Apache Arrow
Issue
Thippana Vamsi Kalyan created ARROW-8646:
Summary: Allow UnionListWriter to write null values
Key: ARROW-8646
URL: https://issues.apache.org/jira/browse/ARROW-8646
Project: Apache Arrow
[
https://issues.apache.org/jira/browse/ARROW-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096272#comment-17096272
]
Joris Van den Bossche commented on ARROW-8642:
--
There is a {{from_numpy_dtype}} function for
[
https://issues.apache.org/jira/browse/ARROW-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17096278#comment-17096278
]
Anish Biswas commented on ARROW-8642:
-
Oh okay! That's neat! Thanks!
> Is there a good way to
[
https://issues.apache.org/jira/browse/ARROW-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anish Biswas closed ARROW-8642.
---
Resolution: Fixed
> Is there a good way to convert data types from numpy types to pyarrow
>
Anish Biswas created ARROW-8642:
---
Summary: Is there a good way to convert data types from numpy
types to pyarrow DataType?
Key: ARROW-8642
URL: https://issues.apache.org/jira/browse/ARROW-8642
Project:
[
https://issues.apache.org/jira/browse/ARROW-7955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-7955:
--
Labels: pull-request-available (was: )
> [Java] Support large buffer for file/stream IPC
>
[
https://issues.apache.org/jira/browse/ARROW-8646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-8646:
--
Labels: pull-request-available (was: )
> Allow UnionListWriter to write null values
>