Joris Van den Bossche created ARROW-7638:
Summary: [Python] Segfault when inspecting dataset.Source with
invalid file/partitioning
Key: ARROW-7638
URL: https://issues.apache.org/jira/browse/ARROW-7638
[
https://issues.apache.org/jira/browse/ARROW-7638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7638:
-
Description:
Getting a segfault with:
{code}
In [1]: import pyarrow.dataset as
Joris Van den Bossche created ARROW-7547:
Summary: [C++] [Python] [Dataset] Additional reader options in
ParquetFileFormat
Key: ARROW-7547
URL: https://issues.apache.org/jira/browse/ARROW-7547
Joris Van den Bossche created ARROW-7545:
Summary: [C++] Scanning dataset with dictionary type hangs
Key: ARROW-7545
URL: https://issues.apache.org/jira/browse/ARROW-7545
Project: Apache Arrow
[
https://issues.apache.org/jira/browse/ARROW-7545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7545:
-
Summary: [C++] [Dataset] Scanning dataset with dictionary type hangs (was:
[
https://issues.apache.org/jira/browse/ARROW-7545?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17012681#comment-17012681
]
Joris Van den Bossche commented on ARROW-7545:
--
So if the table has a single dictionary
[
https://issues.apache.org/jira/browse/ARROW-7413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010713#comment-17010713
]
Joris Van den Bossche commented on ARROW-7413:
--
[~bkietz] I suppose you are not working on
[
https://issues.apache.org/jira/browse/ARROW-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-5757:
-
Fix Version/s: (was: 2.0.0)
1.0.0
> [Python] Stop
[
https://issues.apache.org/jira/browse/ARROW-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014965#comment-17014965
]
Joris Van den Bossche commented on ARROW-5757:
--
Wes:
{quote}We probably need to discuss
[
https://issues.apache.org/jira/browse/ARROW-7561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7561:
-
Fix Version/s: 0.16.0
> [Doc][Python] fix conda environment command
>
[
https://issues.apache.org/jira/browse/ARROW-7555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014961#comment-17014961
]
Joris Van den Bossche commented on ARROW-7555:
--
With conda, on the other hand, Python 2.7 is
[
https://issues.apache.org/jira/browse/ARROW-7555?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014963#comment-17014963
]
Joris Van den Bossche commented on ARROW-7555:
--
But closing this as a duplicate of
[
https://issues.apache.org/jira/browse/ARROW-7555?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche closed ARROW-7555.
Resolution: Duplicate
> [Python] Drop support for python 2.7
>
[
https://issues.apache.org/jira/browse/ARROW-5757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014964#comment-17014964
]
Joris Van den Bossche commented on ARROW-5757:
--
Some discussion happened in ARROW-7555
Joris Van den Bossche created ARROW-7569:
Summary: [Python] Add API to map Arrow types to pandas
ExtensionDtypes for to_pandas conversions
Key: ARROW-7569
URL:
Joris Van den Bossche created ARROW-7497:
Summary: [Python] pandas master failures: pandas.util.testing is
deprecated
Key: ARROW-7497
URL: https://issues.apache.org/jira/browse/ARROW-7497
[
https://issues.apache.org/jira/browse/ARROW-7087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche resolved ARROW-7087.
--
Resolution: Fixed
Issue resolved by pull request 6127
[
https://issues.apache.org/jira/browse/ARROW-7087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche reassigned ARROW-7087:
Assignee: François Blanchard
> [Python] Table Metadata disappear when we
[
https://issues.apache.org/jira/browse/ARROW-7512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7512:
-
Summary: [C++] Dictionary memo missing elements in id_to_dictionary_ map
after
[
https://issues.apache.org/jira/browse/ARROW-8088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8088:
-
Description:
When specifying an explicit schema for the Partitioning, and when
[
https://issues.apache.org/jira/browse/ARROW-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17057780#comment-17057780
]
Joris Van den Bossche commented on ARROW-3391:
--
[~uwe] What kind of wrong results do you
[
https://issues.apache.org/jira/browse/ARROW-8087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8087:
-
Fix Version/s: 0.17.0
> [C++][Dataset] Order of keys with HivePartitioning is
Joris Van den Bossche created ARROW-8087:
Summary: [C++][Dataset] Order of keys with HivePartitioning is
lost in resulting schema
Key: ARROW-8087
URL: https://issues.apache.org/jira/browse/ARROW-8087
Joris Van den Bossche created ARROW-8088:
Summary: [C++][Dataset] Partition columns with specified
dictionary type result in all nulls
Key: ARROW-8088
URL: https://issues.apache.org/jira/browse/ARROW-8088
[
https://issues.apache.org/jira/browse/ARROW-7858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7858:
-
Issue Type: Improvement (was: Test)
> [C++][Python] Support casting an
[
https://issues.apache.org/jira/browse/ARROW-5379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche closed ARROW-5379.
Resolution: Fixed
> [Python] support pandas' nullable Integer type in from_pandas
[
https://issues.apache.org/jira/browse/ARROW-5379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17057723#comment-17057723
]
Joris Van den Bossche commented on ARROW-5379:
--
With the latest releases of pandas and
[
https://issues.apache.org/jira/browse/ARROW-8066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17057712#comment-17057712
]
Joris Van den Bossche commented on ARROW-8066:
--
At least we should normalize to UTC, I think
[
https://issues.apache.org/jira/browse/ARROW-7986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche resolved ARROW-7986.
--
Resolution: Fixed
> [Python] pa.Array.from_pandas cannot convert pandas.Series
[
https://issues.apache.org/jira/browse/ARROW-7986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche closed ARROW-7986.
> [Python] pa.Array.from_pandas cannot convert pandas.Series containing
>
[
https://issues.apache.org/jira/browse/ARROW-7986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17057704#comment-17057704
]
Joris Van den Bossche commented on ARROW-7986:
--
OK, closing the issue here then, since the
Joris Van den Bossche created ARROW-8074:
Summary: [C++][Dataset] Support for file-like objects (buffers) in
FileSystemDataset?
Key: ARROW-8074
URL: https://issues.apache.org/jira/browse/ARROW-8074
[
https://issues.apache.org/jira/browse/ARROW-4633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-4633:
-
Labels: dataset-parquet-read newbie parquet (was: newbie parquet)
> [Python]
[
https://issues.apache.org/jira/browse/ARROW-2860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2860:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
>
[
https://issues.apache.org/jira/browse/ARROW-2728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2728:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
>
[
https://issues.apache.org/jira/browse/ARROW-2659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2659:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
> [Python]
[
https://issues.apache.org/jira/browse/ARROW-2098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2098:
-
Labels: (was: parquet)
> [Python] Implement "errors as null" option when
[
https://issues.apache.org/jira/browse/ARROW-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-1956:
-
Labels: dataset-parquet-read parquet (was: parquet)
> [Python] Support reading
[
https://issues.apache.org/jira/browse/ARROW-2079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2079:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
>
[
https://issues.apache.org/jira/browse/ARROW-2366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2366:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
>
[
https://issues.apache.org/jira/browse/ARROW-2444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2444:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
>
[
https://issues.apache.org/jira/browse/ARROW-1848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-1848:
-
Labels: dataset-parquet-read filesystem parquet (was: filesystem parquet)
>
[
https://issues.apache.org/jira/browse/ARROW-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-1682:
-
Labels: dataset-parquet-read filesystem parquet (was: filesystem parquet)
>
[
https://issues.apache.org/jira/browse/ARROW-2077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2077:
-
Labels: dataset-parquet-read parquet (was: parquet)
> [Python] Document on how
[
https://issues.apache.org/jira/browse/ARROW-5825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-5825:
-
Labels: dataset-parquet-read parquet (was: parquet)
> [Python] Exceptions
[
https://issues.apache.org/jira/browse/ARROW-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2801:
-
Labels: dataset dataset-parquet-read parquet pull-request-available (was:
[
https://issues.apache.org/jira/browse/ARROW-5310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-5310:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
> [Python]
[
https://issues.apache.org/jira/browse/ARROW-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2882:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
>
[
https://issues.apache.org/jira/browse/ARROW-7385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7385:
-
Labels: dataset parquet parquet-read (was: dataset parquet)
> [Python]
[
https://issues.apache.org/jira/browse/ARROW-7385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7385:
-
Labels: dataset parquet (was: parquet)
> [Python] ParquetDataset deadlock with
[
https://issues.apache.org/jira/browse/ARROW-7385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7385:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet
[
https://issues.apache.org/jira/browse/ARROW-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3424:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
> [Python]
[
https://issues.apache.org/jira/browse/ARROW-3391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3391:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
> [Python]
[
https://issues.apache.org/jira/browse/ARROW-3705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3705:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
> [Python]
[
https://issues.apache.org/jira/browse/ARROW-3861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3861:
-
Labels: dataset dataset-parquet-read parquet python (was: dataset parquet
[
https://issues.apache.org/jira/browse/ARROW-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3245:
-
Labels: dataset-parquet-read parquet (was: parquet)
> [Python] Infer index
[
https://issues.apache.org/jira/browse/ARROW-3245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3245:
-
Labels: dataset dataset-parquet-read parquet (was: dataset-parquet-read
[
https://issues.apache.org/jira/browse/ARROW-3244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3244:
-
Labels: dataset dataset-parquet-read parquet (was: dataset parquet)
> [Python]
[
https://issues.apache.org/jira/browse/ARROW-5666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-5666:
-
Labels: dataset-parquet-read parquet (was: parquet)
> [Python] Underscores in
[
https://issues.apache.org/jira/browse/ARROW-5572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-5572:
-
Labels: dataset-parquet-read parquet (was: parquet)
> [Python] raise error
[
https://issues.apache.org/jira/browse/ARROW-3947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3947:
-
Labels: dataset-parquet-read parquet (was: parquet)
> [Python] query distinct
[
https://issues.apache.org/jira/browse/ARROW-7996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7996:
-
Labels: serialization (was: )
> [Python] Error serializing empty pandas
[
https://issues.apache.org/jira/browse/ARROW-8004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055776#comment-17055776
]
Joris Van den Bossche commented on ARROW-8004:
--
For a more limited use case than general
[
https://issues.apache.org/jira/browse/ARROW-8010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8010:
-
Summary: [Python] Fixed size list not convertible to Numpy Array / pandas
Series
[
https://issues.apache.org/jira/browse/ARROW-7680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055828#comment-17055828
]
Joris Van den Bossche commented on ARROW-7680:
--
Indeed, we are still getting the same error
[
https://issues.apache.org/jira/browse/ARROW-8010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055833#comment-17055833
]
Joris Van den Bossche commented on ARROW-8010:
--
[~balancap] Thanks for the report!
I think
[
https://issues.apache.org/jira/browse/ARROW-7996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055760#comment-17055760
]
Joris Van den Bossche commented on ARROW-7996:
--
The error comes from deserializing the
[
https://issues.apache.org/jira/browse/ARROW-7680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055815#comment-17055815
]
Joris Van den Bossche commented on ARROW-7680:
--
Since ARROW-7677 is not yet resolved, I
[
https://issues.apache.org/jira/browse/ARROW-7677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055814#comment-17055814
]
Joris Van den Bossche edited comment on ARROW-7677 at 3/10/20, 10:56 AM:
[
https://issues.apache.org/jira/browse/ARROW-7677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055814#comment-17055814
]
Joris Van den Bossche commented on ARROW-7677:
--
It came up in a partitioned parquet dataset
[
https://issues.apache.org/jira/browse/ARROW-8010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche closed ARROW-8010.
Resolution: Duplicate
> [Python] Fixed size list not convertible to Numpy Array /
[
https://issues.apache.org/jira/browse/ARROW-2728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-2728:
-
Component/s: C++ - Dataset
> [Python][C++][Dataset] Support partitioned Parquet
[
https://issues.apache.org/jira/browse/ARROW-3154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-3154:
-
Component/s: C++ - Dataset
> [Python][C++] Document how to write _metadata,
[
https://issues.apache.org/jira/browse/ARROW-7997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055766#comment-17055766
]
Joris Van den Bossche commented on ARROW-7997:
--
[~otaviocv] Thanks for the report! That is
[
https://issues.apache.org/jira/browse/ARROW-7997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7997:
-
Component/s: Python
> [Python] Schema equals method with inconsistent docs in
[
https://issues.apache.org/jira/browse/ARROW-7997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7997:
-
Summary: [Python] Schema equals method with inconsistent docs in pyarrow
(was:
[
https://issues.apache.org/jira/browse/ARROW-8052?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055830#comment-17055830
]
Joris Van den Bossche commented on ARROW-8052:
--
I don't think this should be expected to
[
https://issues.apache.org/jira/browse/ARROW-8093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17058256#comment-17058256
]
Joris Van den Bossche commented on ARROW-8093:
--
This is a duplicate of ARROW-7857 (sorry, I
[
https://issues.apache.org/jira/browse/ARROW-8093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche closed ARROW-8093.
Fix Version/s: (was: 0.17.0)
Resolution: Duplicate
> [CI][Crossbow]
[
https://issues.apache.org/jira/browse/ARROW-7857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7857:
-
Fix Version/s: 0.17.0
> [Python] Failing test with pandas master for extension
[
https://issues.apache.org/jira/browse/ARROW-7996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055756#comment-17055756
]
Joris Van den Bossche commented on ARROW-7996:
--
[~jdavidagudelo] Thanks for the report!
A
[
https://issues.apache.org/jira/browse/ARROW-7996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-7996:
-
Summary: [Python] Error serializing empty pandas DataFrame with pyarrow
(was:
[
https://issues.apache.org/jira/browse/ARROW-7956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17055783#comment-17055783
]
Joris Van den Bossche commented on ARROW-7956:
--
[~wesm] I think this was closed by
[
https://issues.apache.org/jira/browse/ARROW-8060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8060:
-
Fix Version/s: 0.17.0
> [Python] Make dataset Expression objects serializable
>
Joris Van den Bossche created ARROW-8059:
Summary: [Python] Make FileSystem objects serializable
Key: ARROW-8059
URL: https://issues.apache.org/jira/browse/ARROW-8059
Project: Apache Arrow
[
https://issues.apache.org/jira/browse/ARROW-8059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-8059:
-
Fix Version/s: 0.17.0
> [Python] Make FileSystem objects serializable
>
Joris Van den Bossche created ARROW-8060:
Summary: [Python] Make dataset Expression objects serializable
Key: ARROW-8060
URL: https://issues.apache.org/jira/browse/ARROW-8060
Project: Apache
Joris Van den Bossche created ARROW-8062:
Summary: [C++][Dataset] Parquet Dataset factory from a
_metadata/_common_metadata file
Key: ARROW-8062
URL: https://issues.apache.org/jira/browse/ARROW-8062
[
https://issues.apache.org/jira/browse/ARROW-8047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17056235#comment-17056235
]
Joris Van den Bossche commented on ARROW-8047:
--
I also created ARROW-8063 for general user
[
https://issues.apache.org/jira/browse/ARROW-8061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17056248#comment-17056248
]
Joris Van den Bossche commented on ARROW-8061:
--
> Note that parallelism of RowGroup is
Joris Van den Bossche created ARROW-8061:
Summary: [C++][Dataset] Ability to specify granularity of
ParquetFileFragment (support row groups)
Key: ARROW-8061
URL:
[
https://issues.apache.org/jira/browse/ARROW-8061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17056201#comment-17056201
]
Joris Van den Bossche commented on ARROW-8061:
--
Example usecase for this: for Dask, wich
Joris Van den Bossche created ARROW-8063:
Summary: [Python] Add user guide documentation for Datasets API
Key: ARROW-8063
URL: https://issues.apache.org/jira/browse/ARROW-8063
Project: Apache
[
https://issues.apache.org/jira/browse/ARROW-7997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17056241#comment-17056241
]
Joris Van den Bossche commented on ARROW-7997:
--
Actually, there is just today work going on
[
https://issues.apache.org/jira/browse/ARROW-8059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17056251#comment-17056251
]
Joris Van den Bossche commented on ARROW-8059:
--
Specifically for dask's usecase, it might
[
https://issues.apache.org/jira/browse/ARROW-8039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17056284#comment-17056284
]
Joris Van den Bossche commented on ARROW-8039:
--
> We might focus this by saying that the
[
https://issues.apache.org/jira/browse/ARROW-8427?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche reassigned ARROW-8427:
Assignee: Ben Kietzman
> [C++][Dataset] Do not ignore file paths with
Joris Van den Bossche created ARROW-8427:
Summary: [C++][Dataset] Do not ignore file paths with
underscore/dot when full path was specified
Key: ARROW-8427
URL:
[
https://issues.apache.org/jira/browse/ARROW-8276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17084332#comment-17084332
]
Joris Van den Bossche commented on ARROW-8276:
--
Fully agreed.
In my mind, all the
[
https://issues.apache.org/jira/browse/ARROW-7385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17084305#comment-17084305
]
Joris Van den Bossche commented on ARROW-7385:
--
No specific update. A PR is certainly
801 - 900 of 1549 matches
Mail list logo