[ 
https://issues.apache.org/jira/browse/ARROW-17614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17603034#comment-17603034
 ] 

Raúl Cumplido commented on ARROW-17614:
---------------------------------------

An update: some of our nightly packaging jobs and verification tasks have
been failing since the following commit was merged:
[https://github.com/apache/arrow/commit/0527197c0f26d5b65678cd63573583165cf645f1],
which comes from the PR [~westonpace] suggested.

These are the latest related nightly failures:
 * [wheel-macos-big-sur-cp310-universal2|https://github.com/ursacomputing/crossbow/runs/8290965619?check_suite_focus=true]
 * [wheel-macos-big-sur-cp39-arm64|https://github.com/ursacomputing/crossbow/runs/8290953406?check_suite_focus=true]
 * [wheel-macos-big-sur-cp39-universal2|https://github.com/ursacomputing/crossbow/runs/8290969659?check_suite_focus=true]
 * [wheel-macos-high-sierra-cp310-amd64|https://github.com/ursacomputing/crossbow/runs/8290990072?check_suite_focus=true]
 * [wheel-macos-high-sierra-cp37-amd64|https://github.com/ursacomputing/crossbow/runs/8290935376?check_suite_focus=true]
 * [wheel-macos-high-sierra-cp38-amd64|https://github.com/ursacomputing/crossbow/runs/8290991819?check_suite_focus=true]
 * [wheel-macos-high-sierra-cp39-amd64|https://github.com/ursacomputing/crossbow/runs/8290930687?check_suite_focus=true]
 * [wheel-manylinux2014-cp38-arm64|https://github.com/ursacomputing/crossbow/runs/8290946013]
 * [wheel-manylinux2014-cp39-amd64|https://github.com/ursacomputing/crossbow/runs/8290961284?check_suite_focus=true]
 * [wheel-manylinux2014-cp39-arm64|https://github.com/ursacomputing/crossbow/runs/8290937770]
 * [verify-rc-source-python-linux-conda-latest-amd64|https://github.com/ursacomputing/crossbow/runs/8292457991?check_suite_focus=true]
 * [verify-rc-source-python-linux-ubuntu-18.04-amd64|https://github.com/ursacomputing/crossbow/runs/8292457071?check_suite_focus=true]
 * [verify-rc-source-python-linux-ubuntu-20.04-amd64|https://github.com/ursacomputing/crossbow/runs/8292444844?check_suite_focus=true]
 * [verify-rc-source-python-linux-ubuntu-22.04-amd64|https://github.com/ursacomputing/crossbow/runs/8292440712?check_suite_focus=true]
 * [verify-rc-source-python-macos-amd64|https://github.com/ursacomputing/crossbow/runs/8292445750?check_suite_focus=true]
 * [verify-rc-source-python-macos-arm64|https://github.com/ursacomputing/crossbow/runs/8292472391?check_suite_focus=true]

> [CI][Python] test test_write_dataset_max_rows_per_file is producing several 
> nightly build failures
> --------------------------------------------------------------------------------------------------
>
>                 Key: ARROW-17614
>                 URL: https://issues.apache.org/jira/browse/ARROW-17614
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Continuous Integration, Python
>            Reporter: Raúl Cumplido
>            Priority: Critical
>              Labels: Nightly
>             Fix For: 10.0.0
>
>
> The following failure has been seen on multiple nightly builds:
> {code:java}
> _____________________ test_write_dataset_max_rows_per_file _____________________
>
> tempdir = PosixPath('/tmp/pytest-of-root/pytest-0/test_write_dataset_max_rows_pe0')
>
>     @pytest.mark.parquet
>     def test_write_dataset_max_rows_per_file(tempdir):
>         directory = tempdir / 'ds'
>         max_rows_per_file = 10
>         max_rows_per_group = 10
>         num_of_columns = 2
>         num_of_records = 35
>
>         record_batch = _generate_data_and_columns(num_of_columns,
>                                                   num_of_records)
>
>         ds.write_dataset(record_batch, directory, format="parquet",
>                          max_rows_per_file=max_rows_per_file,
> >                        max_rows_per_group=max_rows_per_group)
>
> usr/local/lib/python3.7/site-packages/pyarrow/tests/test_dataset.py:3921:
> _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
> usr/local/lib/python3.7/site-packages/pyarrow/dataset.py:992: in write_dataset
>     min_rows_per_group, max_rows_per_group, create_dir
> pyarrow/_dataset.pyx:2811: in pyarrow._dataset._filesystemdataset_write
>     ???
> _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _
>
> >   ???
> E   FileNotFoundError: [Errno 2] Failed to open local file '/tmp/pytest-of-root/pytest-0/test_write_dataset_max_rows_pe0/ds/part-1.parquet'. Detail: [errno 2] No such file or directory
> {code}
> Examples of failed builds:
> [verify-rc-source-python-macos-conda-amd64|https://github.com/ursacomputing/crossbow/runs/8176702861?check_suite_focus=true]
> [wheel-manylinux2014-cp37-amd64|https://github.com/ursacomputing/crossbow/runs/8175319639?check_suite_focus=true]
> The test seems flaky: some nightly jobs that ran on a previous day, with no
> new commits in between, were successful.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
