[jira] [Created] (ARROW-17294) [Release] Update remove old artifacts release script

2022-08-03 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-17294:
---

 Summary: [Release] Update remove old artifacts release script
 Key: ARROW-17294
 URL: https://issues.apache.org/jira/browse/ARROW-17294
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 10.0.0


I just executed the remove old artifacts release script which also removed the 
previously created three patch releases for 6.0.2, 7.0.1, 8.0.1. 

That's not desirable since those have just been released so I had to revert to 
an earlier revision.

cc [~kou] [~assignUser] [~raulcd]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17260) [Release] Java jars verification pass despite that nothing has been uploaded

2022-07-29 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-17260:
---

 Summary: [Release] Java jars verification pass despite that 
nothing has been uploaded
 Key: ARROW-17260
 URL: https://issues.apache.org/jira/browse/ARROW-17260
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs


Build do pass, despite that I forgot to upload the java binaries: 
https://github.com/ursacomputing/crossbow/runs/7587084181?check_suite_focus=true
 

cc [~assignUser] [~raulcd]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17238) [Release] Turn off GCS testing during wheel verification

2022-07-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-17238:
---

 Summary: [Release] Turn off GCS testing during wheel verification
 Key: ARROW-17238
 URL: https://issues.apache.org/jira/browse/ARROW-17238
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 9.0.0






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17233) [Crossbow] Outdated artifact patterns for certain linux jobs

2022-07-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-17233:
---

 Summary: [Crossbow] Outdated artifact patterns for certain linux 
jobs
 Key: ARROW-17233
 URL: https://issues.apache.org/jira/browse/ARROW-17233
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs


almalinux-8-arm64 and almalinux-9-arm64:
{code}
  arrow-flight-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow-flight-glib-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
   arrow-flight-glib-doc-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
  arrow-flight-sql-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow-flight-sql-glib-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
   arrow-flight-sql-glib-doc-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
arrow[0-9]+-flight-glib-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm 
[MISSING]
arrow[0-9]+-flight-glib-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
   arrow[0-9]+-flight-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow[0-9]+-flight-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
arrow[0-9]+-flight-sql-glib-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm 
[MISSING]
arrow[0-9]+-flight-sql-glib-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
arrow[0-9]+-flight-sql-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow[0-9]+-flight-sql-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow-glib-devel-9.0.0-1.el8.aarch64.rpm [ OK]
   arrow-glib-doc-9.0.0-1.el8.aarch64.rpm [ OK]
   arrow9-glib-libs-debuginfo-9.0.0-1.el8.aarch64.rpm [ OK]
 arrow9-glib-libs-9.0.0-1.el8.aarch64.rpm [ OK]
arrow9-libs-debuginfo-9.0.0-1.el8.aarch64.rpm [ OK]
  arrow9-libs-9.0.0-1.el8.aarch64.rpm [ OK]
   arrow-python-devel-9.0.0-1.el8.aarch64.rpm [ OK]
   arrow-python-flight-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
arrow[0-9]+-python-flight-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm 
[MISSING]
  arrow[0-9]+-python-flight-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
{code}


centos-7-amd64
{code}
  arrow-python-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow[0-9]+-python-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
{code}

centos-8-arm64 and centos-9-arm64:
{code}
 arrow-flight-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow-flight-glib-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
   arrow-flight-glib-doc-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
  arrow-flight-sql-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow-flight-sql-glib-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
   arrow-flight-sql-glib-doc-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
arrow[0-9]+-flight-glib-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm 
[MISSING]
arrow[0-9]+-flight-glib-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
   arrow[0-9]+-flight-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow[0-9]+-flight-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
arrow[0-9]+-flight-sql-glib-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm 
[MISSING]
arrow[0-9]+-flight-sql-glib-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
arrow[0-9]+-flight-sql-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow[0-9]+-flight-sql-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
 arrow-glib-devel-9.0.0-1.el8.aarch64.rpm [ OK]
   arrow-glib-doc-9.0.0-1.el8.aarch64.rpm [ OK]
   arrow9-glib-libs-debuginfo-9.0.0-1.el8.aarch64.rpm [ OK]
 arrow9-glib-libs-9.0.0-1.el8.aarch64.rpm [ OK]
arrow9-libs-debuginfo-9.0.0-1.el8.aarch64.rpm [ OK]
  arrow9-libs-9.0.0-1.el8.aarch64.rpm [ OK]
   arrow-python-devel-9.0.0-1.el8.aarch64.rpm [ OK]
   arrow-python-flight-devel-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
arrow[0-9]+-python-flight-libs-debuginfo-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm 
[MISSING]
  arrow[0-9]+-python-flight-libs-9.0.0-1.[a-z0-9]+.[a-z0-9_]+.rpm [MISSING]
{code}

ubuntu-bionic-amd64 / ubuntu-bionic-arm64:
{code}
libarrow-python-dev_9.0.0-1_[a-z0-9]+.deb [MISSING]
 libarrow-python-flight-dev_9.0.0-1_[a-z0-9]+.deb [MISSING]
 libarrow-python-flight900-dbgsym_9.0.0-1_[a-z0-9]+.d?deb [MISSING]
  libarrow-python-flight900_9.0.0-1_[a-z0-9]+.deb [MISSING]

[jira] [Created] (ARROW-17232) [Release] Missing R binary packages

2022-07-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-17232:
---

 Summary: [Release] Missing R binary packages
 Key: ARROW-17232
 URL: https://issues.apache.org/jira/browse/ARROW-17232
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs


Seems like the binary upload script now expects some R binaries to upload, but 
the {{packaging}} crossbow task group doesn't contain any relevant tasks. 

I assume the {{r-binary-packages}} should be added to the {{packaging}} group. 

cc [~kou][~raulcd][~assignUser]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17227) [C++] Extend hash-join unit tests to cover both empty and length=0 batches

2022-07-27 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-17227:
---

 Summary: [C++] Extend hash-join unit tests to cover both empty and 
length=0 batches
 Key: ARROW-17227
 URL: https://issues.apache.org/jira/browse/ARROW-17227
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: Krisztian Szucs
Assignee: Weston Pace
 Fix For: 9.0.0






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-16767) [Archery] Refactor archery.release submodule to its own subpackage

2022-06-07 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16767:
---

 Summary: [Archery] Refactor archery.release submodule to its own 
subpackage
 Key: ARROW-16767
 URL: https://issues.apache.org/jira/browse/ARROW-16767
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Archery
Reporter: Krisztian Szucs
 Fix For: 9.0.0






--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16654) [Dev][Archery] Support cherry-picking for major releases

2022-05-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16654:
---

 Summary: [Dev][Archery] Support cherry-picking for major releases 
 Key: ARROW-16654
 URL: https://issues.apache.org/jira/browse/ARROW-16654
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Archery, Developer Tools
Reporter: Krisztian Szucs
 Fix For: 9.0.0






--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16589) [CI][Dev] Make tasks.yml easier to maintain

2022-05-16 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16589:
---

 Summary: [CI][Dev] Make tasks.yml easier to maintain
 Key: ARROW-16589
 URL: https://issues.apache.org/jira/browse/ARROW-16589
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Continuous Integration, Developer Tools
Reporter: Krisztian Szucs


I think {{dev/tasks/tasks.yml}} has reached its limits by using jinja2 
templated yml. 

We should think about a better way to define crossbow jobs while:
- keeping it readable
- in a dialect which is natively supported by editors
- while supporting tasks parametrization

Just one idea is to use python files containing python objects, e.g.:

{code}
Task(
  name="wheel-macos-big-sur-cp38-arm64",
  ci="github",
  template="python-wheels/github.osx.arm64.yml",
  params=dict(
arch="arm64",
arrow_simd_level="DEFAULT",
python_version="3.8",
macos_deployment_target="11.0"
  ),
  artifacts=[
"pyarrow-{no_rc_version}-cp38-cp38-macosx_11_0_arm64.whl"
  ]
)
{code}

where {{Task}} would be the crossbow task class (which could be refactored to 
use pydantic or another alternative for less boilerplate). Of course porting to 
the tasks definitions to plain python could make the situation even worse by 
accessing too many scripting utilities. We could try a dynamic config language 
which sits between yaml and python like HCL.

[~kou] what syntax would you be comfortable to work with? Do you have any 
alternatives we could use?

cc [~amol-] [~raulcd] [~assignUser]




--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16332) [Release] Java jars verification pass despite binaries not being uploaded

2022-04-26 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16332:
---

 Summary: [Release] Java jars verification pass despite binaries 
not being uploaded
 Key: ARROW-16332
 URL: https://issues.apache.org/jira/browse/ARROW-16332
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 9.0.0


See results at 
https://github.com/apache/arrow/pull/12991#issuecomment-1109525407



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16315) [Python] Cython api test fails with allocation error on windows

2022-04-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16315:
---

 Summary: [Python] Cython api test fails with allocation error on 
windows
 Key: ARROW-16315
 URL: https://issues.apache.org/jira/browse/ARROW-16315
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Krisztian Szucs
 Fix For: 9.0.0


Getting memory pool deallocation errors 
https://github.com/ursacomputing/crossbow/runs/6154173225?check_suite_focus=true#step:6:33401



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16314) [Python][CI] Skip running cython tests in windows verification builds

2022-04-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16314:
---

 Summary: [Python][CI] Skip running cython tests in windows 
verification builds
 Key: ARROW-16314
 URL: https://issues.apache.org/jira/browse/ARROW-16314
 Project: Apache Arrow
  Issue Type: Bug
  Components: Continuous Integration, Python
Reporter: Krisztian Szucs


Getting memory pool errors 
https://github.com/ursacomputing/crossbow/runs/6154173225?check_suite_focus=true#step:6:33401




--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16312) [C++][CI] Install tzdata in the windows verification builds

2022-04-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16312:
---

 Summary: [C++][CI] Install tzdata in the windows verification 
builds
 Key: ARROW-16312
 URL: https://issues.apache.org/jira/browse/ARROW-16312
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++, Continuous Integration
Reporter: Krisztian Szucs
 Fix For: 8.0.0


See build log 
https://github.com/ursacomputing/crossbow/runs/614860?check_suite_focus=true



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16301) [C#][CI] Fix docker configuration for .NET 6

2022-04-24 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16301:
---

 Summary: [C#][CI] Fix docker configuration for .NET 6
 Key: ARROW-16301
 URL: https://issues.apache.org/jira/browse/ARROW-16301
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C#, Continuous Integration
Reporter: Krisztian Szucs
 Fix For: 8.0.0


Forgot to update the docker setup in 
https://github.com/apache/arrow/commit/f275f50792fb80e1615427620fd32681ecf3e07a



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16284) [Python][Packaging] Use delocate-fuse to create universal2 wheels

2022-04-22 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-16284:
---

 Summary: [Python][Packaging] Use delocate-fuse to create 
universal2 wheels
 Key: ARROW-16284
 URL: https://issues.apache.org/jira/browse/ARROW-16284
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Packaging, Python
Reporter: Krisztian Szucs


Previously we used specific universal2 configurations for vcpkg to build the 
dependencies containing symbols for both architectures. This approach proved to 
be fragile to vcpkg changes making it hard to upgrade the vcpkg version. As an 
example https://github.com/apache/arrow/pull/12893 bumps the vcpkg version 
where absl has stopped compiling for two CMAKE_OSX_ARCHITECTURES, it has been 
already fixed in absl's upstream but that hasn't been released yet.

The new approach uses multibuild's delocate to build the wheels for both arm64 
and amd64 separately and fuse them in an upcoming step to a universal2 wheel 
(using {{lipo}} under the hood).



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-15555) [Release] Post release version bumping script tries to push the release tag

2022-02-03 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-1:
---

 Summary: [Release] Post release version bumping script tries to 
push the release tag
 Key: ARROW-1
 URL: https://issues.apache.org/jira/browse/ARROW-1
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 8.0.0


fatal: tag 'apache-arrow-7.0.0' already exists





--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15504) [Python] Ensure to test ORC bindings

2022-01-30 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15504:
---

 Summary: [Python] Ensure to test ORC bindings
 Key: ARROW-15504
 URL: https://issues.apache.org/jira/browse/ARROW-15504
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Krisztian Szucs
 Fix For: 8.0.0


See conversation 
https://github.com/apache/arrow/commit/f9f6fdbb7518c09b833cb6b78bc202008d28e865#r64854632



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15499) [Python] Fix import error in pyarrow._orc

2022-01-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15499:
---

 Summary: [Python] Fix import error in pyarrow._orc
 Key: ARROW-15499
 URL: https://issues.apache.org/jira/browse/ARROW-15499
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15486) [Relase][Java] Verify staged maven artifacts

2022-01-27 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15486:
---

 Summary: [Relase][Java] Verify staged maven artifacts
 Key: ARROW-15486
 URL: https://issues.apache.org/jira/browse/ARROW-15486
 Project: Apache Arrow
  Issue Type: Bug
Reporter: Krisztian Szucs


We have two tests right now:
1. Execute {{mvn test}} from the source tarball's java directory testing the 
source 
https://github.com/apache/arrow/blob/master/dev/release/verify-release-candidate.sh#L278
2. Verify the checksums and signatures of the uploaded maven artifacts 
https://github.com/apache/arrow/blob/master/dev/release/verify-release-candidate.sh#L766

But we don't actually *test* the packages. We should add that to the 
verification scripts.

cc [~kou] [~anthonylouis]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15485) [Release][Java] Fix java jars upload script

2022-01-27 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15485:
---

 Summary: [Release][Java] Fix java jars upload script 
 Key: ARROW-15485
 URL: https://issues.apache.org/jira/browse/ARROW-15485
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools, Java
Reporter: Krisztian Szucs
 Fix For: 8.0.0


Locally not existing files get uploaded to maven.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15483) [Release] Exercise source verification builds on a nightly basis

2022-01-27 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15483:
---

 Summary: [Release] Exercise source verification builds on a 
nightly basis
 Key: ARROW-15483
 URL: https://issues.apache.org/jira/browse/ARROW-15483
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 8.0.0


We need to update the verification scripts to support specific git revisions 
without checking the signatures, then we can simply submit the verification 
tasks using crossbow.

cc [~kou]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15456) [Release] Automatize source verification task submission

2022-01-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15456:
---

 Summary: [Release] Automatize source verification task submission
 Key: ARROW-15456
 URL: https://issues.apache.org/jira/browse/ARROW-15456
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Developer Tools
Reporter: Krisztian Szucs


The workflow would look like this:
{code}
git push -u apache release-
git push -u apache release--rc
git push -u apache apache-arrow-

dev/release/02-source.sh  
dev/release/03-source-verify.sh  
{code}

Where {{03-source-verify.sh}} would create a pull request and submit crossbow 
source verification tasks by either:
a. placing a github comment triggering the comment bot 
b. calling crossbow locally then placing a comment to the PR using the same 
{{archery.crossbow.CommentReport}} class

The resulting PR should look like this 
https://github.com/apache/arrow/pull/12262

Opinions @kou?



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15453) [Crossbow] Unable to parse github owner/repository pair

2022-01-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15453:
---

 Summary: [Crossbow] Unable to parse github owner/repository pair
 Key: ARROW-15453
 URL: https://issues.apache.org/jira/browse/ARROW-15453
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 8.0.0


See build log: 
https://github.com/ursacomputing/crossbow/runs/4939685651?check_suite_focus=true#step:12:118

Should support plain http urls, like 
'https://github.com/ursacomputing/crossbow/'



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15450) [Python][Wheel] Flight test receives SIGKILL during in macOS tests

2022-01-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15450:
---

 Summary: [Python][Wheel] Flight test receives SIGKILL during in 
macOS tests
 Key: ARROW-15450
 URL: https://issues.apache.org/jira/browse/ARROW-15450
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


See build: 
https://github.com/ursacomputing/crossbow/runs/4928437869?check_suite_focus=true#step:4:2967

cc [~davidli]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15449) [Release] Add post-{num}-changelog.sh to update CHANGELOG.md

2022-01-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15449:
---

 Summary: [Release] Add post-{num}-changelog.sh to update 
CHANGELOG.md
 Key: ARROW-15449
 URL: https://issues.apache.org/jira/browse/ARROW-15449
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 8.0.0


See https://github.com/apache/arrow/pull/12235#discussion_r791194366

It's going to prevent issues like 
https://issues.apache.org/jira/browse/ARROW-13460



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15448) [C++] Use apache mirror system to download ORC's source

2022-01-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15448:
---

 Summary: [C++] Use apache mirror system to download ORC's source
 Key: ARROW-15448
 URL: https://issues.apache.org/jira/browse/ARROW-15448
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 7.0.0


By the recent switch to bundled ORC builds in the wheels has surfaced flaky 
download issues from apache dist which should be discouraged to use.





--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15447) [C++] ORC adapter fails to compile due to name conflict

2022-01-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15447:
---

 Summary: [C++] ORC adapter fails to compile due to name conflict
 Key: ARROW-15447
 URL: https://issues.apache.org/jira/browse/ARROW-15447
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Reporter: Krisztian Szucs
 Fix For: 7.0.0


See build 
https://github.com/ursacomputing/crossbow/runs/4932765676?check_suite_focus=true#step:5:1191




--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15442) [Python] GDB test cannot locate libarrow

2022-01-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15442:
---

 Summary: [Python] GDB test cannot locate libarrow
 Key: ARROW-15442
 URL: https://issues.apache.org/jira/browse/ARROW-15442
 Project: Apache Arrow
  Issue Type: Bug
Reporter: Krisztian Szucs


See build 
https://github.com/ursacomputing/crossbow/runs/4930447399?check_suite_focus=true#step:5:16777

cc [~apitrou]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15436) [Release][Python] Disable verification of gdb tests on windows and a flaky test on apple M1

2022-01-24 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15436:
---

 Summary: [Release][Python] Disable verification of gdb tests on 
windows and a flaky test on apple M1
 Key: ARROW-15436
 URL: https://issues.apache.org/jira/browse/ARROW-15436
 Project: Apache Arrow
  Issue Type: Task
  Components: Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 8.0.0


See verification problems occured in https://github.com/apache/arrow/pull/12235



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15420) [Python] Sdist packaging build is failing due to missing GDB script

2022-01-23 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15420:
---

 Summary: [Python] Sdist packaging build is failing due to missing 
GDB script
 Key: ARROW-15420
 URL: https://issues.apache.org/jira/browse/ARROW-15420
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 7.0.0


See nightly build log 
https://github.com/ursacomputing/crossbow/runs/4911185725?check_suite_focus=true



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15417) [Python][Packaging] Windows wheels are crashing due to AWS SDK error

2022-01-23 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15417:
---

 Summary: [Python][Packaging] Windows wheels are crashing due to 
AWS SDK error
 Key: ARROW-15417
 URL: https://issues.apache.org/jira/browse/ARROW-15417
 Project: Apache Arrow
  Issue Type: Bug
  Components: Packaging, Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Sadly we have an unexpected crash during the windows wheel
verification which needs to be investigated:
https://github.com/apache/arrow/pull/12224#issuecomment-1018910642



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15416) [Python] Add option to skip gdb tests

2022-01-23 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15416:
---

 Summary: [Python] Add option to skip gdb tests
 Key: ARROW-15416
 URL: https://issues.apache.org/jira/browse/ARROW-15416
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 7.0.0


The newly added gdb feature tests are failing on macos M1 in the wheel 
verification builds due to not universal2 gdb binary: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2022-01-23-0-github-wheel-macos-big-sur-cp39-arm64





--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15404) [Java][Packaging] Use bundled ORC for building java JNI jars

2022-01-21 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15404:
---

 Summary: [Java][Packaging] Use bundled ORC for building java JNI 
jars
 Key: ARROW-15404
 URL: https://issues.apache.org/jira/browse/ARROW-15404
 Project: Apache Arrow
  Issue Type: Bug
  Components: Java, Packaging
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 7.0.0


Forgot to update the JNI files in https://github.com/apache/arrow/pull/12153



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15403) [Python] Fails to build python wheels due to depending on more recent ORC

2022-01-21 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15403:
---

 Summary: [Python] Fails to build python wheels due to depending on 
more recent ORC
 Key: ARROW-15403
 URL: https://issues.apache.org/jira/browse/ARROW-15403
 Project: Apache Arrow
  Issue Type: Bug
  Components: Packaging, Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 7.0.0


See build log: 
https://github.com/ursacomputing/crossbow/runs/4894370329?check_suite_focus=true#step:6:1469

That API is available since https://issues.apache.org/jira/browse/ORC-984 but 
vcpkg doesn't ship any of the versions highlighted in the ticket.




--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15401) [Python] Gdb tests are failing on windows

2022-01-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15401:
---

 Summary: [Python] Gdb tests are failing on windows
 Key: ARROW-15401
 URL: https://issues.apache.org/jira/browse/ARROW-15401
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


See build 
https://github.com/ursacomputing/crossbow/runs/4889157090?check_suite_focus=true#step:5:31451

cc [~apitrou]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15400) [Go][CI] Exercise builds on arm machines

2022-01-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15400:
---

 Summary: [Go][CI] Exercise builds on arm machines 
 Key: ARROW-15400
 URL: https://issues.apache.org/jira/browse/ARROW-15400
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Continuous Integration, Go
Reporter: Krisztian Szucs
 Fix For: 8.0.0


Preferably on travis for pull requests and we can create an additional crossbow 
job to also test on apple M1 on a nightly basis. 

cc [~zeroshade]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15399) [Release][JS] Increase minimum NodeJS version to 16

2022-01-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15399:
---

 Summary: [Release][JS] Increase minimum NodeJS version to 16
 Key: ARROW-15399
 URL: https://issues.apache.org/jira/browse/ARROW-15399
 Project: Apache Arrow
  Issue Type: Task
  Components: JavaScript
Reporter: Krisztian Szucs
 Fix For: 7.0.0






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15395) [Release][Ruby] Ruby verification fails on M1

2022-01-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15395:
---

 Summary: [Release][Ruby] Ruby verification fails on M1
 Key: ARROW-15395
 URL: https://issues.apache.org/jira/browse/ARROW-15395
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 7.0.0


See build log 
https://github.com/ursacomputing/crossbow/runs/4883657307?check_suite_focus=true#step:4:8653

While this is not a blocker I may need to cut another release candidate 
meanwhile.

cc [~kou]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15393) [Release][Crossbow] Fall back to 0 distance when generating scm version

2022-01-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15393:
---

 Summary: [Release][Crossbow] Fall back to 0 distance when 
generating scm version
 Key: ARROW-15393
 URL: https://issues.apache.org/jira/browse/ARROW-15393
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 8.0.0


The generated SCM version number in the verification tasks is `8.0.0devNone` 
which raises an error from setup.py



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15392) [JS] Flaky javascript unittest

2022-01-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15392:
---

 Summary: [JS] Flaky javascript unittest
 Key: ARROW-15392
 URL: https://issues.apache.org/jira/browse/ARROW-15392
 Project: Apache Arrow
  Issue Type: Bug
  Components: JavaScript
Reporter: Krisztian Szucs


See build log: 
https://github.com/ursacomputing/crossbow/runs/4871354453?check_suite_focus=true#step:5:8164

While the error is flaky it occurs pretty often.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15380) [Python][Release] NumPy ABI incompatibility during verification

2022-01-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15380:
---

 Summary: [Python][Release] NumPy ABI incompatibility during 
verification
 Key: ARROW-15380
 URL: https://issues.apache.org/jira/browse/ARROW-15380
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


See build 
https://github.com/ursacomputing/crossbow/runs/4871349353?check_suite_focus=true#step:5:12115



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15378) [C++][Release] GTest linking error during windows verification

2022-01-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15378:
---

 Summary: [C++][Release] GTest linking error during windows 
verification
 Key: ARROW-15378
 URL: https://issues.apache.org/jira/browse/ARROW-15378
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Reporter: Krisztian Szucs
 Fix For: 7.0.0


See build 
https://github.com/ursacomputing/crossbow/runs/4871374560?check_suite_focus=true#step:5:1274



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15377) [JS][Release] JavaScript verification fails

2022-01-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15377:
---

 Summary: [JS][Release] JavaScript verification fails
 Key: ARROW-15377
 URL: https://issues.apache.org/jira/browse/ARROW-15377
 Project: Apache Arrow
  Issue Type: Bug
  Components: JavaScript
Reporter: Krisztian Szucs
 Fix For: 7.0.0


See build log 
https://github.com/ursacomputing/crossbow/runs/4871354453?check_suite_focus=true#step:5:8164





--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15376) [Go][Release] Go verification fails

2022-01-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15376:
---

 Summary: [Go][Release] Go verification fails
 Key: ARROW-15376
 URL: https://issues.apache.org/jira/browse/ARROW-15376
 Project: Apache Arrow
  Issue Type: Bug
  Components: Go
Reporter: Krisztian Szucs
 Fix For: 7.0.0


See build error 
https://github.com/ursacomputing/crossbow/runs/4871355213?check_suite_focus=true#step:4:2703



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15372) [C++][Gandiva] Gandiva now depends on boost/crc.hpp which is missing from the trimmed boost archive

2022-01-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15372:
---

 Summary: [C++][Gandiva] Gandiva now depends on boost/crc.hpp which 
is missing from the trimmed boost archive
 Key: ARROW-15372
 URL: https://issues.apache.org/jira/browse/ARROW-15372
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++, C++ - Gandiva
Affects Versions: 7.0.0
Reporter: Krisztian Szucs


See build error 
https://github.com/ursacomputing/crossbow/runs/4871392838?check_suite_focus=true#step:5:11762



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15371) [Release] Missing libsqlite-dev from the verification docker images

2022-01-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15371:
---

 Summary: [Release] Missing libsqlite-dev from the verification 
docker images
 Key: ARROW-15371
 URL: https://issues.apache.org/jira/browse/ARROW-15371
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Developer Tools
Reporter: Krisztian Szucs


See build error 
https://github.com/ursacomputing/crossbow/runs/4870407487?check_suite_focus=true#step:5:4852



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15355) [Docs] Trigger sphinx build on documentation changes

2022-01-17 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15355:
---

 Summary: [Docs] Trigger sphinx build on documentation changes
 Key: ARROW-15355
 URL: https://issues.apache.org/jira/browse/ARROW-15355
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation
Reporter: Krisztian Szucs
 Fix For: 7.0.0






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15133) [CI] Removing util_checkout.sh and util_cleanup.sh scripts

2021-12-16 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15133:
---

 Summary: [CI] Removing util_checkout.sh and util_cleanup.sh scripts
 Key: ARROW-15133
 URL: https://issues.apache.org/jira/browse/ARROW-15133
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Continuous Integration
Reporter: Krisztian Szucs
 Fix For: 7.0.0


- ci/scripts/util_checkout.sh was used to checkout submodules because 
actions/checkout@v2 has removed support for that, but they have restored it 
since.
- ci/scripts/util_cleanup.sh was used to free up disk space on github actions 
runners, because at that time it was limited to 7GB, from a recent run it looks 
like the linux runners now have 32GB free space so we can try to disable the 
cleanup step sparing almost a minute of build time



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-15006) [Python][Doc] Iteratively enable more numpydoc checks

2021-12-07 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-15006:
---

 Summary: [Python][Doc] Iteratively enable more numpydoc checks
 Key: ARROW-15006
 URL: https://issues.apache.org/jira/browse/ARROW-15006
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation, Python
Reporter: Krisztian Szucs


Asof https://github.com/apache/arrow/pull/7732 we're going to have a numpydoc 
check running on pull requests. There is a single rule enabled at the moment: 
PR01

Additional checks we can run:

{code}
ERROR_MSGS = {
"GL01": "Docstring text (summary) should start in the line immediately "
"after the opening quotes (not in the same line, or leaving a "
"blank line in between)",
"GL02": "Closing quotes should be placed in the line after the last text "
"in the docstring (do not close the quotes in the same line as "
"the text, or leave a blank line between the last text and the "
"quotes)",
"GL03": "Double line break found; please use only one blank line to "
"separate sections or paragraphs, and do not leave blank lines "
"at the end of docstrings",
"GL05": 'Tabs found at the start of line "{line_with_tabs}", please use '
"whitespace only",
"GL06": 'Found unknown section "{section}". Allowed sections are: '
"{allowed_sections}",
"GL07": "Sections are in the wrong order. Correct order is: 
{correct_sections}",
"GL08": "The object does not have a docstring",
"GL09": "Deprecation warning should precede extended summary",
"GL10": "reST directives {directives} must be followed by two colons",
"SS01": "No summary found (a short summary in a single line should be "
"present at the beginning of the docstring)",
"SS02": "Summary does not start with a capital letter",
"SS03": "Summary does not end with a period",
"SS04": "Summary contains heading whitespaces",
"SS05": "Summary must start with infinitive verb, not third person "
'(e.g. use "Generate" instead of "Generates")',
"SS06": "Summary should fit in a single line",
"ES01": "No extended summary found",
"PR01": "Parameters {missing_params} not documented",
"PR02": "Unknown parameters {unknown_params}",
"PR03": "Wrong parameters order. Actual: {actual_params}. "
"Documented: {documented_params}",
"PR04": 'Parameter "{param_name}" has no type',
"PR05": 'Parameter "{param_name}" type should not finish with "."',
"PR06": 'Parameter "{param_name}" type should use "{right_type}" instead '
'of "{wrong_type}"',
"PR07": 'Parameter "{param_name}" has no description',
"PR08": 'Parameter "{param_name}" description should start with a '
"capital letter",
"PR09": 'Parameter "{param_name}" description should finish with "."',
"PR10": 'Parameter "{param_name}" requires a space before the colon '
"separating the parameter name and type",
"RT01": "No Returns section found",
"RT02": "The first line of the Returns section should contain only the "
"type, unless multiple values are being returned",
"RT03": "Return value has no description",
"RT04": "Return value description should start with a capital letter",
"RT05": 'Return value description should finish with "."',
"YD01": "No Yields section found",
"SA01": "See Also section not found",
"SA02": "Missing period at end of description for See Also "
'"{reference_name}" reference',
"SA03": "Description should be capitalized for See Also "
'"{reference_name}" reference',
"SA04": 'Missing description for See Also "{reference_name}" reference',
"EX01": "No examples section found",
}
{code}

cc [~alenkaf] [~amol-] [~jorisvandenbossche]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14996) [Python][Gandiva] Deprecate of hide make_projector and make_filter testing utilities

2021-12-06 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14996:
---

 Summary: [Python][Gandiva] Deprecate of hide make_projector and 
make_filter testing utilities
 Key: ARROW-14996
 URL: https://issues.apache.org/jira/browse/ARROW-14996
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Krisztian Szucs


{{pyarrow.gandiva.{make_filter, make_projector}}} functions are only used from 
gandiva unittests. Additionally unexpected arguments can cause segmentations 
faults. We either should deprecate or hide these functions from the public API.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14995) [Doc][Python] Document missing arguments for pyarrow.flight objects

2021-12-06 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14995:
---

 Summary: [Doc][Python] Document missing arguments for 
pyarrow.flight objects
 Key: ARROW-14995
 URL: https://issues.apache.org/jira/browse/ARROW-14995
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation, Python
Reporter: Krisztian Szucs


To see the list of undocumented arguments:
1. uncomment 
https://github.com/apache/arrow/pull/7732/files#diff-fafe69518755e93c6d34fd8d0b5e722a2dc23c30920223015b8a80faa0b98db8R249
2. execute {{archery numpydoc -a PR01}}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14991) [Packaging][Python] Windows wheel builds are failing due to wrong vcpkg triplet name

2021-12-06 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14991:
---

 Summary: [Packaging][Python] Windows wheel builds are failing due 
to wrong vcpkg triplet name
 Key: ARROW-14991
 URL: https://issues.apache.org/jira/browse/ARROW-14991
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Packaging, Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 7.0.0


See build log 
https://github.com/ursacomputing/crossbow/runs/4426753814?check_suite_focus=true#step:7:192



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14968) [Python] Pin numpy build dependency using oldest-supported-numpy

2021-12-02 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14968:
---

 Summary: [Python] Pin numpy build dependency using 
oldest-supported-numpy
 Key: ARROW-14968
 URL: https://issues.apache.org/jira/browse/ARROW-14968
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14962) [CI] Fix minio installation on s390x

2021-12-01 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14962:
---

 Summary: [CI] Fix minio installation on s390x
 Key: ARROW-14962
 URL: https://issues.apache.org/jira/browse/ARROW-14962
 Project: Apache Arrow
  Issue Type: Bug
  Components: Continuous Integration
Reporter: Krisztian Szucs
 Fix For: 7.0.0






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14932) [CI][Python] Prefer mamba over conda

2021-11-30 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14932:
---

 Summary: [CI][Python] Prefer mamba over conda 
 Key: ARROW-14932
 URL: https://issues.apache.org/jira/browse/ARROW-14932
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Continuous Integration, Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Mamba should provide quicker docker image builds compared to conda.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14928) [Python][Packaging] Remove boost-filesystem vcpkg dependency from the wheel dockerfiles

2021-11-30 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14928:
---

 Summary: [Python][Packaging] Remove boost-filesystem vcpkg 
dependency from the wheel dockerfiles
 Key: ARROW-14928
 URL: https://issues.apache.org/jira/browse/ARROW-14928
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Packaging, Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


We don't build the C++ tests there so boost-filesystem can be omitted.

See comment https://github.com/apache/arrow/pull/11569#discussion_r759270985



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14879) [Python][Packaging] Remove manylinux2010 wheels

2021-11-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14879:
---

 Summary: [Python][Packaging] Remove manylinux2010 wheels
 Key: ARROW-14879
 URL: https://issues.apache.org/jira/browse/ARROW-14879
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Packaging, Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


More recent vcpkg is not compatible with older glibc shipped by manylinux2010 
so we won't be able to regularly update the dependencies. Besides that 
manylinux2010 has reached EOL.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14587) [CI][Crossbow] Fetch a single crossbow branch instead of the full repo on Azure

2021-11-04 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14587:
---

 Summary: [CI][Crossbow] Fetch a single crossbow branch instead of 
the full repo on Azure
 Key: ARROW-14587
 URL: https://issues.apache.org/jira/browse/ARROW-14587
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Continuous Integration
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Since crossbow has a lot of references the checkout step can take a long time, 
see build 
https://dev.azure.com/ursacomputing/crossbow/_build/results?buildId=14952=logs=0da5d1d9-276d-5173-c4c4-9d4d4ed14fdb=5bbb8710-d4c1-5a8b-fc80-a388730cf6ac

We should alter the azure crossbow template to explicitly check out the task's 
branch using 
{{ {{ task.branch }} }} jinja variable.

See azure documentation: 
https://docs.microsoft.com/en-us/azure/devops/pipelines/repos/multi-repo-checkout?view=azure-devops#checking-out-a-specific-ref



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14512) [Java][Doc] JavaDoc errors while building the docs

2021-10-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14512:
---

 Summary: [Java][Doc] JavaDoc errors while building the docs
 Key: ARROW-14512
 URL: https://issues.apache.org/jira/browse/ARROW-14512
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation, Java
Reporter: Krisztian Szucs
 Fix For: 7.0.0


On JDK 11: 
https://github.com/apache/arrow/runs/4037920463?check_suite_focus=true#step:8:4913



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14505) [CI][Docs] Exercise documentation builds on the main branch

2021-10-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14505:
---

 Summary: [CI][Docs] Exercise documentation builds on the main 
branch 
 Key: ARROW-14505
 URL: https://issues.apache.org/jira/browse/ARROW-14505
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Continuous Integration, Documentation
Reporter: Krisztian Szucs
 Fix For: 7.0.0


We regularly have documentation build issues since the build has been disabled 
on github actions. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14499) [Docs] Version dropdown side-by-side with search box

2021-10-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14499:
---

 Summary: [Docs] Version dropdown side-by-side with search box
 Key: ARROW-14499
 URL: https://issues.apache.org/jira/browse/ARROW-14499
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation
Reporter: Krisztian Szucs
Assignee: Joris Van den Bossche
 Fix For: 7.0.0, 6.0.1


Small follow-up on #11283 to improve the styling of the version dropdown.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14498) [Docs] Make it possible to regenerate older docs with additional patch(es)

2021-10-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14498:
---

 Summary: [Docs] Make it possible to regenerate older docs with 
additional patch(es)
 Key: ARROW-14498
 URL: https://issues.apache.org/jira/browse/ARROW-14498
 Project: Apache Arrow
  Issue Type: Wish
  Components: Documentation
Reporter: Krisztian Szucs
 Fix For: 7.0.0


We may need to regenerate older docs to include new changes, e.g. the new 
version dropdown feature. 

Since we need to regenerate the docs for the first time, it would be great if 
we could encapsulate this in a script. After applying the patch {{archery 
docker run ubuntu-docs}} should do the rest, similarly like we use in the 
post-release task 
https://github.com/apache/arrow/blob/master/dev/release/post-09-docs.sh

```
dev/release/generate-docs.sh  

dev/release/generate-docs.sh 6.0.0  # no patch required
dev/release/generate-docs.sh 5.0.0 docs.patch
dev/release/generate-docs.sh 4.0.0 docs.patch
dev/release/generate-docs.sh 3.0.0 docs.patch

# then deploy to asf-site
```

cc [~jorisvandenbossche]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14497) [Docs] Use relative internal links in the sphinx docs

2021-10-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14497:
---

 Summary: [Docs] Use relative internal links in the sphinx docs
 Key: ARROW-14497
 URL: https://issues.apache.org/jira/browse/ARROW-14497
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation
Reporter: Krisztian Szucs
 Fix For: 7.0.0


There are a lot of hardcoded urls referencing non-sphinx documentations across 
the generated HTML files, couple of examples:
- https://arrow.apache.org/docs/r/
- https://arrow.apache.org/docs/js/
- https://arrow.apache.org/docs/c_glib/
- https://arrow.apache.org/docs/java/reference/

Using the new versioned docs the 
{{https://arrow.apache.org/docs/5.0/java/index.html}} links should point to 
{{https://arrow.apache.org/docs/5.0/java/reference/}} instead of 
{{https://arrow.apache.org/docs/java/reference/}}

cc [~jorisvandenbossche]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14490) [Doc] Regenerate CHANGELOG.md to include all versions

2021-10-27 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14490:
---

 Summary: [Doc] Regenerate CHANGELOG.md to include all versions
 Key: ARROW-14490
 URL: https://issues.apache.org/jira/browse/ARROW-14490
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Since the move to release branches we haven't been updating the CHANGELOG.md 
file on the main branch so the versions are missing begining from release 3.0.0.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14489) [Rust][CI] Install stable rust toolchain in the integration docker image

2021-10-27 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14489:
---

 Summary: [Rust][CI] Install stable rust toolchain in the 
integration docker image
 Key: ARROW-14489
 URL: https://issues.apache.org/jira/browse/ARROW-14489
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Continuous Integration, Rust
Reporter: Krisztian Szucs
 Fix For: 7.0.0


To enable the downstream rust pull request: 
https://github.com/apache/arrow-rs/pull/591



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14472) [Dev][Archery] Generate contribution statistics using archery

2021-10-26 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14472:
---

 Summary: [Dev][Archery] Generate contribution statistics using 
archery 
 Key: ARROW-14472
 URL: https://issues.apache.org/jira/browse/ARROW-14472
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Archery, Developer Tools
Reporter: Krisztian Szucs


Currently we use a bash script to do that:
https://github.com/apache/arrow/blob/master/dev/release/post-03-website.sh#L47-L67

Since the rust repository split, this logic needs to be extended.
Additionally the scripts expects {{gnu date}} commands which is not available 
on macOS by default.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14468) [Python] Resolve parquet version deprecation warnings when compiling pyarrow

2021-10-26 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14468:
---

 Summary: [Python] Resolve parquet version deprecation warnings 
when compiling pyarrow
 Key: ARROW-14468
 URL: https://issues.apache.org/jira/browse/ARROW-14468
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Krisztian Szucs


{code}
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:
 
In function ‘PyObject* 
__pyx_pf_7pyarrow_8_parquet_12FileMetaData_14format_version___get__(__pyx_obj_7pyarrow_8_parquet_FileMetaData*)’:
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:14168:36:
 
warning: ‘parquet::ParquetVersion::PARQUET_2_0’ is deprecated: use 
PARQUET_2_4 or PARQUET_2_6 for fine-grained feature selection 
[-Wdeprecated-declarations]
14168 | case  parquet::ParquetVersion::PARQUET_2_0:
   |^~~
In file included from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/types.h:30,
  from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/schema.h:32,
  from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/api/schema.h:21,
  from 
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:734:
/tmp/arrow-6.0.0.theE2/install/include/parquet/type_fwd.h:44:5: note: 
declared here
44 | PARQUET_2_0 ARROW_DEPRECATED_ENUM_VALUE("use PARQUET_2_4 or 
PARQUET_2_6 "
   | ^~~
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:14168:36:
 
warning: ‘parquet::ParquetVersion::PARQUET_2_0’ is deprecated: use 
PARQUET_2_4 or PARQUET_2_6 for fine-grained feature selection 
[-Wdeprecated-declarations]
14168 | case  parquet::ParquetVersion::PARQUET_2_0:
   |^~~
In file included from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/types.h:30,
  from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/schema.h:32,
  from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/api/schema.h:21,
  from 
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:734:
/tmp/arrow-6.0.0.theE2/install/include/parquet/type_fwd.h:44:5: note: 
declared here
44 | PARQUET_2_0 ARROW_DEPRECATED_ENUM_VALUE("use PARQUET_2_4 or 
PARQUET_2_6 "
   | ^~~
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:
 
In function ‘std::shared_ptr 
__pyx_f_7pyarrow_8_parquet__create_writer_properties(__pyx_opt_args_7pyarrow_8_parquet__create_writer_properties*)’:
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:23800:62:
 
warning: ‘parquet::ParquetVersion::PARQUET_2_0’ is deprecated: use 
PARQUET_2_4 or PARQUET_2_6 for fine-grained feature selection 
[-Wdeprecated-declarations]
23800 |   (void)(__pyx_v_props.version( 
parquet::ParquetVersion::PARQUET_2_0));
   | 
^~~
In file included from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/types.h:30,
  from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/schema.h:32,
  from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/api/schema.h:21,
  from 
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:734:
/tmp/arrow-6.0.0.theE2/install/include/parquet/type_fwd.h:44:5: note: 
declared here
44 | PARQUET_2_0 ARROW_DEPRECATED_ENUM_VALUE("use PARQUET_2_4 or 
PARQUET_2_6 "
   | ^~~
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:23800:62:
 
warning: ‘parquet::ParquetVersion::PARQUET_2_0’ is deprecated: use 
PARQUET_2_4 or PARQUET_2_6 for fine-grained feature selection 
[-Wdeprecated-declarations]
23800 |   (void)(__pyx_v_props.version( 
parquet::ParquetVersion::PARQUET_2_0));
   | 
^~~
In file included from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/types.h:30,
  from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/schema.h:32,
  from 
/tmp/arrow-6.0.0.theE2/install/include/parquet/api/schema.h:21,
  from 
/tmp/arrow-6.0.0.theE2/apache-arrow-6.0.0/python/build/temp.linux-x86_64-3.8/_parquet.cpp:734:
/tmp/arrow-6.0.0.theE2/install/include/parquet/type_fwd.h:44:5: note: 
declared here
44 | PARQUET_2_0 ARROW_DEPRECATED_ENUM_VALUE("use PARQUET_2_4 or 
PARQUET_2_6 "
   | ^~~
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14438) [CI] Don't cancel build on the main branch

2021-10-22 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14438:
---

 Summary: [CI] Don't cancel build on the main branch
 Key: ARROW-14438
 URL: https://issues.apache.org/jira/browse/ARROW-14438
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Continuous Integration
Reporter: Krisztian Szucs
 Fix For: 7.0.0


When listing the commits from the master branch I often see a bunch of failing 
commits which are actually cancelled due to concurrency groups: 
https://github.com/apache/arrow/blob/master/.github/workflows/dev.yml#L26

While we should keep this feature for the pull requests we should disable it 
for branches.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14437) [Python] CSV test_cancellation unittests fail on Apple M1

2021-10-22 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14437:
---

 Summary: [Python] CSV test_cancellation unittests fail on Apple M1
 Key: ARROW-14437
 URL: https://issues.apache.org/jira/browse/ARROW-14437
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Perhaps M1 is too quick :)

Most noticable when running the release verification tasks: 
https://github.com/apache/arrow/pull/11511

Failing builds:
- 
https://github.com/ursacomputing/crossbow/runs/3969076907?check_suite_focus=true
- 
https://github.com/ursacomputing/crossbow/runs/3974036108?check_suite_focus=true#step:5:2014




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14436) [C++] Disable color diagnostics when compiling with ccache

2021-10-22 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14436:
---

 Summary: [C++] Disable color diagnostics when compiling with ccache
 Key: ARROW-14436
 URL: https://issues.apache.org/jira/browse/ARROW-14436
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Copied from https://github.com/apache/arrow/issues/11279

Steps to reproduce:

Compile arrow_objlib with ccache, clang and CCACHE_DEBUG=1 
CCACHE_LOGFILE=./ccache.log
Find in ./ccache.log:
Failed; falling back to running the real compiler
Result: unsupported compiler option
Dropping -fcolor-diagnostics fixes the issue.

I suggest either opting into color diagnostics with WITH_COLOR_DIAGNOSTICS or 
adding a way to disable it via DISABLE_COLOR_DIAGNOSTICS.
It would be good if this wouldn't be tied to ARROW_USE_CCACHE since its also 
relevant for:
-DARROW_USE_CCACHE=OFF -DCMAKE_CXX_COMPILER_LAUNCHER=emscripten_ccache.

I can open a PR if you tell me which way you prefer.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14435) [Release] Update verification scripts to check python 3.10 wheels

2021-10-22 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14435:
---

 Summary: [Release] Update verification scripts to check python 
3.10 wheels
 Key: ARROW-14435
 URL: https://issues.apache.org/jira/browse/ARROW-14435
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Developer Tools
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 7.0.0


Python 3.10 should be available from conda now, so the verification scripts can 
check the new python 3.10 wheels.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14424) [Packaging][Python] Disable windows wheel testing for python 3.6

2021-10-21 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14424:
---

 Summary: [Packaging][Python] Disable windows wheel testing for 
python 3.6
 Key: ARROW-14424
 URL: https://issues.apache.org/jira/browse/ARROW-14424
 Project: Apache Arrow
  Issue Type: Bug
  Components: Packaging, Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0


Two layers of the official python 3.6 windows image are not available for 
download.
Docker pull returns with unexpected status resolving reader: 403 Forbidden.

While this is a transient error, it blocks the release process.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14423) [Python] Fix version constraints in pyproject.toml

2021-10-21 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14423:
---

 Summary: [Python] Fix version constraints in pyproject.toml
 Key: ARROW-14423
 URL: https://issues.apache.org/jira/browse/ARROW-14423
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0


Causes build error during packaging 
https://github.com/ursacomputing/crossbow/runs/3967169617?check_suite_focus=true#step:7:2185



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14411) [Release][Integration] Go integration tests fail for 6.0.0-RC1

2021-10-21 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14411:
---

 Summary: [Release][Integration] Go integration tests fail for 
6.0.0-RC1
 Key: ARROW-14411
 URL: https://issues.apache.org/jira/browse/ARROW-14411
 Project: Apache Arrow
  Issue Type: Bug
  Components: Integration
Reporter: Krisztian Szucs


Only on linux interestingly: 
https://github.com/apache/arrow/pull/11487#issuecomment-947798453

Here is the build log 
https://github.com/ursacomputing/crossbow/runs/3955744317?check_suite_focus=true#step:6:55443

I wonder whether it was introduced with 
https://github.com/apache/arrow/commit/41529c76fe80d1fe8e60b52c0da3669c901a45bb

The integration tests on the master branch are passing, so this migh be just a 
verification task issue.

cc [~zeroshade]





--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14410) [Python][Packaging] Use numpy 1.21.3 to build python 3.10 wheels for macOS and windows

2021-10-21 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14410:
---

 Summary: [Python][Packaging] Use numpy 1.21.3 to build python 3.10 
wheels for macOS and windows
 Key: ARROW-14410
 URL: https://issues.apache.org/jira/browse/ARROW-14410
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Packaging, Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Numpy has just released new wheels for python 3.10 which we can now use to 
build wheels on macOS and windows.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14409) [Packaging][Python] Update the manylinux platform tags

2021-10-21 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14409:
---

 Summary: [Packaging][Python] Update the manylinux platform tags
 Key: ARROW-14409
 URL: https://issues.apache.org/jira/browse/ARROW-14409
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Packaging, Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Newer versions {{wheel}} produces filenames with future-proof platform tags: 
{{manylinux_2_17_x86_64.manylinux2014_x86_64.whl}} instead of the previous 
{{manylinux2014_x86_64.whl}}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14408) [Packaging][Crossbow] Option for skipping artifact pattern validation

2021-10-21 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14408:
---

 Summary: [Packaging][Crossbow] Option for skipping artifact 
pattern validation
 Key: ARROW-14408
 URL: https://issues.apache.org/jira/browse/ARROW-14408
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Packaging
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 7.0.0


In certain cases we may want to skip artifact pattern validation to still 
download the produced artifacts despite that their names are slightly different 
from the expected patterns.

For example the manylinux platform tags have changed with the more recent wheel 
library and we only noticed it after a successful packaging build for the 
release.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14398) [CI] Don't build doxygen docs in all of the conda builds

2021-10-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14398:
---

 Summary: [CI] Don't build doxygen docs in all of the conda builds
 Key: ARROW-14398
 URL: https://issues.apache.org/jira/browse/ARROW-14398
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Continuous Integration
Reporter: Krisztian Szucs
 Fix For: 7.0.0


We reuse the yml anchor to define the command for the conda docker builds: 
https://github.com/apache/arrow/blob/master/docker-compose.yml#L240

The {{true}} argument instruments the script to build the documentation. We 
should only enable it in the conda-cpp build which is exercised on all commits 
and disable for the rest of the builds.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14397) [C++] Fix valgrind error in test utility

2021-10-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14397:
---

 Summary: [C++] Fix valgrind error in test utility 
 Key: ARROW-14397
 URL: https://issues.apache.org/jira/browse/ARROW-14397
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Reporter: Krisztian Szucs


See the latest nightly build error 
https://dev.azure.com/ursacomputing/crossbow/_build/results?buildId=14046=logs=0da5d1d9-276d-5173-c4c4-9d4d4ed14fdb=d9b15392-e4ce-5e4c-0c8c-b69645229181=3469



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14393) [C++] GTest linking errors during the source release verification

2021-10-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14393:
---

 Summary: [C++] GTest linking errors during the source release 
verification
 Key: ARROW-14393
 URL: https://issues.apache.org/jira/browse/ARROW-14393
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Reporter: Krisztian Szucs
 Fix For: 6.0.0


https://github.com/ursacomputing/crossbow/runs/3949371326?check_suite_focus=true#step:6:1161



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14392) [C++] Bundled gRPC misses bundled Abseil include path

2021-10-20 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14392:
---

 Summary: [C++] Bundled gRPC misses bundled Abseil include path
 Key: ARROW-14392
 URL: https://issues.apache.org/jira/browse/ARROW-14392
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Reporter: Krisztian Szucs
 Fix For: 6.0.0


{code}
CMake Error in src/arrow/flight/CMakeLists.txt:
  Imported target "gRPC::grpc++" includes non-existent path


"/tmp/arrow-6.0.0.v1qFD/apache-arrow-6.0.0/cpp/build/absl_ep-install/include"

  in its INTERFACE_INCLUDE_DIRECTORIES.  Possible reasons include:

  * The path was deleted, renamed, or moved to another location.

  * An install or uninstall procedure did not complete successfully.

  * The installation package was faulty and references files it does not
  provide.



CMake Error in src/arrow/flight/CMakeLists.txt:
  Imported target "gRPC::grpc++" includes non-existent path


"/tmp/arrow-6.0.0.v1qFD/apache-arrow-6.0.0/cpp/build/absl_ep-install/include"

  in its INTERFACE_INCLUDE_DIRECTORIES.  Possible reasons include:

  * The path was deleted, renamed, or moved to another location.

  * An install or uninstall procedure did not complete successfully.

  * The installation package was faulty and references files it does not
  provide.
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14388) [Python] Add unittests for converter arrays with pandas masks

2021-10-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14388:
---

 Summary: [Python] Add unittests for converter arrays with pandas 
masks
 Key: ARROW-14388
 URL: https://issues.apache.org/jira/browse/ARROW-14388
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Cover the changes in https://github.com/apache/arrow/pull/11465

cc [~amol-]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14381) [CI] Spark integration failures

2021-10-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14381:
---

 Summary: [CI] Spark integration failures
 Key: ARROW-14381
 URL: https://issues.apache.org/jira/browse/ARROW-14381
 Project: Apache Arrow
  Issue Type: Bug
  Components: Continuous Integration
Reporter: Krisztian Szucs
 Fix For: 6.0.0


Both spark-master and spark-3.0 nightly builds are failing:

master: https://github.com/ursacomputing/crossbow/runs/3938861610#step:7:9237
branch-3.0: 
https://github.com/ursacomputing/crossbow/runs/3938887794#step:7:8917

We should also test against branch-3.2

cc [~bryanc]




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14377) [Packaging][Python] Python 3.9 installation fails in macOS wheel build

2021-10-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14377:
---

 Summary: [Packaging][Python] Python 3.9 installation fails in 
macOS wheel build
 Key: ARROW-14377
 URL: https://issues.apache.org/jira/browse/ARROW-14377
 Project: Apache Arrow
  Issue Type: Bug
  Components: Packaging, Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0


Due to a trailing comma in the script 
https://github.com/ursacomputing/crossbow/runs/3938860251#step:8:19



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14373) [Packaging][Java] Missing LLVM dependency in the macOS java-jars build

2021-10-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14373:
---

 Summary: [Packaging][Java] Missing LLVM dependency in the macOS 
java-jars build
 Key: ARROW-14373
 URL: https://issues.apache.org/jira/browse/ARROW-14373
 Project: Apache Arrow
  Issue Type: Bug
  Components: Java, Packaging
Reporter: Krisztian Szucs
 Fix For: 7.0.0






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14372) [CI][C++][Python] Exercise builds on GCC 4.8

2021-10-19 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14372:
---

 Summary: [CI][C++][Python] Exercise builds on GCC 4.8 
 Key: ARROW-14372
 URL: https://issues.apache.org/jira/browse/ARROW-14372
 Project: Apache Arrow
  Issue Type: New Feature
  Components: C++, Continuous Integration, Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


Add a build to {{.github/workflows/python.yml}} to avoid issues like 
https://issues.apache.org/jira/browse/ARROW-14369

We may extend our docker-compose configuration to include CentOS 7/8 for 
testing C++ and Python.

cc @kou 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14364) [CI][C++] Support LLVM 13

2021-10-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14364:
---

 Summary: [CI][C++] Support LLVM 13
 Key: ARROW-14364
 URL: https://issues.apache.org/jira/browse/ARROW-14364
 Project: Apache Arrow
  Issue Type: New Feature
  Components: C++, Continuous Integration
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0


Major platforms have started to provide LLVM 13 packages which causes multiple 
build errors.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14363) [C++][Gandiva] LLVM 13 has deprecated CreateGEP and CreateLoad methods without explicit element type

2021-10-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14363:
---

 Summary: [C++][Gandiva] LLVM 13 has deprecated CreateGEP and 
CreateLoad methods without explicit element type
 Key: ARROW-14363
 URL: https://issues.apache.org/jira/browse/ARROW-14363
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++, C++ - Gandiva
Reporter: Krisztian Szucs






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14361) [C++] Define a MAX default value for ARROW_SIMD_LEVEL

2021-10-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14361:
---

 Summary: [C++] Define a MAX default value for ARROW_SIMD_LEVEL
 Key: ARROW-14361
 URL: https://issues.apache.org/jira/browse/ARROW-14361
 Project: Apache Arrow
  Issue Type: New Feature
  Components: C++
Reporter: Krisztian Szucs
 Fix For: 7.0.0


In order to enable {{ARROW_HAVE_NEON}} CMake flag on ARM architectures 
{{ARROW_SIMD_LEVEL}} option must be set to not {{"NONE"}}, see 
https://github.com/apache/arrow/blob/master/cpp/cmake_modules/SetupCxxFlags.cmake#L444

The default value for {{ARROW_SIMD_LEVEL}} is {{SSE4_2}} which is a bit 
misleading on ARM64, it should rather be {{NEON}} which is not listed as a 
valid option for {{ARROW_SIMD_LEVEL}}. We may have a {{"MAX"}} default value 
similarly to the  {{ARROW_RUNTIME_SIMD_LEVEL}} option, see 
https://github.com/apache/arrow/blob/master/cpp/cmake_modules/DefineOptions.cmake#L115

Original github comment: 
https://github.com/apache/arrow/pull/11433#discussion_r729852835

cc [~yibocai] [~apitrou] 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14343) [Packaging][Python] Enable NEON SIMD optimization for M1 wheels

2021-10-15 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14343:
---

 Summary: [Packaging][Python] Enable NEON SIMD optimization for M1 
wheels
 Key: ARROW-14343
 URL: https://issues.apache.org/jira/browse/ARROW-14343
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Packaging, Python
Reporter: Krisztian Szucs
 Fix For: 6.0.0






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14312) [Python] Integer conversion failures with python 3.10

2021-10-13 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14312:
---

 Summary: [Python] Integer conversion failures with python 3.10
 Key: ARROW-14312
 URL: https://issues.apache.org/jira/browse/ARROW-14312
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Krisztian Szucs


We have conversion issues during testing the python wheels for 3.10:
https://github.com/ursacomputing/crossbow/runs/3882292730?check_suite_focus=true#step:8:658

Some of the failures should be related to the removed {{__int__}} method:
https://docs.python.org/3/whatsnew/3.10.html#removed

cc [~apitrou]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14276) [Packaging] Dependency resolution issues in the nightly conda builds

2021-10-11 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14276:
---

 Summary: [Packaging] Dependency resolution issues in the nightly 
conda builds
 Key: ARROW-14276
 URL: https://issues.apache.org/jira/browse/ARROW-14276
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Packaging
Reporter: Krisztian Szucs
 Fix For: 6.0.0


The majority of the conda nightly builds are failing due to dependency 
resolution problems:

{code}
- conda-linux-gcc-py37-arm64:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py37-arm64
- conda-linux-gcc-py37-cpu-r41:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py37-cpu-r41
- conda-linux-gcc-py37-cuda:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py37-cuda
- conda-linux-gcc-py38-arm64:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py38-arm64
- conda-linux-gcc-py38-cpu:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py38-cpu
- conda-linux-gcc-py38-cuda:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py38-cuda
- conda-linux-gcc-py39-arm64:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py39-arm64
- conda-linux-gcc-py39-cpu:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py39-cpu
- conda-linux-gcc-py39-cuda:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-linux-gcc-py39-cuda
- conda-win-vs2017-py36-r40:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-win-vs2017-py36-r40
- conda-win-vs2017-py38:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-win-vs2017-py38
- conda-win-vs2017-py39:
  URL: 
https://github.com/ursacomputing/crossbow/branches/all?query=nightly-2021-10-11-0-azure-conda-win-vs2017-py39
{code}

I assume that we need to sync the recipes again with up to date pin files. 

cc @uwe



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-14217) [Python][CI] Add support for python 3.10

2021-10-05 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14217:
---

 Summary: [Python][CI] Add support for python 3.10 
 Key: ARROW-14217
 URL: https://issues.apache.org/jira/browse/ARROW-14217
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Continuous Integration, Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0


Python 3.10 has just been released, so exercise builds and ship packages for it.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-13921) [Python][Packaging] Pin minimum setuptools version for the macos wheels

2021-09-07 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-13921:
---

 Summary: [Python][Packaging] Pin minimum setuptools version for 
the macos wheels
 Key: ARROW-13921
 URL: https://issues.apache.org/jira/browse/ARROW-13921
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Packaging, Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0


There was a bug in setuptools which caused the recent nightly failures: 
https://github.com/ursacomputing/crossbow/runs/3521607291#step:10:269



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-13914) [C++][Python] Optimize type inference when converting from python values

2021-09-06 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-13914:
---

 Summary: [C++][Python] Optimize type inference when converting 
from python values
 Key: ARROW-13914
 URL: https://issues.apache.org/jira/browse/ARROW-13914
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Krisztian Szucs


Currently we use an extensive set of checks to infer arrow type from python 
sequences. 

Last time I checked using asv, the inference part had a significant overhead. 

We could try other approaches to speed-up the type inference, see comments: 
https://github.com/apache/arrow/pull/11076#discussion_r702808196



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-13635) [Packaging][Python] Define --with-lg-page for jemalloc in the arm manylinux builds

2021-08-16 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-13635:
---

 Summary: [Packaging][Python] Define --with-lg-page for jemalloc in 
the arm manylinux builds
 Key: ARROW-13635
 URL: https://issues.apache.org/jira/browse/ARROW-13635
 Project: Apache Arrow
  Issue Type: Task
  Components: Packaging, Python
Reporter: Krisztian Szucs
 Fix For: 6.0.0


Follow-up ticket for https://github.com/apache/arrow/issues/10929



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-13557) [Packaging][Python] Skip test_cancellation test case on M1

2021-08-04 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-13557:
---

 Summary: [Packaging][Python] Skip test_cancellation test case on M1
 Key: ARROW-13557
 URL: https://issues.apache.org/jira/browse/ARROW-13557
 Project: Apache Arrow
  Issue Type: Task
  Components: Packaging, Python
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0


The nightly wheel packaging builds have started to fail: 
https://github.com/ursacomputing/crossbow/runs/3238359543



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-13483) [Release][Dev] Port the release note generation script to archery

2021-07-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-13483:
---

 Summary: [Release][Dev] Port the release note generation script to 
archery
 Key: ARROW-13483
 URL: https://issues.apache.org/jira/browse/ARROW-13483
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Developer Tools
Reporter: Krisztian Szucs
 Fix For: 6.0.0


Archery already have a couple of utilities to parse commits between revisions 
and access various metadata from git. Implementing it python would make it more 
portable (e.g. {{date}} function is different from {{GNU date}} on macOS).





--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-13478) [Release] Unnecessary rc-number argument for the version bumping post-release script

2021-07-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-13478:
---

 Summary: [Release] Unnecessary rc-number argument for the version 
bumping post-release script
 Key: ARROW-13478
 URL: https://issues.apache.org/jira/browse/ARROW-13478
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Developer Tools
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-13477) [Release] Pass ARTIFACTORY_API_KEY to the upload script

2021-07-28 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-13477:
---

 Summary: [Release] Pass ARTIFACTORY_API_KEY to the upload script
 Key: ARROW-13477
 URL: https://issues.apache.org/jira/browse/ARROW-13477
 Project: Apache Arrow
  Issue Type: Bug
  Components: Developer Tools
Reporter: Krisztian Szucs
Assignee: Krisztian Szucs
 Fix For: 6.0.0






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


  1   2   3   4   5   6   7   8   9   10   >