[jira] [Created] (ARROW-18237) Add support for mutable version of Table

2022-11-03 Thread Larry White (Jira)
Larry White created ARROW-18237:
---

 Summary: Add support for mutable version of Table
 Key: ARROW-18237
 URL: https://issues.apache.org/jira/browse/ARROW-18237
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Reporter: Larry White






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-18179) [Java] Null pointer closing unused JDBC Consumers

2022-10-27 Thread Larry White (Jira)
Larry White created ARROW-18179:
---

 Summary: [Java] Null pointer closing unused JDBC Consumers
 Key: ARROW-18179
 URL: https://issues.apache.org/jira/browse/ARROW-18179
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 10.0.0
Reporter: Larry White


It’s legal to create a consumer that you never use, and then on close() vector 
will be null.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-18178) [Java] ArrowVectorIterator incorrectly closes Vectors

2022-10-27 Thread Larry White (Jira)
Larry White created ARROW-18178:
---

 Summary: [Java] ArrowVectorIterator incorrectly closes Vectors 
 Key: ARROW-18178
 URL: https://issues.apache.org/jira/browse/ARROW-18178
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 10.0.0
Reporter: Larry White


If you’re using the iterator with reuseVectorSchemaRoot set to false, the first 
n-1 VSRs returned by a call to next() have their resources managed by the 
client code, but the last one gets closed when the iterator is closed. This is 
inconsistent and contradicts the  javadoc for next:
* If \{@link JdbcToArrowConfig#isReuseVectorSchemaRoot()} is false,
* the client is responsible for freeing its resources.
The iterator calls close on its CompositeJDBCConsumer instance, which in turn 
closes the vectors held by each consumer. 

{{/**}}
{{ * Clean up resources.}}
{{ */}}
{{@Override}}
{{public void close() {}}
{{  if (config.isReuseVectorSchemaRoot()) {}}
{{  nextBatch.close();}}
{{  }}}
{{  compositeConsumer.close();}}
{{}}}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-18168) [Java] implement index filtering for Table

2022-10-26 Thread Larry White (Jira)
Larry White created ARROW-18168:
---

 Summary: [Java] implement index filtering for Table
 Key: ARROW-18168
 URL: https://issues.apache.org/jira/browse/ARROW-18168
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Reporter: Larry White


Provide tables with the ability to return a subset of their data filtered 
according to a set of row-indexes. The row-index set can be created by 
arbitrary filtering algorithms. The result will be a new Table object



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-18044) [Java] upgrade error-prone library to 2.16.0

2022-10-13 Thread Larry White (Jira)
Larry White created ARROW-18044:
---

 Summary: [Java] upgrade error-prone library to 2.16.0
 Key: ARROW-18044
 URL: https://issues.apache.org/jira/browse/ARROW-18044
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Reporter: Larry White


Current version of errorprone interacts badly with Intellij, leading to 
erroneous (ironically ) reporting of an error for using "non-standard ascii 
characters".

 

This causes intermittent but frequent failures of arbitrary tests and is thus 
crazy-making. 

See Errorprone issue https://github.com/google/error-prone/issues/3092



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-18014) Add copy functions to Table

2022-10-12 Thread Larry White (Jira)
Larry White created ARROW-18014:
---

 Summary: Add copy functions to Table
 Key: ARROW-18014
 URL: https://issues.apache.org/jira/browse/ARROW-18014
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 10.0.0
Reporter: Larry White


Add copyVector() and copy() (to copy the whole Table) to the Table interface



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17962) [Java] Bug in test causes CI failures

2022-10-07 Thread Larry White (Jira)
Larry White created ARROW-17962:
---

 Summary: [Java] Bug in test causes CI failures
 Key: ARROW-17962
 URL: https://issues.apache.org/jira/browse/ARROW-17962
 Project: Apache Arrow
  Issue Type: Improvement
Reporter: Larry White


In RoundTripTest in the c module, a schema is released incorrectly causing an 
exception: 

 
 
Error:  org.apache.arrow.c.RoundtripTest.testTable  Time elapsed: 0.031 s  
<<< ERROR!
java.lang.IllegalStateException: Cannot import released ArrowSchema
at 
org.apache.arrow.util.Preconditions.checkState(Preconditions.java:458)
at org.apache.arrow.c.SchemaImporter.importField(SchemaImporter.java:63)
at org.apache.arrow.c.SchemaImporter.importField(SchemaImporter.java:56)
at org.apache.arrow.c.Data.importField(Data.java:246)
at org.apache.arrow.c.Data.importSchema(Data.java:266)
at org.apache.arrow.c.Data.importVectorSchemaRoot(Data.java:377)
at 
org.apache.arrow.c.RoundtripTest.testTable(RoundtripTest.java:683)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17955) Add Arrow user documentation for Table

2022-10-06 Thread Larry White (Jira)
Larry White created ARROW-17955:
---

 Summary: Add Arrow user documentation for Table
 Key: ARROW-17955
 URL: https://issues.apache.org/jira/browse/ARROW-17955
 Project: Apache Arrow
  Issue Type: Improvement
Reporter: Larry White
Assignee: Larry White






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17883) [Java] Implement an immutable table object

2022-09-28 Thread Larry White (Jira)
Larry White created ARROW-17883:
---

 Summary: [Java] Implement an immutable table object
 Key: ARROW-17883
 URL: https://issues.apache.org/jira/browse/ARROW-17883
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 10.0.0
Reporter: Larry White


Implement an immutable Table object without the batch semantics provided by 
VectorSchemaRoot. 

See original design document/discussion here: 
https://docs.google.com/document/d/1J77irZFWNnSID7vK71z26Nw_Pi99I9Hb9iryno8B03c/edit?usp=sharing

Note that this ticket covers only the immutable Table implementation. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17833) [Java][Doc] Add missing information to docs on dictionary encoding

2022-09-23 Thread Larry White (Jira)
Larry White created ARROW-17833:
---

 Summary: [Java][Doc] Add missing information to docs on dictionary 
encoding
 Key: ARROW-17833
 URL: https://issues.apache.org/jira/browse/ARROW-17833
 Project: Apache Arrow
  Issue Type: Improvement
Reporter: Larry White
Assignee: Larry White


The documentation (see: vector.rst) fails to say what value to use for the 
DictionaryEncoding id, and why. This should be included. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17819) [Doc] Remove dependency on PyArrow from rst build process

2022-09-22 Thread Larry White (Jira)
Larry White created ARROW-17819:
---

 Summary: [Doc] Remove dependency on PyArrow from rst build process
 Key: ARROW-17819
 URL: https://issues.apache.org/jira/browse/ARROW-17819
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation
Reporter: Larry White


The process for building the tutorial documentation (see: 
apache/arrow/docs/source/developers/documentation.rst) identifies PyArrow as a 
requirement

Ideally, it would not be necessary to install PyArrow to modify text 
documentation for Java or any other language. 

FWIW, On my machine, the PyArrow install has been running for over 20 minutes 
with repeated failure messages.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17811) [Doc][Java] Document how dictionary encoding works

2022-09-21 Thread Larry White (Jira)
Larry White created ARROW-17811:
---

 Summary: [Doc][Java] Document how dictionary encoding works
 Key: ARROW-17811
 URL: https://issues.apache.org/jira/browse/ARROW-17811
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation, Java
Affects Versions: 9.0.0
Reporter: Larry White


The ValueVector documentation does not include any discussion of dictionary 
encoding. There is example code on the IPC page 
https://arrow.apache.org/docs/dev/java/ipc.html, but it doesn't provide an 
overview. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17801) [Doc][Java] Fix typos in slice page in Cookbook

2022-09-21 Thread Larry White (Jira)
Larry White created ARROW-17801:
---

 Summary: [Doc][Java] Fix typos in slice page in Cookbook 
 Key: ARROW-17801
 URL: https://issues.apache.org/jira/browse/ARROW-17801
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 9.0.0
Reporter: Larry White
Assignee: Larry White


The slice instructions say "splice" in a couple of places. 

Check for other typos 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17585) [Java] Extend types supported by GenerateSampleData

2022-08-31 Thread Larry White (Jira)
Larry White created ARROW-17585:
---

 Summary: [Java] Extend types supported by GenerateSampleData
 Key: ARROW-17585
 URL: https://issues.apache.org/jira/browse/ARROW-17585
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Reporter: Larry White


org.apache.arrow.vector.GenerateSampleTypes does not support the Uint vector 
types.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17570) [Java][Documentation] Add JavaDoc to TransferPair interface

2022-08-30 Thread Larry White (Jira)
Larry White created ARROW-17570:
---

 Summary: [Java][Documentation] Add JavaDoc to TransferPair 
interface
 Key: ARROW-17570
 URL: https://issues.apache.org/jira/browse/ARROW-17570
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation, Java
Reporter: Larry White


The TransferPair interface is important to the Java vector module's memory 
management, but there is only a single line class comment, and no comments for 
the methods. The implementations of those methods have subtitles that are 
hidden in the method names. For example, the method transferTo() clears the 
memory in the original vector and resets the rowCount to 0, but 
splitAndTransferTo() only copies the values into new memory and the original is 
unchanged.  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17530) [Java] VectorSchemaRoot#addVector() cannot add a vector to the end of the current vector collection

2022-08-25 Thread Larry White (Jira)
Larry White created ARROW-17530:
---

 Summary: [Java] VectorSchemaRoot#addVector() cannot add a vector 
to the end of the current vector collection
 Key: ARROW-17530
 URL: https://issues.apache.org/jira/browse/ARROW-17530
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 9.0.0, 9.0.1
Reporter: Larry White
Assignee: Larry White


The current implementation of Java VectorSchemaRoot cannot add a vector at the 
end of the current list (which is the generally understood meaning of "add").

The Precondition check in the method's second line prevents providing an 
appropriate index for adding at the end:

 
{code:java}
public VectorSchemaRoot addVector(int index, FieldVector vector) {
  Preconditions.checkNotNull(vector);
  Preconditions.checkArgument(index >= 0 && index < fieldVectors.size());
  List newVectors = new ArrayList<>();
  for (int i = 0; i < fieldVectors.size(); i++) {
if (i == index) {
  newVectors.add(vector);
}
newVectors.add(fieldVectors.get(i));
  }
  return new VectorSchemaRoot(newVectors);
}
 {code}
 

 

One possible implementation resolving the issue is shown below.

 
{code:java}
public VectorSchemaRoot addVector(int index, FieldVector vector) {
  Preconditions.checkNotNull(vector);
  Preconditions.checkArgument(index >= 0 && index <= fieldVectors.size());
  List newVectors = new ArrayList<>();
  if (index == fieldVectors.size()) {
newVectors.addAll(fieldVectors);
newVectors.add(vector); 
  } else {
for (int i = 0; i < fieldVectors.size(); i++) {
  if (i == index) {
newVectors.add(vector);
  }
  newVectors.add(fieldVectors.get(i));
}
  }
  return new VectorSchemaRoot(newVectors);
}
{code}
 

 

 

 

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17480) [Java] add setNull() to ValueVector interface

2022-08-19 Thread Larry White (Jira)
Larry White created ARROW-17480:
---

 Summary: [Java] add setNull() to ValueVector interface
 Key: ARROW-17480
 URL: https://issues.apache.org/jira/browse/ARROW-17480
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 9.0.0
Reporter: Larry White


Only three ValueVector types do not implement setNull(): UnionVector, 
DenseUnionVector and NullVector, preventing it from being addable to the 
ValueVector interface and thus requiring a cast when used. 

For NullVector, it could be added as a no-op. For the other two, it could be 
added by having it write a null to a child instead of setting a null in the 
parent. Note that this is similar to the implementation in UnionListWriter. 

See also a similar issue opened for FieldWriter (ARROW-2538), which may be 
closable given the current code base. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17346) Document the use of the batchSize argument in Dataset ScanOptions

2022-08-08 Thread Larry White (Jira)
Larry White created ARROW-17346:
---

 Summary: Document the use of the batchSize argument in Dataset 
ScanOptions
 Key: ARROW-17346
 URL: https://issues.apache.org/jira/browse/ARROW-17346
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 9.0.0
Reporter: Larry White
Assignee: Larry White


Several ScanOptions methods take a batchSize argument as shown: 

{{public ScanOptions(long batchSize) {}}
{{    this(batchSize, Optional.empty());}}
{{}}}

Since the scanner reads one ArrowRecordBatch per load invocation, setting the 
parameter to a size larger than the RecordBatch has no effect. It only works 
when it's smaller than the number of rows in the RecordBatch, (i.e., the number 
or records read is equal to min(batchSize, recordBatch rowCount), potentially 
leading to some confusion. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-16992) Separate JNI compilation & linking from main arrow CMake

2022-07-06 Thread Larry White (Jira)
Larry White created ARROW-16992:
---

 Summary: Separate JNI compilation & linking from main arrow CMake
 Key: ARROW-16992
 URL: https://issues.apache.org/jira/browse/ARROW-16992
 Project: Apache Arrow
  Issue Type: Improvement
Reporter: Larry White






--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-16941) [Java] Consolidate Dataset code

2022-06-30 Thread Larry White (Jira)
Larry White created ARROW-16941:
---

 Summary: [Java] Consolidate Dataset code
 Key: ARROW-16941
 URL: https://issues.apache.org/jira/browse/ARROW-16941
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Java
Reporter: Larry White
Assignee: Larry White
 Fix For: 9.0.0


Move the Dataset JNI C code from the Arrow cpp subproject to the Arrow Java 
subproject



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-16851) [CI][SPARK] Nightlies test "test-conda-python-3.9-spark-master" failing

2022-06-17 Thread Larry White (Jira)
Larry White created ARROW-16851:
---

 Summary: [CI][SPARK] Nightlies test 
"test-conda-python-3.9-spark-master" failing  
 Key: ARROW-16851
 URL: https://issues.apache.org/jira/browse/ARROW-16851
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Affects Versions: 9.0.0
Reporter: Larry White


test-conda-python-3.9-spark-master failing commit 
[d89c0b4|https://github.com/apache/arrow/commit/d89c0b4aadecd09dbeb8051688f347c9d0af89ad]

{{Error:  Failed to execute goal 
net.alchim31.maven:scala-maven-plugin:4.6.2:compile (scala-compile-first) on 
project spark-tags_2.12: Execution scala-compile-first of goal 
net.alchim31.maven:scala-maven-plugin:4.6.2:compile failed: rt.jar (class 
sbt.internal.inc.DummyVirtualFile) is not supported -> [Help 1]}}

- Error message reported at: 
https://github.com/ursacomputing/crossbow/runs/6929559148?check_suite_focus=true#step:5:11193

See also: 
[https://bytemeta.vip/repo/davidB/scala-maven-plugin/issues/615#google_vignette]
 for a discussion of similar/same? issue involving Spark and vip.

[~LuciferYang] Is this something you can comment on? Thanks very much. 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16571) update git to skip binaries generated by Java native compilation

2022-05-13 Thread Larry White (Jira)
Larry White created ARROW-16571:
---

 Summary: update git to skip binaries generated by Java native 
compilation
 Key: ARROW-16571
 URL: https://issues.apache.org/jira/browse/ARROW-16571
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 8.0.0
Reporter: Larry White


running the CMake and Maven jobs to build and link Arrow Java with its native 
dependencies  creates more than 15,000 files in the arrow folder that are 
automatically added by git. 

 

The proposed change would modify the .gitignore to include the three folders, 
and run 

{{git rm --cached java-dist}}

{{git rm --cached java-native-c}}

{{git rm --cached java-native-cpp}}

to remove the folders from the repo 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16534) Update gandiva protobuf library version to support M1

2022-05-11 Thread Larry White (Jira)
Larry White created ARROW-16534:
---

 Summary: Update gandiva protobuf library version to support M1
 Key: ARROW-16534
 URL: https://issues.apache.org/jira/browse/ARROW-16534
 Project: Apache Arrow
  Issue Type: Bug
  Components: Java
Affects Versions: 8.0.0, 9.0.0
 Environment: macOS, M1
Reporter: Larry White


Gandiva needs to generate Protobuf Java sources from the definitions, and this 
relies on a JAR that has the native Protobuf compiler embedded in it - but the 
current package doesn't have an ARMv8 build available.  protobuf-java version 
3.20.1 does have M1 support.
 
This means that building from source as documented 
(https://arrow.apache.org/docs/developers/java/building.html) cannot be done on 
M1 as the following exception occurs:
 

[INFO] BUILD FAILURE
[INFO] 
[INFO] Total time:  03:38 min
[INFO] Finished at: 2022-05-10T16:19:24-04:00
[INFO] 
[ERROR] Failed to execute goal 
org.xolstice.maven.plugins:protobuf-maven-plugin:0.6.1:compile (default) on 
project arrow-gandiva: Unable to resolve artifact: Missing:
[ERROR] --
[ERROR] 1) com.google.protobuf:protoc:exe:osx-aarch_64:2.5.0
[ERROR]
[ERROR]   Try downloading the file manually from the project website.
[ERROR]
[ERROR]   Then, install it using the command:
[ERROR]   mvn install:install-file -DgroupId=com.google.protobuf 
-DartifactId=protoc -Dversion=2.5.0 -Dclassifier=osx-aarch_64 -Dpackaging=exe 
-Dfile=/path/to/file
[ERROR]
[ERROR]   Alternatively, if you host your own repository you can deploy the 
file there:
[ERROR]   mvn deploy:deploy-file -DgroupId=com.google.protobuf 
-DartifactId=protoc -Dversion=2.5.0 -Dclassifier=osx-aarch_64 -Dpackaging=exe 
-Dfile=/path/to/file -Durl=[url] -DrepositoryId=[id]
[ERROR]
[ERROR]   Path to dependency:
[ERROR] 1) org.apache.arrow.gandiva:arrow-gandiva:jar:9.0.0-SNAPSHOT
[ERROR] 2) com.google.protobuf:protoc:exe:osx-aarch_64:2.5.0
[ERROR]
[ERROR] --
[ERROR] 1 required artifact is missing.
[ERROR]
[ERROR] for artifact:
[ERROR]   org.apache.arrow.gandiva:arrow-gandiva:jar:9.0.0-SNAPSHOT
[ERROR]
[ERROR] from the specified remote repositories:
[ERROR]   apache.snapshots (https://repository.apache.org/snapshots, 
releases=false, snapshots=true),
[ERROR]   central (https://repo.maven.apache.org/maven2, releases=true, 
snapshots=false)
[ERROR]
[ERROR] -> [Help 1]
[ERROR]
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
[ERROR]
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn  -rf :arrow-gandiva
 
 
 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16533) Update gandiva protobuf compilation support to include M1

2022-05-11 Thread Larry White (Jira)
Larry White created ARROW-16533:
---

 Summary: Update gandiva protobuf compilation support to include M1
 Key: ARROW-16533
 URL: https://issues.apache.org/jira/browse/ARROW-16533
 Project: Apache Arrow
  Issue Type: Bug
  Components: Java
Affects Versions: 8.0.0, 9.0.0
 Environment: macOS, M1
Reporter: Larry White


Gandiva needs to generate Protobuf Java sources from the definitions, and this 
relies on a JAR that has the native Protobuf compiler embedded in it - but the 
current package doesn't have an ARMv8 build available.  protobuf-java version 
3.20.1 does have M1 support.
 
This means that building from source as documented 
(https://arrow.apache.org/docs/developers/java/building.html) cannot be done on 
M1 as the following exception occurs:
 

[INFO] BUILD FAILURE
[INFO] 
[INFO] Total time:  03:38 min
[INFO] Finished at: 2022-05-10T16:19:24-04:00
[INFO] 
[ERROR] Failed to execute goal 
org.xolstice.maven.plugins:protobuf-maven-plugin:0.6.1:compile (default) on 
project arrow-gandiva: Unable to resolve artifact: Missing:
[ERROR] --
[ERROR] 1) com.google.protobuf:protoc:exe:osx-aarch_64:2.5.0
[ERROR]
[ERROR]   Try downloading the file manually from the project website.
[ERROR]
[ERROR]   Then, install it using the command:
[ERROR]   mvn install:install-file -DgroupId=com.google.protobuf 
-DartifactId=protoc -Dversion=2.5.0 -Dclassifier=osx-aarch_64 -Dpackaging=exe 
-Dfile=/path/to/file
[ERROR]
[ERROR]   Alternatively, if you host your own repository you can deploy the 
file there:
[ERROR]   mvn deploy:deploy-file -DgroupId=com.google.protobuf 
-DartifactId=protoc -Dversion=2.5.0 -Dclassifier=osx-aarch_64 -Dpackaging=exe 
-Dfile=/path/to/file -Durl=[url] -DrepositoryId=[id]
[ERROR]
[ERROR]   Path to dependency:
[ERROR] 1) org.apache.arrow.gandiva:arrow-gandiva:jar:9.0.0-SNAPSHOT
[ERROR] 2) com.google.protobuf:protoc:exe:osx-aarch_64:2.5.0
[ERROR]
[ERROR] --
[ERROR] 1 required artifact is missing.
[ERROR]
[ERROR] for artifact:
[ERROR]   org.apache.arrow.gandiva:arrow-gandiva:jar:9.0.0-SNAPSHOT
[ERROR]
[ERROR] from the specified remote repositories:
[ERROR]   apache.snapshots (https://repository.apache.org/snapshots, 
releases=false, snapshots=true),
[ERROR]   central (https://repo.maven.apache.org/maven2, releases=true, 
snapshots=false)
[ERROR]
[ERROR] -> [Help 1]
[ERROR]
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
[ERROR]
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn  -rf :arrow-gandiva
 
 
 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16532) Update gandiva protobuf compilation support to include M1

2022-05-11 Thread Larry White (Jira)
Larry White created ARROW-16532:
---

 Summary: Update gandiva protobuf compilation support to include M1
 Key: ARROW-16532
 URL: https://issues.apache.org/jira/browse/ARROW-16532
 Project: Apache Arrow
  Issue Type: Bug
  Components: Java
Affects Versions: 8.0.0, 9.0.0
 Environment: macOS, M1
Reporter: Larry White


Gandiva needs to generate Protobuf Java sources from the definitions, and this 
relies on a JAR that has the native Protobuf compiler embedded in it - but the 
current package doesn't have an ARMv8 build available.  protobuf-java version 
3.20.1 does have M1 support.
 
This means that building from source as documented 
(https://arrow.apache.org/docs/developers/java/building.html) cannot be done on 
M1 as the following exception occurs:
 

[INFO] BUILD FAILURE
[INFO] 
[INFO] Total time:  03:38 min
[INFO] Finished at: 2022-05-10T16:19:24-04:00
[INFO] 
[ERROR] Failed to execute goal 
org.xolstice.maven.plugins:protobuf-maven-plugin:0.6.1:compile (default) on 
project arrow-gandiva: Unable to resolve artifact: Missing:
[ERROR] --
[ERROR] 1) com.google.protobuf:protoc:exe:osx-aarch_64:2.5.0
[ERROR]
[ERROR]   Try downloading the file manually from the project website.
[ERROR]
[ERROR]   Then, install it using the command:
[ERROR]   mvn install:install-file -DgroupId=com.google.protobuf 
-DartifactId=protoc -Dversion=2.5.0 -Dclassifier=osx-aarch_64 -Dpackaging=exe 
-Dfile=/path/to/file
[ERROR]
[ERROR]   Alternatively, if you host your own repository you can deploy the 
file there:
[ERROR]   mvn deploy:deploy-file -DgroupId=com.google.protobuf 
-DartifactId=protoc -Dversion=2.5.0 -Dclassifier=osx-aarch_64 -Dpackaging=exe 
-Dfile=/path/to/file -Durl=[url] -DrepositoryId=[id]
[ERROR]
[ERROR]   Path to dependency:
[ERROR] 1) org.apache.arrow.gandiva:arrow-gandiva:jar:9.0.0-SNAPSHOT
[ERROR] 2) com.google.protobuf:protoc:exe:osx-aarch_64:2.5.0
[ERROR]
[ERROR] --
[ERROR] 1 required artifact is missing.
[ERROR]
[ERROR] for artifact:
[ERROR]   org.apache.arrow.gandiva:arrow-gandiva:jar:9.0.0-SNAPSHOT
[ERROR]
[ERROR] from the specified remote repositories:
[ERROR]   apache.snapshots (https://repository.apache.org/snapshots, 
releases=false, snapshots=true),
[ERROR]   central (https://repo.maven.apache.org/maven2, releases=true, 
snapshots=false)
[ERROR]
[ERROR] -> [Help 1]
[ERROR]
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR]
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:
[ERROR] [Help 1] 
http://cwiki.apache.org/confluence/display/MAVEN/MojoExecutionException
[ERROR]
[ERROR] After correcting the problems, you can resume the build with the command
[ERROR]   mvn  -rf :arrow-gandiva
 
 
 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16493) Documented Java build process for native code fails on Apple M1 hardware

2022-05-06 Thread Larry White (Jira)
Larry White created ARROW-16493:
---

 Summary: Documented Java build process for native code fails on 
Apple M1 hardware
 Key: ARROW-16493
 URL: https://issues.apache.org/jira/browse/ARROW-16493
 Project: Apache Arrow
  Issue Type: Bug
  Components: Java
Affects Versions: 8.0.0
 Environment: Apple MacBook Pro M1 hardware 
OSX Monterey 12.2.1
Reporter: Larry White
 Attachments: errors.txt

A warning occurs while linking files in C Data Interface: 

[100%] *Linking CXX shared library libarrow_cdata_jni.dylib*


ld: warning: ignoring file 
/Library/Java/JavaVirtualMachines/jdk-11.0.13.jdk/Contents/Home/lib/server/libjvm.dylib,
 building for macOS-arm64 but attempting to link with file built for 
macOS-x86_64

 

Followed by numerous test failures later in the process, for example:

 

{{{}[ERROR] Tests run: 45, Failures: 0, Errors: 45, Skipped: 0, Time elapsed: 
0.725 s <<< FAILURE! - in org.apache.arrow.c.RoundtripTest{}}}{{{}[ERROR] 
testUInt8Vector  Time elapsed: 0.054 s  <<< 
ERROR!{}}}{{{}java.lang.ExceptionInInitializerError{}}}{{{}at 
org.apache.arrow.c.RoundtripTest.vectorRoundtrip(RoundtripTest.java:129){}}}{{{}at
 org.apache.arrow.c.RoundtripTest.roundtrip(RoundtripTest.java:155){}}}{{{}at 
org.apache.arrow.c.RoundtripTest.testUInt8Vector(RoundtripTest.java:463){}}}{{{}*Caused
 by: java.lang.IllegalStateException: error loading native libraries: 
java.io.FileNotFoundException: libarrow_cdata_jni.dylib* {}}}{{{}at 
org.apache.arrow.c.RoundtripTest.vectorRoundtrip(RoundtripTest.java:129){}}}{{{}at
 org.apache.arrow.c.RoundtripTest.roundtrip(RoundtripTest.java:155){}}}{{{}at 
org.apache.arrow.c.RoundtripTest.testUInt8Vector(RoundtripTest.java:463){}}}

 

{{Steps to reproduce: }}

{{On Apple M1 hardware, follow the build instructions at 
[https://arrow.apache.org/docs/dev/developers/java/building.html] }}

{{{}Follow instructions to build the C Data interface. (warning 
occurs){}}}{{{}{}}}

{{Continue to follow instructions to build remaining JNI interfaces. (test 
failures occur). }}{{}}

 

{{Note that the file that cannot be found is the same file being linked when 
the earlier warning occurs.}}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16466) Bundle DLLs for JNI interfaces into Maven Jars

2022-05-04 Thread Larry White (Jira)
Larry White created ARROW-16466:
---

 Summary: Bundle DLLs for JNI interfaces into Maven Jars
 Key: ARROW-16466
 URL: https://issues.apache.org/jira/browse/ARROW-16466
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 8.0.0
Reporter: Larry White






--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Created] (ARROW-16465) Create build scripts and documentation for producing DLLs for JNI interfaces

2022-05-04 Thread Larry White (Jira)
Larry White created ARROW-16465:
---

 Summary: Create build scripts and documentation for producing DLLs 
for JNI interfaces
 Key: ARROW-16465
 URL: https://issues.apache.org/jira/browse/ARROW-16465
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Java
Affects Versions: 8.0.0
Reporter: Larry White






--
This message was sent by Atlassian Jira
(v8.20.7#820007)