[ 
https://issues.apache.org/jira/browse/ARROW-6206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16911955#comment-16911955
 ] 

Jim Northrup commented on ARROW-6206:
-------------------------------------

(previsouly responded as email, sorry if this creates a dupe)

 

I admire Arrow for doing a thing well.  I hope that if I simply call “mvn 
maven-versions-plugin:latest” in the future this simple jdbc code will work 
better than before.

 

I appreciate the attention to the details.

 

I think through this discussion the jist is that tensorflow one-hot columns may 
quickly test the expected norms of arrow.  Likewise, timeseries datasets have 
us blowing gaskets all over the place in terms of time-to-completion and RAM 
using pandas.  What do we do with a 300 gig numpy dataset living in swap that 
takes 3 dasy to build? There’s no LSTM examples to demonstrate anything but toy 
datasets.

 

Turbodbc looks like a good fit for reducing transcription times.

 

For what I need in the space of Arrow, I think the ideal tool is something to 
work in and out of numpy and delegate to and from apache Geode or Hazelcast as 
the main substrate. 

 

If perchance arrow can act as a window to memory grids, all the better.

 

As I find the time for signups and 2fa’s I will compose this to the lists

 

> [Java][Docs] Document environment variables/java properties
> -----------------------------------------------------------
>
>                 Key: ARROW-6206
>                 URL: https://issues.apache.org/jira/browse/ARROW-6206
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Documentation, Java
>            Reporter: Micah Kornfield
>            Assignee: Ji Liu
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.15.0
>
>          Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Specifically, "-Dio.netty.tryReflectionSetAccessible=true" for JVMs >= 9 and 
> BoundsChecking/NullChecking for get.
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

Reply via email to