Re: Graph model in arrow

2020-11-18 Thread Fan Liya
Hi Leo, For graph data model, I can think of two popular ways of representations: 1) adjacent matrix: an n x n matrix A (where n is the number of vertices), and Aij = 1 indicates an arc from i to j. 2) adjacent list: a table head node for each vertex, and a list for each vertex to store arcs.

Re: [Discuss] Arrow Release Schedule

2020-11-18 Thread Wes McKinney
>From searching for "java maven" in our Jira issues: * https://issues.apache.org/jira/browse/ARROW-6103 * https://issues.apache.org/jira/browse/ARROW-1234 I just created https://issues.apache.org/jira/browse/ARROW-10648 On Wed, Nov 18, 2020 at 3:40 PM Keerat Singh wrote: > > Hi Neal, > > Do

Re: [Discuss] Arrow Release Schedule

2020-11-18 Thread Keerat Singh
Hi Neal, Do you have any information on the status of the tickets ? Regards, Keerat On Mon, Nov 16, 2020 at 11:56 AM Keerat Singh wrote: > Thank you, Kou and Wes, for your responses. > > As per discussions in the last sync call[11-Nov], there were talks about > releasing more frequently and

Re: [C++] 0x00 in Binary type

2020-11-18 Thread Micah Kornfield
+1 to what Francois said. You either want to use the method that takes a length or string_view for this case: https://github.com/apache/arrow/blob/843e8bb556a03f0e4c18841a623d1a0e9c236ee5/cpp/src/arrow/array/builder_binary.h#L72 On Wed, Nov 18, 2020 at 11:05 AM Francois Saint-Jacques <

Re: [C++] 0x00 in Binary type

2020-11-18 Thread Francois Saint-Jacques
I would say at first sight that it's due to your usage of char[] and builder.Append(d) implicitly does a strlen. François On Wed, Nov 18, 2020 at 2:00 PM Ying Zhou wrote: > > Sure! > > BinaryBuilder builder; > char d[] = "\x00\x01\xbf\x5b”; > (void)(builder.Append(d)); > std::shared_ptr array;

Re: [C++] 0x00 in Binary type

2020-11-18 Thread Ying Zhou
Sure! BinaryBuilder builder; char d[] = "\x00\x01\xbf\x5b”; (void)(builder.Append(d)); std::shared_ptr array; (void)(builder.Finish()); int32_t dataLength = 0; auto aarray = std::static_pointer_cast(array); const uint8_t* data = aarray->GetValue(0, ); data = aarray->GetValue(3, );

Re: [C++] 0x00 in Binary type

2020-11-18 Thread Antoine Pitrou
Hello, Le 18/11/2020 à 19:06, Ying Zhou a écrit : > > According to the documentation BINARY is "Variable-length bytes (no guarantee > of UTF8-ness)”. However in practice if I embed 0x00 in the middle of a char > array and Append it to a BinaryBuilder the 0x00 is converted to 0xff, >

[C++] 0x00 in Binary type

2020-11-18 Thread Ying Zhou
Hello, According to the documentation BINARY is "Variable-length bytes (no guarantee of UTF8-ness)”. However in practice if I embed 0x00 in the middle of a char array and Append it to a BinaryBuilder the 0x00 is converted to 0xff, everything after it is not appended and the length is computed

Re: Using arrow/compute/kernels/*internal.h headers

2020-11-18 Thread Wes McKinney
On #2, I think this discussion might be overly speculative until a full-fledged multithreaded hash aggregation is implemented in the Apache Arrow C++ library. There are other analytic database systems in the wild which might provide a blueprint for the way that we should approach this, and I don't

Re: Using arrow/compute/kernels/*internal.h headers

2020-11-18 Thread Benjamin Kietzman
1: Excellent! 2: The open JIRA for grouped aggregation is https://issues.apache.org/jira/browse/ARROW-4124 (though it's out of date since it predates the addition of ScalarAggregateKernel). To summarize: for *grouped* aggregation we want the kernel to do the work of evaluating group condition(s)

Re: Travis CI jobs gummed up on Arrow PRs?

2020-11-18 Thread Andrew Lamb
Thanks for follow up. Reading between the lines this sounds like it was simply a capacity issue at Travis CI. And indeed, when I looked at the original job that I reported appeared to be stuck, it had indeed run (and

[NIGHTLY] Arrow Build Report for Job nightly-2020-11-18-0

2020-11-18 Thread Crossbow
Arrow Build Report for Job nightly-2020-11-18-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-11-18-0 Failed Tasks: - conda-osx-clang-py36-r36: URL: