Thanks, Owen. This sounds perfect. On Wed, May 4, 2022 at 5:52 PM Owen O'Malley <owen.omal...@gmail.com> wrote:
> For context, the SalesGenerator was a schema and data profile from a large > company that stored their table in ORC. I had to scrub the details of the > table schema, but it generated random data that looked similar to their > production data. > > .. Owen > > On Wed, May 4, 2022 at 9:48 PM Owen O'Malley <owen.omal...@gmail.com> > wrote: > >> If you just want to test all of the types, I think >> examples/TestOrcFile.test1.orc will do it. It doesn't have much data in it, >> but it should test the functionality. You can also use the java/bench code >> to generate data. Look at the >> java/bench/core/src/java/org/apache/orc/bench/core/SalesGenerator.java >> class for how to generate arbitrary data. >> >> .. Owen >> >> On Wed, May 4, 2022 at 9:00 PM Larry White <ljw1...@gmail.com> wrote: >> >>> No worries. I appreciate the effort and understand the lack of time :) >>> >>> On Wed, May 4, 2022 at 4:29 PM Ian Kaplan <i...@bearcave.com> wrote: >>> >>>> >>>> The library was in Maven. I'm not sure why it's not there now. >>>> >>>> I need to update the code to use the latest Java ORC base code and >>>> track down this issue. Unfortunately I don't have time to do this right >>>> now. I apologize for any difficulties with the package. >>>> >>>> Ian >>>> >>>> >>>> On Wed, May 4, 2022 at 4:12 PM Larry White <ljw1...@gmail.com> wrote: >>>> >>>>> Thanks, Ian. This looks helpful and your API seems very nice. >>>>> Unfortunately, it doesn't seem like javaorc is on maven central currently. >>>>> I tried google search and maven centrals own search. >>>>> >>>>> >>>>> >>>>> On Wed, May 4, 2022 at 11:35 AM Ian Kaplan <i...@bearcave.com> wrote: >>>>> >>>>>> >>>>>> I have written some tests for my JavaOrc library. I have not had a >>>>>> chance to update the library for the recent releases of the base Java Orc >>>>>> library. My tests can be found here: >>>>>> https://github.com/IanLKaplan/javaorc/tree/master/src/test/java/com/topstonesoftware/javaorc >>>>>> >>>>>> On Wed, May 4, 2022 at 11:12 AM Larry White <ljw1...@gmail.com> >>>>>> wrote: >>>>>> >>>>>>> Hi all, >>>>>>> >>>>>>> I'm testing Arrow's ORC adaptor to see if there are data types it >>>>>>> doesn't handle. Are there any orc test files that cover all data types >>>>>>> (including complex types like union and lists)? The initial goal is to >>>>>>> document any unsupported types, with a secondary goal of improving >>>>>>> support >>>>>>> if any unsupported types are found. >>>>>>> >>>>>>> If someone knows of ORC tests that cover reading and all types that >>>>>>> would be extra awesome. >>>>>>> >>>>>>> Thanks, >>>>>>> >>>>>>> larry >>>>>>> >>>>>>