Re: FYI: Apache Hive community finally succeeded to migrate to Apache ORC 1.8.3

2023-04-13 Thread Owen O'Malley
That is great! I’m glad that they can now take advantage of the continuing ORC development. .. Owen > On Apr 14, 2023, at 04:40, Gang Wu wrote: > > Congrats and thanks to everyone involved! > > Best, > Gang > >> On Fri, Apr 14, 2023 at 4:56 AM Panagiotis Garefalakis >> wrote: >> >>

attending ApacheCon NA 2022?

2022-09-22 Thread Owen O'Malley
Is anyone else attending ApacheCon next month? It would be nice to get together. .. Owen

Re: Haskell Implementation of ORC

2022-08-23 Thread Owen O'Malley
Huw, Generally we assign each ORC File writer implementation a unique writer id so that we can determine the writer of the file. Would you like a number assigned to your writer? We'd ask that your writer always set its id into the Footer.writer field.

Re: [QUESTION] Can compression cross row boundaries?

2022-07-26 Thread Owen O'Malley
Compression in ORC not only crosses rows, but across the row groups (every 10k rows) that are the index points. Look at the ORC specification ( https://orc.apache.org/specification/ORCv1/) on Compression. Compression does not cross stripe boundaries, because that would violate the constraint that

[jira] [Created] (ORC-1204) Introduce a mechanism for the row-by-row to write when there are long arrays

2022-06-16 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-1204: -- Summary: Introduce a mechanism for the row-by-row to write when there are long arrays Key: ORC-1204 URL: https://issues.apache.org/jira/browse/ORC-1204 Project: ORC

Re: Name for the official ORC Rust crate

2022-05-31 Thread Owen O'Malley
gt; > > > > > > > >> There is already a Rust ORC reader: > > > > >> https://rustrepo.com/repo/travisbrown-orcrs > > > > >> We still need a writer though. If I have 6 months to do so I can > > write > > > > >> one. > >

Re: [orc] tag v1.7.2 created (now a9d6d1b)

2021-12-20 Thread Owen O'Malley
Thank you for making the release, Dongjoon! At Apache, you need to create the tag for the final release with the "rel/" prefix, which is why the previous releases were tagged as "rel/release-X.Y.Z". That guarantees that no one accidentally deletes or replaces it. Thanks, Owen On Mon, Dec 20,

[jira] [Created] (ORC-1017) Create a new tool that summarizes the size of a file by column

2021-10-01 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-1017: -- Summary: Create a new tool that summarizes the size of a file by column Key: ORC-1017 URL: https://issues.apache.org/jira/browse/ORC-1017 Project: ORC Issue

[jira] [Created] (ORC-1014) Add details when we get IOExceptions from file system

2021-09-30 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-1014: -- Summary: Add details when we get IOExceptions from file system Key: ORC-1014 URL: https://issues.apache.org/jira/browse/ORC-1014 Project: ORC Issue Type

[jira] [Created] (ORC-1004) ORC writer doesn't support the selected vector

2021-09-16 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-1004: -- Summary: ORC writer doesn't support the selected vector Key: ORC-1004 URL: https://issues.apache.org/jira/browse/ORC-1004 Project: ORC Issue Type: Bug

Re: [VOTE] Release Apache ORC 1.7.0 (RC0)

2021-09-14 Thread Owen O'Malley
+1 (binding) * Checked sig and checksum * Ran docker builds * Built with -Pbenchmark & -Panalyze and -Papache-release On Mon, Sep 13, 2021 at 1:09 AM Dongjoon Hyun wrote: > Please vote on releasing the following candidate as Apache ORC version > 1.7.0. > > [ ] +1 Release this package as Apache

Re: [VOTE] Release Apache ORC 1.5.13 (RC0)

2021-09-14 Thread Owen O'Malley
+1 (binding) * Checked sig & checksum * Ran the docker builds * Compiled with -Panalyze & -Pbenchmark and -Papache-release One annoyance is that TestByteRle.cc has a double semicolon that clang complains about and thus fails the ubuntu 20 with clang build. diff --git a/c++/test/TestByteRle.cc

Re: [VOTE] Release Apache ORC 1.6.11 (RC0)

2021-09-14 Thread Owen O'Malley
+1 (binding) * Ran the docker builds * Checked the sig/checksum * Built with -Panalyze and -Papache-release On Tue, Sep 14, 2021 at 10:47 PM Pavan Lanka wrote: > +1 (non-binding) > > Performed the following: > * C++ build and test > * Java build and test using OpenJDK 8 > > Regards, > Pavan >

[jira] [Created] (ORC-994) Fix javadoc so that it doesn't put files into the source tree

2021-09-14 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-994: - Summary: Fix javadoc so that it doesn't put files into the source tree Key: ORC-994 URL: https://issues.apache.org/jira/browse/ORC-994 Project: ORC Issue Type

[jira] [Created] (ORC-985) ORC branch 1.7 is producing larger files from java writer

2021-09-03 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-985: - Summary: ORC branch 1.7 is producing larger files from java writer Key: ORC-985 URL: https://issues.apache.org/jira/browse/ORC-985 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-984) Create new writer versions for orc 1.7 and 1.8

2021-09-03 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-984: - Summary: Create new writer versions for orc 1.7 and 1.8 Key: ORC-984 URL: https://issues.apache.org/jira/browse/ORC-984 Project: ORC Issue Type: Improvement

Re: [E] Re: Performance of hive 1.2 with ORC-1.5.x

2021-08-26 Thread Owen O'Malley
Upgrading the internal version of ORC that is bundled into Hive 1.2 will be a lot of work. To be honest, you should strongly consider moving to Hive 2.3 (or later), which uses the standalone ORC 1.3. Upgrading that to ORC 1.5 or 1.6 would be relatively straightforward. The short answer is that

[RESULT][VOTE] Should we release ORC 1.6.10rc0?

2021-08-09 Thread Owen O'Malley
h Java and C++. > > Best, > Gang > > On Tue, Aug 10, 2021 at 1:31 AM Owen O'Malley > wrote: > > > Thanks for the votes, Dongjoon, Kyle, and William! > > > > We need one more PMC vote. > > > > Thanks, > >Owen > > > > On Sat, Aug 7, 2021

Re: [VOTE] Should we release ORC 1.6.10rc0?

2021-08-09 Thread Owen O'Malley
apshot + Spark > > > 3.3.0-SNAPSHOT) > > >https://github.com/dongjoon-hyun/spark/pull/63 > > > > > > I only noticed that the commit message has a typo, `Preparing for > release > > > 1.5.10.`. > > > > > >https://github.com/apa

[VOTE] Should we release ORC 1.6.10rc0?

2021-08-04 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.6.10? tar: http://home.apache.org/~omalley/orc-1.6.10/ tag: https://github.com/apache/orc/releases/tag/release-1.6.10rc0 jiras: https://issues.apache.org/jira/projects/ORC/versions/12350446 Thanks!

[jira] [Created] (ORC-906) Upgrade branch-1.6 to storage-api 2.7.3

2021-08-03 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-906: - Summary: Upgrade branch-1.6 to storage-api 2.7.3 Key: ORC-906 URL: https://issues.apache.org/jira/browse/ORC-906 Project: ORC Issue Type: Improvement

[REPORT] ORC Board Report for July 2021

2021-07-13 Thread Owen O'Malley
## Description: The mission of ORC is the creation and maintenance of software related to the smallest, fastest columnar storage for Hadoop workloads ## Issues: There are no issues requiring board attention. ## Membership Data: Apache ORC was founded 2015-04-22 (6 years ago) There are currently

Re: [VOTE] Should we release ORC 1.6.8rc0?

2021-05-21 Thread Owen O'Malley
+1 I verified the checksum & signature. I also checked the docker builds. Thanks, Owen On Thu, May 20, 2021 at 3:42 PM Pavan Lanka wrote: > +1 (non-binding) > > I verified C++ and Java builds and tests > > Regards, > Pavan > > > On May 19, 2021, at 11:22 PM, Dongjoon Hyun > wrote: > > > >

[jira] [Created] (ORC-797) Allow writers to get the stripe information

2021-05-14 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-797: - Summary: Allow writers to get the stripe information Key: ORC-797 URL: https://issues.apache.org/jira/browse/ORC-797 Project: ORC Issue Type: Improvement

Re: Apache ORC Versioning (Semantic Versioning)

2021-04-28 Thread Owen O'Malley
The goal was to follow semantic versioning. Clearly the 1.6 release broke too much of the interface and made clear how much of the implementation Hive and LLAP are accessing. Certainly as the two projects with the deepest integration, we should build Hive and Spark with the ORC 1.7 rcs before they

Re: Renaming `master` branch to `main`?

2021-04-14 Thread Owen O'Malley
+1 to renaming the main branch. .. Owen On Wed, Apr 14, 2021 at 12:05 PM William Hyun wrote: > Given that the default branch is named `main` by GitHub, I believe it > could be > a welcome change to ORC newcomers. > > +1 > > On 2021/04/14 09:47:55, Panos Garefalakis wrote: > > Hey Dongjoon, >

[DRAFT] ORC Board Report April 2021

2021-04-12 Thread Owen O'Malley
All, Please send me any feedback. Thanks! ## Description: The mission of ORC is the creation and maintenance of software related to the smallest, fastest columnar storage for Hadoop workloads ## Issues: There are no issues that require board attention. Based on the feedback last quarter we

[jira] [Created] (ORC-767) Add docker support for jdk 8 in debian 10

2021-03-19 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-767: - Summary: Add docker support for jdk 8 in debian 10 Key: ORC-767 URL: https://issues.apache.org/jira/browse/ORC-767 Project: ORC Issue Type: Improvement

[jira] [Created] (ORC-766) Generalize the docker scripts to handle build-args

2021-03-18 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-766: - Summary: Generalize the docker scripts to handle build-args Key: ORC-766 URL: https://issues.apache.org/jira/browse/ORC-766 Project: ORC Issue Type: Improvement

[jira] [Created] (ORC-750) Fix benchmark to pass checkstyle:check

2021-02-16 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-750: - Summary: Fix benchmark to pass checkstyle:check Key: ORC-750 URL: https://issues.apache.org/jira/browse/ORC-750 Project: ORC Issue Type: Improvement

[jira] [Created] (ORC-749) Add checkstyle to -Panalzye

2021-02-16 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-749: - Summary: Add checkstyle to -Panalzye Key: ORC-749 URL: https://issues.apache.org/jira/browse/ORC-749 Project: ORC Issue Type: Improvement Reporter

Re: Should we remove the requirement that dictionaries should be sorted?

2021-02-08 Thread Owen O'Malley
Ugh, I mean that the data stream doesn't need to be held in an array of longs. The dictionary itself needs to stay decompressed. :) .. Owen On Mon, Feb 8, 2021 at 8:57 PM Owen O'Malley wrote: > > > On Mon, Feb 8, 2021 at 8:44 PM Gopal V wrote: > >> >> > Reason to s

Re: Should we remove the requirement that dictionaries should be sorted?

2021-02-08 Thread Owen O'Malley
On Mon, Feb 8, 2021 at 8:44 PM Gopal V wrote: > > > Reason to stay sorted: > > > > 1. Searching for values in the dictionaries can use binary search. > > We did get some compression advantages from this in the past, but the > write-throughput is hurt by this one factor both on memory bloat

Should we remove the requirement that dictionaries should be sorted?

2021-02-08 Thread Owen O'Malley
All, Now that Lei is working on creating a replacement for the red-black string dictionaries, it is a good time to discuss whether we should continue to sort the dictionaries as they are written. Reason to stay sorted: 1. Searching for values in the dictionaries can use binary search.

Re: [VOTE] Should we release ORC 1.6.7rc0?

2021-01-22 Thread Owen O'Malley
+1 for releasing 1.6.7rc0 * Checked signature and hash * Ran the docker script builds The only issue that I hit is that centos7 fails because of the old maven. Using the maven wrapper makes the build work, but then RAT complains about unlicensed files (build/apache-maven-3.6.3/bin/m2.conf). But

[jira] [Created] (ORC-699) Minor improvements to the scan tool

2020-12-15 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-699: - Summary: Minor improvements to the scan tool Key: ORC-699 URL: https://issues.apache.org/jira/browse/ORC-699 Project: ORC Issue Type: Improvement

[jira] [Created] (ORC-698) Add safety check for negative dictionary lengths

2020-12-14 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-698: - Summary: Add safety check for negative dictionary lengths Key: ORC-698 URL: https://issues.apache.org/jira/browse/ORC-698 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-697) Improve Scan tool to report where files are corrupted.

2020-12-14 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-697: - Summary: Improve Scan tool to report where files are corrupted. Key: ORC-697 URL: https://issues.apache.org/jira/browse/ORC-697 Project: ORC Issue Type

Re: [VOTE] Should we release ORC 1.6.6rc0?

2020-12-09 Thread Owen O'Malley
+1 for releasing 1.6.6rc0 I checked the signatures & hash. Ran the docker scripts to verify that the build ran on all of the supported Linux variants. Built the java benchmark profile. Ran Apache RAT. Looked over the commits from 1.6.5. .. Owen On Sat, Dec 5, 2020 at 11:23 AM Panos Garefalakis

[DISCUSS] Remove support for Debian 8 and Ubuntu 14

2020-10-20 Thread Owen O'Malley
All, In ORC-674 , I commented that some of the recent changes in trunk have broken older versions of cmake. Rather than figure out how to fix it, I propose that we drop both Debian 8 and Ubuntu 14 from our supported Linux versions. (They are both EOLed by

[jira] [Created] (ORC-674) Update docker files adding Ubuntu 20 and removing Debian 8 and Ubuntu 14

2020-10-19 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-674: - Summary: Update docker files adding Ubuntu 20 and removing Debian 8 and Ubuntu 14 Key: ORC-674 URL: https://issues.apache.org/jira/browse/ORC-674 Project: ORC

Re: [VOTE] Shall we release 1.6.5rc0?

2020-10-01 Thread Owen O'Malley
and hash, checked to make sure all the source >>> files had the license and no binary files beyond images and type fonts were >>> included. Did a build in ubuntu 18 docker. >>> >>> Alan. >>> >>> On Wed, Sep 30, 2020 at 9:28 AM Owen O'Malley

Re: [VOTE] Shall we release ORC 1.5.12rc0?

2020-09-30 Thread Owen O'Malley
no binary files beyond images and type fonts were > included. Did a build in ubuntu 18 docker. > > Alan. > > On Tue, Sep 29, 2020 at 10:27 PM Dongjoon Hyun > wrote: > > > +1 for Apache ORC 1.5.12-rc0. > > > > Thanks, > > Dongjoon. > >

Re: [VOTE] Shall we release 1.6.5rc0?

2020-09-30 Thread Owen O'Malley
rks > Tests were previously failing when I tried to use 1.6.4 > > Best, > Shardul > On 2020/09/25 21:48:36, "Owen O'Malley" wrote: > > All, > > > > Should we release the following artifacts as ORC 1.6.5? > > > > tar: http://home.apache.org/~omalley

Re: [VOTE] Should we release ORC-1.6.4rc1?

2020-09-28 Thread Owen O'Malley
currently and > gives a wrong result for all downstream. > > Could you review the following PR? > > https://github.com/apache/orc/pull/545 > ORC-667: Positional mapping for nested struct types should not applied > by default > > Bests, > Dongjoon. > > > On

[VOTE] Shall we release ORC 1.5.12rc0?

2020-09-25 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.5.12? tar: http://home.apache.org/~omalley/orc-1.5.12/ tag: https://github.com/apache/orc/releases/tag/release-1.5.12rc0 jiras: https://issues.apache.org/jira/projects/ORC/versions/12348828 Thanks!

[VOTE] Shall we release 1.6.5rc0?

2020-09-25 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.6.5? tar: http://home.apache.org/~omalley/orc-1.6.5/ tag: https://github.com/apache/orc/releases/tag/release-1.6.5rc0 jiras: https://issues.apache.org/jira/projects/ORC/versions/12348819 Thanks!

New releases

2020-09-25 Thread Owen O'Malley
All, As Dongjoon mentioned, we need to release replacement 1.5 and 1.6 releases. Does anyone have any additional fixes that need to go in? .. Owen

Re: [VOTE] Should we release ORC-1.5.11rc0?

2020-09-14 Thread Owen O'Malley
0 at 10:50 AM Alan Gates wrote: > > > +1. Checked signature, hash, license, and notice files. Made sure there > > were no binaries in the distribution (beyond images & fonts). Did a > build. > > > > Alan. > > > > On 2020/09/03 01:00:17, "Owen O

Re: ORC vector rollback

2020-09-11 Thread Owen O'Malley
Children().get(c); > > Object fieldValue = record.get(thisField); > Schema.Field avroField = currSchema.getField(thisField); > > // If this fails on some column X, I want to rollback the data I've > written for batch.numCols - X > addToVector(type, colVector, avroField.sche

Re: ORC vector rollback

2020-09-11 Thread Owen O'Malley
Where is the failure happening? If it is happening in the ORC writer code, there isn't a way to do that. Can I ask what kind of exception you are hitting? In the column (aka tree) writers, there shouldn't be much that can go wrong. It doesn't even write to the file handle, just buffering in

[jira] [Created] (ORC-664) docker image for centos7 fails to build zstd

2020-09-09 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-664: - Summary: docker image for centos7 fails to build zstd Key: ORC-664 URL: https://issues.apache.org/jira/browse/ORC-664 Project: ORC Issue Type: Task

Re: [VOTE] Should we release ORC-1.5.11rc0?

2020-09-08 Thread Owen O'Malley
Can some PMC members take a look at the 1.5 & 1.6 RCS? Thanks, Owen On Thu, Sep 3, 2020 at 1:00 AM Owen O'Malley wrote: > All, Should we release the following artifacts as ORC 1.5.11? > > tar: http://home.apache.org/~omalley/orc-1.5.11/ > tag: https://github.com/apache/

[VOTE] Should we release ORC-1.5.11rc0?

2020-09-02 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.5.11? tar: http://home.apache.org/~omalley/orc-1.5.11/ tag: https://github.com/apache/orc/releases/tag/release-1.5.11rc0 jiras: https://issues.apache.org/jira/browse/ORC/fixforversion/12348175 Thanks!

Re: [VOTE] Should we release ORC-1.6.4rc1?

2020-09-02 Thread Owen O'Malley
OK, I accidentally messed up the version setting in the java benchmark code, so I rolled a new RC. All, Should we release the following artifacts as ORC 1.6.4? tar: http://home.apache.org/~omalley/orc-1.6.4/ tag: https://github.com/apache/orc/releases/tag/release-1.6.4rc1 jiras:

[VOTE] Should we release ORC-1.6.4rc0?

2020-09-02 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.6.4? tar: http://home.apache.org/~omalley/orc-1.6.4/ tag: https://github.com/apache/orc/releases/tag/release-1.6.4rc0 jiras: https://issues.apache.org/jira/browse/ORC/fixforversion/12348176 Thanks!

Bug fix releases for ORC 1.5 and 1.6

2020-09-01 Thread Owen O'Malley
All, Are there any more bug fixes that we really should get in before an ORC 1.5 and 1.6 release? Thanks, Owen

[jira] [Created] (ORC-661) DateColumnStatistics uses Date, which is not timezone agnostic.

2020-08-24 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-661: - Summary: DateColumnStatistics uses Date, which is not timezone agnostic. Key: ORC-661 URL: https://issues.apache.org/jira/browse/ORC-661 Project: ORC Issue Type

Re: uniontypes and Spark

2020-07-31 Thread Owen O'Malley
Ryan, I did just look at that code in Spark last week. The problem as you correctly surmised is that Spark doesn't have a uniontype. I think we probably need a fix that converts the uniontype into a struct for Spark. In such a translation, you would have fields for each variant of the union and

Re: Interpreting ORC Java Reference

2020-07-22 Thread Owen O'Malley
On Mon, Jul 20, 2020 at 7:01 PM Ryan Schachte wrote: > Hi team, > apologies for the last email, believe I sent too early. I'm interested in > better understanding the ORC reference guide in the docs and wanted to > clarify some things to see if I'm understanding correctly. > > I realize for the

Re: Interpreting ORC Java Reference

2020-07-20 Thread Owen O'Malley
A good place to start would be: - The core java documentation - https://orc.apache.org/docs/core-java.html - The orc to json tool - https://github.com/apache/orc/blob/master/java/tools/src/java/org/apache/orc/tools/PrintData.java Feel free to ask questions here on the dev list too.

[DISCUSS] Support for CentOS 6?

2020-06-23 Thread Owen O'Malley
Hi all, The master branch is currently broken on CentOS 6. (Some of the new C++ work used some of the C++ 11 constructs that the old g++ didn't support. We could fix it, but CentOS 6 is EOL at the end of the year. I'd propose that we just drop support for it. Thoughts? .. Owen

[jira] [Created] (ORC-643) Change logging of codec creation to debug

2020-06-19 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-643: - Summary: Change logging of codec creation to debug Key: ORC-643 URL: https://issues.apache.org/jira/browse/ORC-643 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-637) create a new recovery tools that handles missing blocks

2020-05-17 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-637: - Summary: create a new recovery tools that handles missing blocks Key: ORC-637 URL: https://issues.apache.org/jira/browse/ORC-637 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-635) Add some improvements to the random data generator

2020-05-13 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-635: - Summary: Add some improvements to the random data generator Key: ORC-635 URL: https://issues.apache.org/jira/browse/ORC-635 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-634) Fix the json output for double NaN and infinite

2020-05-13 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-634: - Summary: Fix the json output for double NaN and infinite Key: ORC-634 URL: https://issues.apache.org/jira/browse/ORC-634 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-632) Fix OrcFile.open's mutation of the options argument

2020-05-07 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-632: - Summary: Fix OrcFile.open's mutation of the options argument Key: ORC-632 URL: https://issues.apache.org/jira/browse/ORC-632 Project: ORC Issue Type: Bug

[jira] [Created] (ORC-631) Add guava dependency to tools jar

2020-05-07 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-631: - Summary: Add guava dependency to tools jar Key: ORC-631 URL: https://issues.apache.org/jira/browse/ORC-631 Project: ORC Issue Type: Bug Reporter

[jira] [Created] (ORC-628) Add a new java tool to count rows from ORC files under a directory

2020-05-06 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-628: - Summary: Add a new java tool to count rows from ORC files under a directory Key: ORC-628 URL: https://issues.apache.org/jira/browse/ORC-628 Project: ORC Issue

Re: [VOTE] Shall we release ORC-1.6.3rc1?

2020-04-26 Thread Owen O'Malley
both Hive > > profiles due to the unit test failure. > > 1.6.22019/11/21 This worked with Spark 3.0 with Hive 1.2.1 default > > profile while it fails Hive 2.3.7 profile > > 2020/01/09 Apache Spark 3.0 switch to Apache Hive 2.3.7 > > profil

Re: [VOTE] Shall we release ORC-1.5.10rc1?

2020-04-26 Thread Owen O'Malley
k you, Owen. > > > > Bests, > > Dongjoon > > > > On Thu, Apr 23, 2020 at 5:15 PM Owen O'Malley > > wrote: > > > > > All, > > > > > > Should we release the following artifacts as ORC 1.5.10? > > > > > > tar: htt

Re: [VOTE] Shall we release ORC-1.5.10rc1?

2020-04-23 Thread Owen O'Malley
seems to be changed to some unknown key which Apache > ORC KEYS doesn't include. > > Bests, > Dongjoon. > > > On Thu, Apr 23, 2020 at 5:15 PM Owen O'Malley > wrote: > > > All, > > > > Should we release the following artifacts as ORC 1.5.10? > > >

[VOTE] Shall we release ORC-1.6.3rc1?

2020-04-23 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.6.3? tar: http://home.apache.org/~omalley/orc-1.6.3/ tag: https://github.com/apache/orc/releases/tag/release-1.6.3rc1 jiras: https://issues.apache.org/jira/projects/ORC/versions/12346545 Thanks!

[VOTE] Shall we release ORC-1.5.10rc1?

2020-04-23 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.5.10? tar: http://home.apache.org/~omalley/orc-1.5.10/ tag: https://github.com/apache/orc/releases/tag/release-1.5.10rc1 jiras: https://issues.apache.org/jira/projects/ORC/versions/12346912 Thanks!

Re: New 1.6 release?

2020-04-23 Thread Owen O'Malley
gt; Bests, > Dongjoon. > > On Mon, Mar 30, 2020 at 9:59 AM Panos Garefalakis > wrote: > > > Hey Owen, > > > > I believe ORC-616 is an important fix that should be included, also > > ORC-611 is related to Timestamps and could be included. > > > > Che

[VOTE] Shall we release ORC-1.6.3rc0?

2020-04-23 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.6.3? tar: http://home.apache.org/~omalley/orc-1.6.3/ tag: https://github.com/apache/orc/releases/tag/release-1.6.3rc0 jiras: https://issues.apache.org/jira/projects/ORC/versions/12346545 Thanks!

[VOTE] Shall we release ORC-1.5.10rc0?

2020-04-23 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.5.10? tar: http://home.apache.org/~omalley/orc-1.5.10/ tag: https://github.com/apache/orc/releases/tag/release-1.5.10rc0 jiras: https://issues.apache.org/jira/projects/ORC/versions/12346912 Thanks!

[jira] [Created] (ORC-621) Need reader fix for ORC-569

2020-04-20 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-621: - Summary: Need reader fix for ORC-569 Key: ORC-621 URL: https://issues.apache.org/jira/browse/ORC-621 Project: ORC Issue Type: Bug Reporter: Owen

[jira] [Created] (ORC-620) Modify the row filter API to use BiFunction

2020-04-19 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-620: - Summary: Modify the row filter API to use BiFunction Key: ORC-620 URL: https://issues.apache.org/jira/browse/ORC-620 Project: ORC Issue Type: Bug

[REPORT] ORC Board Report

2020-04-10 Thread Owen O'Malley
## Description: The mission of ORC is the creation and maintenance of software related to the smallest, fastest columnar storage for Hadoop workloads ## Issues: There are no issues requiring board attention. ## Membership Data: Apache ORC was founded 2015-04-21 (5 years ago) There are currently

New 1.6 release?

2020-03-30 Thread Owen O'Malley
Hi all, In working on Iceberg, it would be good to make a new ORC 1.6 release to get the calendar work that we did. Does anyone have any other patches that should go in? .. Owen

Re: ORC C++ write, avoid stradling HDFS blocks

2020-03-03 Thread Owen O'Malley
tripe. Please do contribute it back to the project. I also found out that the libhdfspp does not support the "writing" and the > "short circuit reads" functionality. I plan to use libhdfs3 from Apache > HAWQ project which promises both of these features. > There is always more work to be done. Thanks, Owen O'Malley

[jira] [Created] (ORC-603) Update current Hadoop version to 2.7.7.

2020-02-24 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-603: - Summary: Update current Hadoop version to 2.7.7. Key: ORC-603 URL: https://issues.apache.org/jira/browse/ORC-603 Project: ORC Issue Type: Bug Components

[jira] [Created] (ORC-602) Create adaptor for using FSDataInputStream for Java ORC reader

2020-02-24 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-602: - Summary: Create adaptor for using FSDataInputStream for Java ORC reader Key: ORC-602 URL: https://issues.apache.org/jira/browse/ORC-602 Project: ORC Issue Type

Re: [VOTE] Should we release ORC-1.5.9rc1?

2020-01-30 Thread Owen O'Malley
endency `threeten-extra-1.5.0.jar` >> (228K) >> 4. Tested with Spark ORC tests with new `hive-storage-api` (2.7.1). >> >> Thank you, Owen. >> >> Bests, >> Dongjoon. >> >>> On 2020/01/28 00:37:36, "Owen O'Malley" wrote:

[VOTE] Should we release ORC-1.5.9rc1?

2020-01-27 Thread Owen O'Malley
All, I've fixed the API change and rolled a new RC: Should we release the following artifacts as ORC 1.5.9? tar: http://home.apache.org/~omalley/orc-1.5.9/ tag: https://github.com/apache/orc/releases/tag/release-1.5.9rc1 jiras: https://issues.apache.org/jira/browse/ORC/fixforversion/12346546

[jira] [Created] (ORC-594) Fix compatibility with ORC 1.5.8 on OrcTail.getStripeStatistics

2020-01-27 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-594: - Summary: Fix compatibility with ORC 1.5.8 on OrcTail.getStripeStatistics Key: ORC-594 URL: https://issues.apache.org/jira/browse/ORC-594 Project: ORC Issue Type

Re: [VOTE] Should we release ORC-1.5.9rc0?

2020-01-27 Thread Owen O'Malley
r ProlepticCalendar > > From user side, 2.7.1 seems to work with Hive 3.1.3+. > Is there any expected incompatibility? > I'm wondering if this works with Hive 1.2.1 and Hive 2.3.6? > > Bests, > Dongjoon. > > On 2020/01/24 23:00:31, "Owen O'Malley" wrote: >> Al

[VOTE] Should we release ORC-1.5.9rc0?

2020-01-24 Thread Owen O'Malley
All, Should we release the following artifacts as ORC 1.5.9? tar: http://home.apache.org/~omalley/orc-1.5.9/ tag: https://github.com/apache/orc/releases/tag/release-1.5.9rc0 jiras: https://issues.apache.org/jira/browse/ORC/fixforversion/12346546 Thanks!

Release 1.5.9

2020-01-24 Thread Owen O'Malley
I'd like to get a 1.5 release out with ORC-27 and ORC-578 on the Java side and ORC-574 on the C++ side. Is there anything else we should get into a new 1.5 release? Thanks, Owen

Re: contribution

2020-01-16 Thread Owen O'Malley
Zoltan, It looks like someone already added you as a contributor on ORC jira, so you should be able to assign issues to yourself. Let us know if it isn't working. On Thu, Jan 16, 2020 at 5:32 AM Yukihiro Okada wrote: > I'm one of contributors. I often take an assignment myself. > The ORC

[REPORT] ORC Board report for January 2020

2020-01-08 Thread Owen O'Malley
## Description: The mission of ORC is the creation and maintenance of software related to the smallest, fastest columnar storage for Hadoop workloads ## Issues: There are no issues requiring board attention. ## Membership Data: Apache ORC was founded 2015-04-21 (5 years ago) There are currently

[jira] [Created] (ORC-578) IllegalArgumentException: Can't use LongColumnVector to read proleptic Gregorian dates.

2019-12-11 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-578: - Summary: IllegalArgumentException: Can't use LongColumnVector to read proleptic Gregorian dates. Key: ORC-578 URL: https://issues.apache.org/jira/browse/ORC-578 Project

[jira] [Created] (ORC-577) Allow row-level filtering

2019-12-11 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-577: - Summary: Allow row-level filtering Key: ORC-577 URL: https://issues.apache.org/jira/browse/ORC-577 Project: ORC Issue Type: New Feature Reporter: Owen

Re: [VOTE] Shall we release ORC 1.4.5rc1?

2019-12-09 Thread Owen O'Malley
t; > https://github.com/apache/spark/commit/dfb0a8bb048d43f8fd1fb05b1027bd2fc7438dbc > > > > Bests, > > Dongjoon. > > > > > > On Fri, Dec 6, 2019 at 4:19 PM Alan Gates wrote: > > > >> +1. Did a build on ubuntu 16, checked the signatu

[VOTE] Shall we release ORC 1.4.5rc1?

2019-12-06 Thread Owen O'Malley
All, Ok, I backported a few more fixes in to rc1: - ORC-480 - ORC-552 - ORC-576 Should we release the following artifacts as ORC 1.4.5? tar: http://home.apache.org/~omalley/orc-1.4.5/ tag: https://github.com/apache/orc/releases/tag/release-1.4.5rc1 jiras:

Re: [VOTE] Shall we release ORC 1.4.5rc0?

2019-12-06 Thread Owen O'Malley
ndran > > > > On Fri, Dec 6, 2019 at 9:09 AM Alan Gates wrote: > > > I think that's probably fine. > > > > So +1 for the release. > > > > Alan. > > > > On Fri, Dec 6, 2019 at 9:00 AM Owen O'Malley > > wrote: > > > >

[jira] [Created] (ORC-576) Improve LICENSE file

2019-12-06 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-576: - Summary: Improve LICENSE file Key: ORC-576 URL: https://issues.apache.org/jira/browse/ORC-576 Project: ORC Issue Type: Task Reporter: Owen O'Malley

Re: [VOTE] Shall we release ORC 1.4.5rc0?

2019-12-06 Thread Owen O'Malley
n interested user can easily find them? > > Other than that, all looks good. I checked the signature and hash, did a > build on centos7, looked for any wayward files in the tarball. > > Alan. > > On Tue, Dec 3, 2019 at 2:02 PM Owen O'Malley > wrote: > > > All, >

[jira] [Created] (ORC-575) Give hints to file system for opening ORC files

2019-12-04 Thread Owen O'Malley (Jira)
Owen O'Malley created ORC-575: - Summary: Give hints to file system for opening ORC files Key: ORC-575 URL: https://issues.apache.org/jira/browse/ORC-575 Project: ORC Issue Type: Bug

  1   2   3   4   5   >