[jira] [Updated] (ORC-1204) Introduce a mechanism for the row-by-row to write when there are long arrays

2022-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated ORC-1204: --- Fix Version/s: 1.7.6 1.8.0 > Introduce a mechanism for the row-by-row to write when

[jira] [Commented] (ORC-1204) Introduce a mechanism for the row-by-row to write when there are long arrays

2022-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555292#comment-17555292 ] Owen O'Malley commented on ORC-1204: The backport is trivial

[jira] [Commented] (ORC-1204) Introduce a mechanism for the row-by-row to write when there are long arrays

2022-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555285#comment-17555285 ] Owen O'Malley commented on ORC-1204: As a bug fix, can I backport this to 1.7 and 1.8? > Introduce a

[jira] [Assigned] (ORC-1204) Introduce a mechanism for the row-by-row to write when there are long arrays

2022-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-1204: -- > Introduce a mechanism for the row-by-row to write when there are long arrays >

[jira] [Commented] (ORC-1017) Create a new tool that summarizes the size of a file by column

2021-10-01 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17423128#comment-17423128 ] Owen O'Malley commented on ORC-1017: We could break down the index by column, but we'd need to read the

[jira] [Assigned] (ORC-1017) Create a new tool that summarizes the size of a file by column

2021-10-01 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-1017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-1017: -- > Create a new tool that summarizes the size of a file by column >

[jira] [Assigned] (ORC-1014) Add details when we get IOExceptions from file system

2021-09-30 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-1014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-1014: -- > Add details when we get IOExceptions from file system >

[jira] [Commented] (ORC-985) ORC branch 1.7 is producing larger files from java writer

2021-09-07 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17411400#comment-17411400 ] Owen O'Malley commented on ORC-985: --- So, the new hash table implementation causes a significant regression

[jira] [Commented] (ORC-985) ORC branch 1.7 is producing larger files from java writer

2021-09-03 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-985?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17409744#comment-17409744 ] Owen O'Malley commented on ORC-985: --- Talking to Pavan, he pointed out that we changed the dictionaries. I

[jira] [Assigned] (ORC-985) ORC branch 1.7 is producing larger files from java writer

2021-09-03 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-985: - > ORC branch 1.7 is producing larger files from java writer >

[jira] [Assigned] (ORC-984) Create new writer versions for orc 1.7 and 1.8

2021-09-03 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-984: - > Create new writer versions for orc 1.7 and 1.8 > -- > >

[jira] [Resolved] (ORC-977) Update webpages and TestVectorOrcFile.java to be more neutral

2021-08-30 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-977. --- Fix Version/s: 1.8.0 Assignee: Dongjoon Hyun Resolution: Fixed I just committed this.

[jira] [Resolved] (ORC-811) Benchmarks for Filters

2021-08-30 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-811. --- Fix Version/s: 1.7.0 Resolution: Fixed I just committed this. > Benchmarks for Filters >

[jira] [Resolved] (ORC-743) Conversion of SArg into Filters, to take advantage of LazyIO

2021-08-10 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-743. --- Fix Version/s: 1.7.0 Resolution: Fixed I just committed this. Thanks, Pavan! > Conversion of

[jira] [Commented] (ORC-906) Upgrade branch-1.6 to storage-api 2.7.3

2021-08-03 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17392584#comment-17392584 ] Owen O'Malley commented on ORC-906: --- The rationale for this is to get HIVE-25400 & HIVE-25190 fixed. >

[jira] [Assigned] (ORC-906) Upgrade branch-1.6 to storage-api 2.7.3

2021-08-03 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-906: - > Upgrade branch-1.6 to storage-api 2.7.3 > --- > >

[jira] [Commented] (ORC-846) Refactoring Memory-Manager for better extensibility

2021-07-15 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17381614#comment-17381614 ] Owen O'Malley commented on ORC-846: --- In general, it is very difficult to get a reasonable bytes/row

[jira] [Commented] (ORC-846) Refactoring Memory-Manager for better extensibility

2021-07-15 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17381580#comment-17381580 ] Owen O'Malley commented on ORC-846: --- And yes, it would be far better if Gobblin was able to write a single

[jira] [Assigned] (ORC-797) Allow writers to get the stripe information

2021-05-14 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-797: - > Allow writers to get the stripe information > --- > >

[jira] [Resolved] (ORC-790) TIMESTAMP_INSTANT should be primitive

2021-04-28 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-790. --- Fix Version/s: 1.6.8 1.7.0 Resolution: Fixed I just committed this. Thanks,

[jira] [Resolved] (ORC-758) Avoid decompressing compressed streams if already decompressed

2021-03-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-758. --- Fix Version/s: 1.7.0 Resolution: Fixed I just committed this. Thanks, Pavan! > Avoid

[jira] [Resolved] (ORC-765) Added build option to compile libraries with position independent code

2021-03-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-765. --- Fix Version/s: 1.6.8 1.7.0 Resolution: Fixed I just committed this. Thanks,

[jira] [Assigned] (ORC-765) Added build option to compile libraries with position independent code

2021-03-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-765: - Assignee: Ben Givertz > Added build option to compile libraries with position independent code >

[jira] [Assigned] (ORC-767) Add docker support for jdk 8 in debian 10

2021-03-19 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-767: - > Add docker support for jdk 8 in debian 10 > - > >

[jira] [Resolved] (ORC-694) Update docker files adding Java11 support

2021-03-18 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-694. --- Fix Version/s: 1.7.0 Resolution: Fixed I just committed this. Thanks, Yukihiro! > Update docker

[jira] [Assigned] (ORC-766) Generalize the docker scripts to handle build-args

2021-03-18 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-766: - > Generalize the docker scripts to handle build-args >

[jira] [Resolved] (ORC-755) Introduce OrcFilterContext

2021-03-04 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-755. --- Fix Version/s: 1.7.0 Resolution: Fixed I just committed this. Thanks, Pavan! > Introduce

[jira] [Resolved] (ORC-754) Code cleanup

2021-02-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-754. --- Fix Version/s: 1.7.0 Resolution: Fixed I just committed this. Thanks, Pavan! > Code cleanup >

[jira] [Updated] (ORC-50) Replace the red/black tree representation for the string dictionaries with hash tables

2021-02-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-50?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated ORC-50: - Issue Type: Improvement (was: Bug) > Replace the red/black tree representation for the string dictionaries

[jira] [Assigned] (ORC-50) Replace the red/black tree representation for the string dictionaries with hash tables

2021-02-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-50?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-50: Assignee: Lei Sun > Replace the red/black tree representation for the string dictionaries with >

[jira] [Resolved] (ORC-747) Abstract Dictionary interface and refactoring

2021-02-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-747. --- Fix Version/s: 1.7.0 Resolution: Fixed I just committed this. Thanks, Lei! > Abstract

[jira] [Assigned] (ORC-747) Abstract Dictionary interface and refactoring

2021-02-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-747: - Assignee: Lei Sun > Abstract Dictionary interface and refactoring >

[jira] [Resolved] (ORC-749) Add checkstyle to -Panalzye

2021-02-17 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-749. --- Fix Version/s: 1.7.0 Resolution: Fixed I committed this. Thanks for the review, Dongjoon! > Add

[jira] [Commented] (ORC-617) Provide facility to read ORC data from FSDataInputStream, not only from Path.

2021-02-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17285559#comment-17285559 ] Owen O'Malley commented on ORC-617: --- I commented on ORC-618, but I guess it is relevant here too: I'd

[jira] [Commented] (ORC-618) Make several methods in TreeReader public

2021-02-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-618?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17285557#comment-17285557 ] Owen O'Malley commented on ORC-618: --- I'd recommend using the org.apache.orc.util.StreamWrapperFileSystem.

[jira] [Assigned] (ORC-749) Add checkstyle to -Panalzye

2021-02-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-749: - > Add checkstyle to -Panalzye > --- > > Key: ORC-749 >

[jira] [Resolved] (ORC-737) Upgrade Spark to 3.1.0

2021-02-08 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-737. --- Fix Version/s: 1.6.8 1.7.0 Resolution: Fixed I committed this. Thanks,

[jira] [Updated] (ORC-748) Add separate writer implementation for Trino

2021-02-08 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated ORC-748: -- Fix Version/s: 1.6.8 > Add separate writer implementation for Trino >

[jira] [Commented] (ORC-741) Schema Evolution missing column is not handled in the presence of filters

2021-01-26 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17272359#comment-17272359 ] Owen O'Malley commented on ORC-741: --- I committed this. Thank you, Pavan! > Schema Evolution missing

[jira] [Updated] (ORC-741) Schema Evolution missing column is not handled in the presence of filters

2021-01-26 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated ORC-741: -- Fix Version/s: 1.7.0 > Schema Evolution missing column is not handled in the presence of filters >

[jira] [Commented] (ORC-508) Add a reader/writer that does not depend on Hadoop FileSystem

2021-01-22 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-508?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17270349#comment-17270349 ] Owen O'Malley commented on ORC-508: --- Ok, I'm going to take a look at this today. At a high level, here are

[jira] [Resolved] (ORC-710) Update maven plugins

2021-01-04 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-710. --- Fix Version/s: 1.6.7 1.7.0 Assignee: Dongjoon Hyun Resolution: Fixed

[jira] [Updated] (ORC-699) Minor improvements to the scan tool

2020-12-15 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated ORC-699: -- Description: As a follow up to ORC-697, I have a couple more adjustments. Changes: * Change the stripe id

[jira] [Assigned] (ORC-699) Minor improvements to the scan tool

2020-12-15 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-699: - > Minor improvements to the scan tool > --- > > Key:

[jira] [Assigned] (ORC-698) Add safety check for negative dictionary lengths

2020-12-14 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-698: - > Add safety check for negative dictionary lengths >

[jira] [Commented] (ORC-697) Improve Scan tool to report where files are corrupted.

2020-12-14 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249369#comment-17249369 ] Owen O'Malley commented on ORC-697: --- The output looks like: {noformat} Processing data file bad.orc

[jira] [Assigned] (ORC-697) Improve Scan tool to report where files are corrupted.

2020-12-14 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-697: - > Improve Scan tool to report where files are corrupted. >

[jira] [Commented] (ORC-299) Improve heuristics for bailing on dictionary encoding

2020-11-19 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17235763#comment-17235763 ] Owen O'Malley commented on ORC-299: --- The *orc.column.encoding.direct* was added in ORC-397 (1.5.3 and

[jira] [Commented] (ORC-299) Improve heuristics for bailing on dictionary encoding

2020-11-19 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17235759#comment-17235759 ] Owen O'Malley commented on ORC-299: --- You should be able to specify *orc.column.encoding.direct* with a

[jira] [Assigned] (ORC-674) Update docker files adding Ubuntu 20 and removing Debian 8 and Ubuntu 14

2020-10-19 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-674: - > Update docker files adding Ubuntu 20 and removing Debian 8 and Ubuntu 14 >

[jira] [Commented] (ORC-669) Reduce breaking changes in ReaderImpl.java

2020-10-02 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17206374#comment-17206374 ] Owen O'Malley commented on ORC-669: --- Although we should understand why Spark is accessing those private

[jira] [Resolved] (ORC-669) Reduce breaking changes in ReaderImpl.java

2020-10-02 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-669. --- Fix Version/s: 1.7.0 1.6.6 Assignee: Dongjoon Hyun Resolution: Fixed

[jira] [Closed] (ORC-668) Use `TestSchemaEvolution` as a test file prefix to prevent test failure

2020-10-01 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley closed ORC-668. - Released as part of 1.6.5. > Use `TestSchemaEvolution` as a test file prefix to prevent test failure >

[jira] [Resolved] (ORC-667) Positional mapping for nested struct types should not applied by default

2020-09-22 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-667. --- Fix Version/s: 1.6.5 1.7.0 1.5.12 Resolution: Fixed I just

[jira] [Assigned] (ORC-667) Positional mapping for nested struct types should not applied by default

2020-09-22 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-667: - Assignee: Dongjoon Hyun > Positional mapping for nested struct types should not applied by default

[jira] [Assigned] (ORC-664) docker image for centos7 fails to build zstd

2020-09-09 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-664: - > docker image for centos7 fails to build zstd > > >

[jira] [Resolved] (ORC-658) Fix NoClassDefFoundError during benchmark data generation

2020-09-02 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-658. --- Resolution: Fixed This has been committed. Thanks, Dongjoon! > Fix NoClassDefFoundError during

[jira] [Resolved] (ORC-623) Potentially incorrect Sarg evaluation for not(in) and not(isNull)

2020-09-02 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-623. --- Resolution: Fixed I committed this. Thanks for the review, Gang and Shardul! > Potentially incorrect

[jira] [Assigned] (ORC-623) Potentially incorrect Sarg evaluation for not(in) and not(isNull)

2020-09-01 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-623: - Fix Version/s: 1.7.0 1.6.4 1.5.11 Assignee: Owen

[jira] [Resolved] (ORC-611) Incorrect min-max stats for sub-millisecond timestamps

2020-08-31 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-611. --- Fix Version/s: 1.7.0 1.6.4 Resolution: Fixed I just committed this. Thank

[jira] [Commented] (ORC-623) Potentially incorrect Sarg evaluation for not(in) and not(isNull)

2020-08-31 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17188003#comment-17188003 ] Owen O'Malley commented on ORC-623: --- Thank you very much for the bug report with the unit test cases. That

[jira] [Resolved] (ORC-370) ORC column statistics should not use java.sql.Date

2020-08-26 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-370. --- Resolution: Duplicate I've committed the corresponding ORC-661. > ORC column statistics should not use

[jira] [Resolved] (ORC-661) DateColumnStatistics uses Date, which is not timezone agnostic.

2020-08-26 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-661. --- Fix Version/s: 1.7.0 1.6.4 Resolution: Fixed I committed this. Thanks for the

[jira] [Commented] (ORC-661) DateColumnStatistics uses Date, which is not timezone agnostic.

2020-08-24 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17183429#comment-17183429 ] Owen O'Malley commented on ORC-661: --- I've created a new issue to avoid using the mis-used ORC-370 label. >

[jira] [Assigned] (ORC-661) DateColumnStatistics uses Date, which is not timezone agnostic.

2020-08-24 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-661?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-661: - > DateColumnStatistics uses Date, which is not timezone agnostic. >

[jira] [Commented] (ORC-370) ORC column statistics should not use java.sql.Date

2020-08-24 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17183425#comment-17183425 ] Owen O'Malley commented on ORC-370: --- The PR for ORC-495 incorrectly used this Jira number, so I'm going to

[jira] [Commented] (ORC-370) ORC column statistics should not use java.sql.Date

2020-08-21 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182188#comment-17182188 ] Owen O'Malley commented on ORC-370: --- This has also come up in the Iceberg use case and needs to be fixed.

[jira] [Updated] (ORC-370) ORC column statistics should not use java.sql.Date

2020-08-21 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated ORC-370: -- Summary: ORC column statistics should not use java.sql.Date (was: ORC PPD evaluation for date should use

[jira] [Assigned] (ORC-370) ORC PPD evaluation for date should use DateWritable

2020-08-21 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-370?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-370: - Assignee: Owen O'Malley > ORC PPD evaluation for date should use DateWritable >

[jira] [Commented] (ORC-370) ORC PPD evaluation for date should use DateWritable

2020-08-21 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-370?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17182063#comment-17182063 ] Owen O'Malley commented on ORC-370: --- [~prasanth_j] was there a patch for this? The referenced PR seems to

[jira] [Resolved] (ORC-644) nested struct evolution does not respect to orc.force.positional.evolution

2020-08-21 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-644. --- Fix Version/s: 1.7.0 1.6.4 1.5.11 Resolution: Fixed Thank

[jira] [Assigned] (ORC-626) Reading Struct Column Having Multiple Fields With Same Name Causes java.io.EOFException

2020-08-21 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-626: - Assignee: Syed Shameerur Rahman > Reading Struct Column Having Multiple Fields With Same Name

[jira] [Resolved] (ORC-626) Reading Struct Column Having Multiple Fields With Same Name Causes java.io.EOFException

2020-08-21 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-626. --- Fix Version/s: 1.7.0 1.6.4 1.5.11 Resolution: Fixed I just

[jira] [Commented] (ORC-644) nested struct evolution does not respect to orc.force.positional.evolution

2020-07-21 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17162340#comment-17162340 ] Owen O'Malley commented on ORC-644: --- [~arvin.zheng] the Apache Iceberg specification is here - 

[jira] [Commented] (ORC-644) nested struct evaluation does not respect to orc.force.positional.evolution

2020-07-17 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17160194#comment-17160194 ] Owen O'Malley commented on ORC-644: --- Can I ask what your use case is? It wouldn't be hard to add a config

[jira] [Resolved] (ORC-643) Change logging of codec creation to debug

2020-06-19 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-643. --- Resolution: Fixed This was fixed by PR #516 fa9c011e13 . > Change logging of codec creation to debug >

[jira] [Updated] (ORC-643) Change logging of codec creation to debug

2020-06-19 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated ORC-643: -- Fix Version/s: 1.7.0 1.6.4 1.5.11 > Change logging of codec

[jira] [Resolved] (ORC-638) ORCMapredRecordWriter enlarge columnVector with factors when child array size is not large enough

2020-06-19 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-638. --- Fix Version/s: 1.7.0 1.6.4 1.5.11 Resolution: Fixed I just

[jira] [Comment Edited] (ORC-641) orc-core includes packages from io.airlift.slice

2020-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17137895#comment-17137895 ] Owen O'Malley edited comment on ORC-641 at 6/16/20, 7:49 PM: - The classes were

[jira] [Commented] (ORC-641) orc-core includes packages from io.airlift.slice

2020-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17137895#comment-17137895 ] Owen O'Malley commented on ORC-641: --- The classes were copied because until aircompressor was upgraded to

[jira] [Updated] (ORC-609) Upgrade aircompressor to 0.16

2020-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated ORC-609: -- Fix Version/s: 1.6.4 This fix is also required in the 1.6 line so that we can fix Presto & Iceberg. >

[jira] [Assigned] (ORC-641) orc-core includes packages from io.airlift.slice

2020-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-641: - Assignee: David Phillips > orc-core includes packages from io.airlift.slice >

[jira] [Resolved] (ORC-641) orc-core includes packages from io.airlift.slice

2020-06-16 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-641?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-641. --- Fix Version/s: 1.7.0 1.6.4 Resolution: Fixed I just committed this. Thanks,

[jira] [Assigned] (ORC-637) create a new recovery tools that handles missing blocks

2020-05-17 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-637: - > create a new recovery tools that handles missing blocks >

[jira] [Resolved] (ORC-631) Add guava dependency to tools jar

2020-05-13 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-631. --- Resolution: Fixed > Add guava dependency to tools jar > - > >

[jira] [Resolved] (ORC-543) libprotobuf-lite compile fails on Ubuntu 16/Mint

2020-05-13 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-543. --- Resolution: Cannot Reproduce > libprotobuf-lite compile fails on Ubuntu 16/Mint >

[jira] [Assigned] (ORC-635) Add some improvements to the random data generator

2020-05-13 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-635: - > Add some improvements to the random data generator >

[jira] [Assigned] (ORC-634) Fix the json output for double NaN and infinite

2020-05-13 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-634: - > Fix the json output for double NaN and infinite > --- >

[jira] [Resolved] (ORC-622) Refactoring of TreeReader into TypeReader and BatchReader

2020-05-07 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-622. --- Fix Version/s: 1.7.0 Resolution: Fixed I just committed this. Thanks, Pavan! > Refactoring of

[jira] [Resolved] (ORC-628) Add a new java tool to count rows from ORC files under a directory

2020-05-07 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-628. --- Fix Version/s: 1.7.0 Resolution: Fixed Thanks for the review, Dongjoon! > Add a new java tool

[jira] [Resolved] (ORC-630) Fix orc-tools uber jar by adding guava dependency back

2020-05-07 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-630. --- Fix Version/s: 1.7.0 Assignee: Dongjoon Hyun Resolution: Fixed I just committed this.

[jira] [Assigned] (ORC-631) Add guava dependency to tools jar

2020-05-07 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-631: - > Add guava dependency to tools jar > - > > Key:

[jira] [Assigned] (ORC-628) Add a new java tool to count rows from ORC files under a directory

2020-05-06 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-628: - > Add a new java tool to count rows from ORC files under a directory >

[jira] [Assigned] (ORC-616) In Patched Base encoding, the value of headerThirdByte goes beyond the range of byte

2020-04-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-616: - Assignee: Ruochen Zou > In Patched Base encoding, the value of headerThirdByte goes beyond the

[jira] [Assigned] (ORC-616) In Patched Base encoding, the value of headerThirdByte goes beyond the range of byte

2020-04-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-616: - Assignee: (was: Prasanth Jayachandran) > In Patched Base encoding, the value of

[jira] [Resolved] (ORC-616) In Patched Base encoding, the value of headerThirdByte goes beyond the range of byte

2020-04-23 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley resolved ORC-616. --- Fix Version/s: 1.6.3 1.5.10 1.4.6 Resolution: Fixed I just

[jira] [Assigned] (ORC-622) Refactoring of TreeReader into TypeReader and BatchReader

2020-04-22 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-622: - Assignee: Pavan Lanka > Refactoring of TreeReader into TypeReader and BatchReader >

[jira] [Assigned] (ORC-621) Need reader fix for ORC-569

2020-04-20 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley reassigned ORC-621: - > Need reader fix for ORC-569 > --- > > Key: ORC-621 >

[jira] [Commented] (ORC-620) Modify the row filter API to use BiFunction

2020-04-20 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17087831#comment-17087831 ] Owen O'Malley commented on ORC-620: ---  [~pgaref] wrote HIVE-22959 and I did HIVE-23215 precisely so that

[jira] [Commented] (ORC-620) Modify the row filter API to use BiFunction

2020-04-19 Thread Owen O'Malley (Jira)
[ https://issues.apache.org/jira/browse/ORC-620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17087210#comment-17087210 ] Owen O'Malley commented on ORC-620: --- My proposed interface would look like: {code:java} /** *
