[jira] [Resolved] (IMPALA-2935) Introduce database level auto create/update statistics

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-2935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-2935. --- Resolution: Won't Do This assumed a particular design for the epic, will just close the

[jira] [Resolved] (IMPALA-744) Return estimated run time in the Explain plan

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-744. -- Resolution: Later > Return estimated run time in the Explain plan >

[jira] [Resolved] (IMPALA-452) Add support for string concatenation operator using || construct

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-452. -- Fix Version/s: Impala 4.0 Resolution: Fixed > Add support for string concatenation

[jira] [Resolved] (IMPALA-4489) Remove the ident_or_default non-terminal from the parser

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-4489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-4489. --- Resolution: Won't Fix > Remove the ident_or_default non-terminal from the parser >

[jira] [Resolved] (IMPALA-4958) Simplify binary predicates in the FE

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-4958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-4958. --- Resolution: Later > Simplify binary predicates in the FE >

[jira] [Resolved] (IMPALA-5727) Join Order Optimization time increases non-linearly with the number of tables

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-5727. --- Resolution: Later Too open ended. > Join Order Optimization time increases non-linearly

[jira] [Resolved] (IMPALA-4973) Convert UnionStmt class into SetOperationStmt

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-4973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-4973. --- Fix Version/s: Impala 4.0 Resolution: Fixed Commit

[jira] [Resolved] (IMPALA-5726) Impala Planning time on Kudu table is much longer than HDFS table

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-5726. --- Resolution: Cannot Reproduce Not really enough info. IMPALA-9903 is something that improved

[jira] [Updated] (IMPALA-5607) Add additional units to EXTRACT, DATE_PART, TRUNC for temporal data types

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-5607: -- Component/s: (was: Frontend) > Add additional units to EXTRACT, DATE_PART, TRUNC for

[jira] [Assigned] (IMPALA-5607) Add additional units to EXTRACT, DATE_PART, TRUNC for temporal data types

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-5607: - Assignee: (was: Jin Chul Kim) > Add additional units to EXTRACT, DATE_PART, TRUNC

[jira] [Resolved] (IMPALA-5637) Hive will start parsing "EXTERNAL" tbl property as case insensitive

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-5637. --- Resolution: Won't Do > Hive will start parsing "EXTERNAL" tbl property as case insensitive

[jira] [Work stopped] (IMPALA-5607) Add additional units to EXTRACT, DATE_PART, TRUNC for temporal data types

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on IMPALA-5607 stopped by Tim Armstrong. - > Add additional units to EXTRACT, DATE_PART, TRUNC for temporal data types

[jira] [Resolved] (IMPALA-6145) Assign runtime filter ids lazily

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-6145. --- Resolution: Won't Fix > Assign runtime filter ids lazily >

[jira] [Resolved] (IMPALA-6440) Impala cannot read / write HBase tables when metadata is created with newer versions of Hive

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-6440. --- Resolution: Cannot Reproduce > Impala cannot read / write HBase tables when metadata is

[jira] [Resolved] (IMPALA-6445) Whitespace should be stripped or detected in kudu master addresses metadata

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-6445. --- Fix Version/s: Impala 2.12.0 Resolution: Fixed > Whitespace should be stripped or

[jira] [Updated] (IMPALA-7287) Speed up printing of tab delimited output in impala-shell

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7287: -- Component/s: Clients > Speed up printing of tab delimited output in impala-shell >

[jira] [Updated] (IMPALA-10325) Parquet scan should use min/max statistics to skip pages based on equi-join predicate

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-10325: --- Component/s: Backend > Parquet scan should use min/max statistics to skip pages based on

[jira] [Updated] (IMPALA-10210) Avoid authentication for connection from a trusted domain over http

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-10210: --- Component/s: Clients > Avoid authentication for connection from a trusted domain over http

[jira] [Updated] (IMPALA-10354) impala-shell hs2-http 3x slower than hs2 with high-latency network

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-10354: --- Component/s: Clients > impala-shell hs2-http 3x slower than hs2 with high-latency network

[jira] [Resolved] (IMPALA-10210) Avoid authentication for connection from a trusted domain over http

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-10210. Fix Version/s: Impala 4.0 Resolution: Fixed > Avoid authentication for connection

[jira] [Updated] (IMPALA-10319) Support arbitrary encodings on Text/Sequence files

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-10319: --- Component/s: Backend > Support arbitrary encodings on Text/Sequence files >

[jira] [Updated] (IMPALA-10355) DROP FUNCTION IF EXISTS taking 4-5 sec to execute when function does not exist

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-10355: --- Component/s: Catalog > DROP FUNCTION IF EXISTS taking 4-5 sec to execute when function

[jira] [Updated] (IMPALA-7674) Impala should compress older log files

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7674?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7674: -- Component/s: Backend > Impala should compress older log files >

[jira] [Updated] (IMPALA-8166) ParquetBytesReadPerColumn is displayed for non-Parquet scans

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-8166: -- Labels: observability (was: ) > ParquetBytesReadPerColumn is displayed for non-Parquet scans

[jira] [Updated] (IMPALA-8166) ParquetBytesReadPerColumn is displayed for non-Parquet scans

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-8166: -- Component/s: Backend > ParquetBytesReadPerColumn is displayed for non-Parquet scans >

[jira] [Updated] (IMPALA-7712) Impala read from and write to GCS

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7712: -- Component/s: Backend > Impala read from and write to GCS > -

[jira] [Updated] (IMPALA-7501) Slim down metastore Partition objects in LocalCatalog cache

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7501: -- Component/s: Catalog > Slim down metastore Partition objects in LocalCatalog cache >

[jira] [Work stopped] (IMPALA-7501) Slim down metastore Partition objects in LocalCatalog cache

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on IMPALA-7501 stopped by Tim Armstrong. - > Slim down metastore Partition objects in LocalCatalog cache >

[jira] [Updated] (IMPALA-7229) Include DataSinks in swimlanes

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7229: -- Component/s: Backend > Include DataSinks in swimlanes > -- > >

[jira] [Updated] (IMPALA-7230) Include codegen in swimlanes

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7230: -- Component/s: Backend > Include codegen in swimlanes > > >

[jira] [Updated] (IMPALA-7489) Run tests with -Xcheck:jni enabled for debug mode

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7489: -- Component/s: Infrastructure > Run tests with -Xcheck:jni enabled for debug mode >

[jira] [Resolved] (IMPALA-6923) Update/Cleanup $IMPALA_HOME/tests/benchmark folder

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-6923. --- Resolution: Fixed > Update/Cleanup $IMPALA_HOME/tests/benchmark folder >

[jira] [Updated] (IMPALA-7219) 7.5% of Catalog Server heap wasted by empty HashMaps and ArrayLists

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7219: -- Priority: Minor (was: Major) > 7.5% of Catalog Server heap wasted by empty HashMaps and

[jira] [Resolved] (IMPALA-6207) Avro - new column added from impala does not show up in describe on impala

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-6207. --- Resolution: Not A Problem I believe this is the intended behaviour when you specify the

[jira] [Updated] (IMPALA-7219) 7.5% of Catalog Server heap wasted by empty HashMaps and ArrayLists

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7219: -- Component/s: Catalog > 7.5% of Catalog Server heap wasted by empty HashMaps and ArrayLists >

[jira] [Assigned] (IMPALA-7219) 7.5% of Catalog Server heap wasted by empty HashMaps and ArrayLists

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-7219: - Assignee: (was: Misha Dmitriev) > 7.5% of Catalog Server heap wasted by empty

[jira] [Updated] (IMPALA-6458) Create scan ranges in the coordinator

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-6458: -- Component/s: Frontend > Create scan ranges in the coordinator >

[jira] [Updated] (IMPALA-6604) rethink stress test binary search cutoff point

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-6604: -- Component/s: Infrastructure > rethink stress test binary search cutoff point >

[jira] [Resolved] (IMPALA-6797) Update Sentry version for ULP

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-6797. --- Resolution: Won't Do > Update Sentry version for ULP > - > >

[jira] [Updated] (IMPALA-6828) Expose more detailed info in profile for REFRESH

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-6828: -- Component/s: Catalog > Expose more detailed info in profile for REFRESH >

[jira] [Resolved] (IMPALA-7133) Beeswax methods return Default TException instead of real exception

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-7133. --- Resolution: Won't Fix We're moving away from beeswax so not worth fixing at this point -

[jira] [Updated] (IMPALA-7711) Hash code improvements for CatalogdMetaProvider

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-7711: -- Component/s: Catalog > Hash code improvements for CatalogdMetaProvider >

[jira] [Resolved] (IMPALA-8080) Improve planner to use disk attributes when applicable

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-8080. --- Resolution: Later > Improve planner to use disk attributes when applicable >

[jira] [Resolved] (IMPALA-8709) Add Damerau-Levenshtein edit distance built-in function

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-8709. --- Resolution: Fixed > Add Damerau-Levenshtein edit distance built-in function >

[jira] [Updated] (IMPALA-8709) Add Damerau-Levenshtein edit distance built-in function

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8709?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-8709: -- Fix Version/s: Impala 3.4.0 > Add Damerau-Levenshtein edit distance built-in function >

[jira] [Commented] (IMPALA-10400) Floor/Ceil/Trunc should return BIGINT instead of DOUBLE

2020-12-21 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17252964#comment-17252964 ] Tim Armstrong commented on IMPALA-10400: There's no real right answer here. I don't think we

[jira] [Commented] (IMPALA-10343) control_service_queue_mem_limit default is too low for large clusters

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17250088#comment-17250088 ] Tim Armstrong commented on IMPALA-10343: [~guojingfeng] we never saw any problems in the

[jira] [Resolved] (IMPALA-1638) Investigate using c++ templates option when generating thrift and increasing the transport buffer

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-1638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-1638. --- Resolution: Won't Do I was able to get this working but it didn't have a noticeable impact

[jira] [Resolved] (IMPALA-10390) impala-profile-tool JSON output

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-10390. Fix Version/s: Impala 4.0 Resolution: Fixed > impala-profile-tool JSON output >

[jira] [Commented] (IMPALA-9382) Prototype denser runtime profile implementation

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249938#comment-17249938 ] Tim Armstrong commented on IMPALA-9382: --- Avoiding the virtual function calls while decoding thrift

[jira] [Assigned] (IMPALA-1695) impala-shell pretty-printing is slow

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-1695?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-1695: - Assignee: (was: Henry Robinson) > impala-shell pretty-printing is slow >

[jira] [Assigned] (IMPALA-1638) Investigate using c++ templates option when generating thrift and increasing the transport buffer

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-1638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-1638: - Assignee: Tim Armstrong > Investigate using c++ templates option when generating

[jira] [Updated] (IMPALA-4568) Cache Parquet footer cache to speedup scans & predicate evaluation against Min/Max indexes

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-4568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-4568: -- Description: Implement an LRU based footer cache for Parquet to speedup scans & predicate

[jira] [Commented] (IMPALA-4568) Cache Parquet footer cache to speedup scans & predicate evaluation against Min/Max indexes

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-4568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249913#comment-17249913 ] Tim Armstrong commented on IMPALA-4568: --- IMPALA-1638 might improve the thrift decoding time >

[jira] [Commented] (IMPALA-4568) Cache Parquet footer cache to speedup scans & predicate evaluation against Min/Max indexes

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-4568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249912#comment-17249912 ] Tim Armstrong commented on IMPALA-4568: --- IMPALA-8341 caches remote reads, which can benefit a lot

[jira] [Resolved] (IMPALA-1747) Improve the response time for Invalidate Metadata

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-1747. --- Resolution: Cannot Reproduce > Improve the response time for Invalidate Metadata >

[jira] [Assigned] (IMPALA-4568) Cache Parquet footer cache to speedup scans & predicate evaluation against Min/Max indexes

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-4568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-4568: - Assignee: (was: Michael Ho) > Cache Parquet footer cache to speedup scans &

[jira] [Resolved] (IMPALA-4045) Catalog cache update should not be tied to statestore update frequency

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-4045. --- Resolution: Won't Fix This general problem is solved by the local catalog - IMPALA-7127 >

[jira] [Resolved] (IMPALA-5767) Automated perf job which doesn't rely on OS buffer cache

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-5767. --- Resolution: Invalid > Automated perf job which doesn't rely on OS buffer cache >

[jira] [Commented] (IMPALA-6834) Enforce consistent, pseudo-random replica order during local, non-random scheduling

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249906#comment-17249906 ] Tim Armstrong commented on IMPALA-6834: --- [~joemcdonnell] do you know if this is still a problem? I

[jira] [Assigned] (IMPALA-5961) Test data for TPC-DS schema contains a non-Unicode character

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-5961: - Assignee: (was: Tim Wood) > Test data for TPC-DS schema contains a non-Unicode

[jira] [Resolved] (IMPALA-6087) Revisit tests withheld from TPC-DS suite for use of TRUNCATE

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6087?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-6087. --- Resolution: Invalid > Revisit tests withheld from TPC-DS suite for use of TRUNCATE >

[jira] [Resolved] (IMPALA-8077) Avoid converting timestamps in dropped rows during Parquet scanning

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-8077. --- Resolution: Won't Fix We should focus on IMPALA-2017 instead > Avoid converting

[jira] [Resolved] (IMPALA-1728) sub-query with duplicate values used IN conditional operator should discard the duplicate values before applying the operator

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-1728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-1728. --- Resolution: Duplicate IMPALA-1270 implemented this and changed the plan for TPC-DS Q95 >

[jira] [Resolved] (IMPALA-9637) Scan range load-balancing within backend

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-9637. --- Resolution: Duplicate > Scan range load-balancing within backend >

[jira] [Resolved] (IMPALA-3101) AnalyticEvalNode should use codegened TupleRowComparator instead of PrevRowCompare

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-3101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-3101. --- Resolution: Duplicate > AnalyticEvalNode should use codegened TupleRowComparator instead of

[jira] [Commented] (IMPALA-1706) Join returning single distinct column unnecessarily computes cross product

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249901#comment-17249901 ] Tim Armstrong commented on IMPALA-1706: --- Note that the rewrite would be: {noformat} select

[jira] [Resolved] (IMPALA-9890) Add more TPCDS queries to Impala's test suite

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-9890. --- Resolution: Duplicate > Add more TPCDS queries to Impala's test suite >

[jira] [Resolved] (IMPALA-5960) Add TPC-DS reason table to data load and enable q85 and q93

2020-12-15 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-5960. --- Fix Version/s: Impala 4.0 Resolution: Fixed Fixed by IMPALA-8291 > Add TPC-DS

[jira] [Commented] (IMPALA-10394) Container for impala-shell

2020-12-14 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17249379#comment-17249379 ] Tim Armstrong commented on IMPALA-10394: https://gerrit.cloudera.org/#/c/15966 actually does

[jira] [Resolved] (IMPALA-6361) File handle cache should be shared across multiple IO threads

2020-12-14 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-6361. --- Resolution: Duplicate > File handle cache should be shared across multiple IO threads >

[jira] [Work stopped] (IMPALA-9433) Change FileHandleCache from using a multimap to an unordered_map

2020-12-14 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on IMPALA-9433 stopped by Tim Armstrong. - > Change FileHandleCache from using a multimap to an unordered_map >

[jira] [Assigned] (IMPALA-5212) consider switching to pread by default

2020-12-14 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-5212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-5212: - Assignee: (was: Sahil Takiar) > consider switching to pread by default >

[jira] [Updated] (IMPALA-2603) Incorrect results and plan for inline view referencing several collection types correlated with different ancestor blocks

2020-12-11 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-2603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-2603: -- Labels: complextype correctness crash downgraded nested_types query_generator ramp-up (was:

[jira] [Updated] (IMPALA-8721) Wrong result when Impala reads a Hive written parquet TimeStamp column

2020-12-11 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-8721: -- Priority: Critical (was: Major) > Wrong result when Impala reads a Hive written parquet

[jira] [Updated] (IMPALA-8721) Wrong result when Impala reads a Hive written parquet TimeStamp column

2020-12-11 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-8721: -- Labels: Interoperability correctness hive impala parquet timestamp (was: Interoperability

[jira] [Assigned] (IMPALA-7816) Race condition in HdfsScanNodeBase::StopAndFinalizeCounters

2020-12-11 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-7816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-7816: - Assignee: (was: Sahil Takiar) > Race condition in

[jira] [Assigned] (IMPALA-8131) Impala is unable to read Parquet decimal columns with higher scale than table metadata

2020-12-11 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-8131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-8131: - Assignee: (was: Sahil Takiar) > Impala is unable to read Parquet decimal columns

[jira] [Assigned] (IMPALA-3430) Runtime filter : Extend runtime filter to support Min/Max values for HDFS scans

2020-12-10 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-3430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-3430: - Assignee: Qifan Chen > Runtime filter : Extend runtime filter to support Min/Max

[jira] [Resolved] (IMPALA-10343) control_service_queue_mem_limit default is too low for large clusters

2020-12-10 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-10343. Fix Version/s: Impala 4.0 Resolution: Fixed > control_service_queue_mem_limit

[jira] [Created] (IMPALA-10390) impala-profile-tool JSON output

2020-12-10 Thread Tim Armstrong (Jira)
Tim Armstrong created IMPALA-10390: -- Summary: impala-profile-tool JSON output Key: IMPALA-10390 URL: https://issues.apache.org/jira/browse/IMPALA-10390 Project: IMPALA Issue Type:

[jira] [Created] (IMPALA-10389) Container for impala-profile-tool

2020-12-10 Thread Tim Armstrong (Jira)
Tim Armstrong created IMPALA-10389: -- Summary: Container for impala-profile-tool Key: IMPALA-10389 URL: https://issues.apache.org/jira/browse/IMPALA-10389 Project: IMPALA Issue Type:

[jira] [Work started] (IMPALA-10343) control_service_queue_mem_limit default is too low for large clusters

2020-12-09 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on IMPALA-10343 started by Tim Armstrong. -- > control_service_queue_mem_limit default is too low for large clusters >

[jira] [Assigned] (IMPALA-10343) control_service_queue_mem_limit default is too low for large clusters

2020-12-09 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-10343: -- Assignee: Tim Armstrong > control_service_queue_mem_limit default is too low for

[jira] [Commented] (IMPALA-10382) Predicate with coalesce on both sides of LOJ isn't NULL filtering

2020-12-08 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17245988#comment-17245988 ] Tim Armstrong commented on IMPALA-10382: IMPALA-10252 is a related bug. > Predicate with

[jira] [Resolved] (IMPALA-10252) Query returns less number of rows with run-time filtering on integer column in a subquery against functional_parquet schema

2020-12-08 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-10252. Fix Version/s: Impala 4.0 Resolution: Fixed > Query returns less number of rows

[jira] [Updated] (IMPALA-10382) Predicate with coalesce on both sides of LOJ isn't NULL filtering

2020-12-07 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-10382: --- Priority: Critical (was: Major) > Predicate with coalesce on both sides of LOJ isn't NULL

[jira] [Updated] (IMPALA-10382) Predicate with coalesce on both sides of LOJ isn't NULL filtering

2020-12-07 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-10382: --- Labels: correctness (was: ) > Predicate with coalesce on both sides of LOJ isn't NULL

[jira] [Updated] (IMPALA-10382) Predicate with coalesce on both sides of LOJ isn't NULL filtering

2020-12-07 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-10382: --- Target Version: Impala 4.0 > Predicate with coalesce on both sides of LOJ isn't NULL

[jira] [Assigned] (IMPALA-10377) Improve the accuracy of resource estimation

2020-12-07 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-10377: -- Assignee: liuyao (was: Tim Armstrong) > Improve the accuracy of resource

[jira] [Assigned] (IMPALA-10377) Improve the accuracy of resource estimation

2020-12-07 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong reassigned IMPALA-10377: -- Assignee: Tim Armstrong (was: liuyao) > Improve the accuracy of resource

[jira] [Resolved] (IMPALA-10295) Fix analytic limit pushdown when no predicates are present

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-10295. Fix Version/s: Impala 4.0 Resolution: Fixed > Fix analytic limit pushdown when no

[jira] [Updated] (IMPALA-6434) Add support to decode RLE_DICTIONARY encoded pages

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-6434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-6434: -- Labels: newbie parquet ramp-up (was: parquet) > Add support to decode RLE_DICTIONARY encoded

[jira] [Commented] (IMPALA-9884) TestAdmissionControllerStress.test_mem_limit failing occasionally

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17244283#comment-17244283 ] Tim Armstrong commented on IMPALA-9884: --- I tried just making the query run longer but ran into

[jira] [Updated] (IMPALA-9884) TestAdmissionControllerStress.test_mem_limit failing occasionally

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong updated IMPALA-9884: -- Priority: Critical (was: Blocker) > TestAdmissionControllerStress.test_mem_limit failing

[jira] [Commented] (IMPALA-10378) Retire support for Debian 8

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17244253#comment-17244253 ] Tim Armstrong commented on IMPALA-10378: Makes sense > Retire support for Debian 8 >

[jira] [Resolved] (IMPALA-9058) S3 tests failing with FileNotFoundException getVersionMarkerItem on ../VERSION

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Armstrong resolved IMPALA-9058. --- Resolution: Won't Fix We will be disabling S3guard for tests since S3 now has strong

[jira] [Commented] (IMPALA-9453) S3 build failed with many strange symptoms

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17244202#comment-17244202 ] Tim Armstrong commented on IMPALA-9453: --- We may be able to close this - we will be disabling

[jira] [Commented] (IMPALA-9058) S3 tests failing with FileNotFoundException getVersionMarkerItem on ../VERSION

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-9058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17244204#comment-17244204 ] Tim Armstrong commented on IMPALA-9058: --- We may be able to close this - we will be disabling

[jira] [Commented] (IMPALA-10251) test_decimal_queries and test_tpcds_queries may run out of memory on ASAN builds

2020-12-04 Thread Tim Armstrong (Jira)
[ https://issues.apache.org/jira/browse/IMPALA-10251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17244198#comment-17244198 ] Tim Armstrong commented on IMPALA-10251: I'm not going to have time to investigate. I think
