[jira] [Updated] (IMPALA-13035) Querying metadata tables from non-Iceberg tables throws IllegalArgumentException

2024-05-06 Thread Quanlong Huang (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Quanlong Huang updated IMPALA-13035:

Fix Version/s: Impala 4.5.0

> Querying metadata tables from non-Iceberg tables throws 
> IllegalArgumentException
> 
>
> Key: IMPALA-13035
> URL: https://issues.apache.org/jira/browse/IMPALA-13035
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.3.0
>Reporter: Peter Rozsa
>Assignee: Daniel Becker
>Priority: Minor
>  Labels: impala-iceberg
> Fix For: Impala 4.5.0
>
>
> If a query targets an Iceberg metadata table like default.xy.`files` and the 
> xy table is not an Iceberg table then the analyzer throws 
> IllegalArgumentException. 
> The main concern is that IcebergMetadataTable.java:isIcebergMetadataTable is 
> called before it's validated that the table is indeed an IcebergTable.
> Example: 
> {code:java}
> create table xy(a int);
> select * from default.xy.`files`;{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-13009) Potential leak of partition deletions in the catalog topic

2024-05-06 Thread Quanlong Huang (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Quanlong Huang resolved IMPALA-13009.
-
Fix Version/s: Impala 4.5.0
   Resolution: Fixed

> Potential leak of partition deletions in the catalog topic
> --
>
> Key: IMPALA-13009
> URL: https://issues.apache.org/jira/browse/IMPALA-13009
> Project: IMPALA
>  Issue Type: Bug
>  Components: Catalog
>Affects Versions: Impala 4.0.0, Impala 4.1.0, Impala 4.2.0, Impala 4.1.1, 
> Impala 4.1.2, Impala 4.3.0
>Reporter: Quanlong Huang
>Assignee: Quanlong Huang
>Priority: Critical
> Fix For: Impala 4.5.0
>
>
> Catalogd might not send partition deletions to the catalog topic in the 
> following scenario:
> * Some partitions of a table are dropped.
> * The HdfsTable object is removed sequentially before catalogd collects the 
> dropped partitions.
> In such case, catalogd loses track of the dropped partitions so their updates 
> keep existing in the catalog topic, until the partition names are reused 
> again.
> Note that the HdfsTable object can be removed by commands like DropTable or 
> INVALIDATE.
> The leaked partitions will be detected when a coordinator restarts. An 
> IllegalStateException complaining stale partitions will be reported, causing 
> the table not being added to the catalog cache of coordinator.
> {noformat}
> E0417 16:41:22.317298 20746 ImpaladCatalog.java:264] Error adding catalog 
> object: Received stale partition in a statestore update: 
> THdfsPartition(partitionKeyExprs:[TExpr(nodes:[TExprNode(node_type:INT_LITERAL,
>  type:TColumnType(types:[TTypeNode(type:SCALAR, 
> scalar_type:TScalarType(type:INT))]), num_children:0, is_constant:true, 
> int_literal:TIntLiteral(value:106), is_codegen_disabled:false)])], 
> location:THdfsPartitionLocation(prefix_index:0, suffix:p=106), id:138, 
> file_desc:[THdfsFileDesc(file_desc_data:18 00 00 00 00 00 00 00 00 00 0E 00 
> 1C 00 18 00 10 00 00 00 08 00 04 00 0E 00 00 00 18 00 00 00 8B 0E 2D EB 8E 01 
> 00 00 04 00 00 00 00 00 00 00 0C 00 00 00 01 00 00 00 4C 00 00 00 36 00 00 00 
> 34 34 34 37 62 35 66 34 62 30 65 64 66 64 65 31 2D 32 33 33 61 64 62 38 35 30 
> 30 30 30 30 30 30 30 5F 36 36 34 31 30 39 33 37 33 5F 64 61 74 61 2E 30 2E 74 
> 78 74 00 00 0C 00 14 00 00 00 0C 00...)], access_level:READ_WRITE, 
> stats:TTableStats(num_rows:-1), is_marked_cached:false, 
> hms_parameters:{transient_lastDdlTime=1713342582, totalSize=4, 
> numFilesErasureCoded=0, numFiles=1}, num_blocks:1, total_file_size_bytes:4, 
> has_incremental_stats:false, write_id:0, db_name:default, tbl_name:my_part, 
> partition_name:p=106, 
> hdfs_storage_descriptor:THdfsStorageDescriptor(lineDelim:10, fieldDelim:1, 
> collectionDelim:1, mapKeyDelim:1, escapeChar:0, quoteChar:1, fileFormat:TEXT, 
> blockSize:0))
> Java exception follows:
> java.lang.IllegalStateException: Received stale partition in a statestore 
> update: 
> THdfsPartition(partitionKeyExprs:[TExpr(nodes:[TExprNode(node_type:INT_LITERAL,
>  type:TColumnType(types:[TTypeNode(type:SCALAR, 
> scalar_type:TScalarType(type:INT))]), num_children:0, is_constant:true, 
> int_literal:TIntLiteral(value:106), is_codegen_disabled:false)])], 
> location:THdfsPartitionLocation(prefix_index:0, suffix:p=106), id:138, 
> file_desc:[THdfsFileDesc(file_desc_data:18 00 00 00 00 00 00 00 00 00 0E 00 
> 1C 00 18 00 10 00 00 00 08 00 04 00 0E 00 00 00 18 00 00 00 8B 0E 2D EB 8E 01 
> 00 00 04 00 00 00 00 00 00 00 0C 00 00 00 01 00 00 00 4C 00 00 00 36 00 00 00 
> 34 34 34 37 62 35 66 34 62 30 65 64 66 64 65 31 2D 32 33 33 61 64 62 38 35 30 
> 30 30 30 30 30 30 30 5F 36 36 34 31 30 39 33 37 33 5F 64 61 74 61 2E 30 2E 74 
> 78 74 00 00 0C 00 14 00 00 00 0C 00...)], access_level:READ_WRITE, 
> stats:TTableStats(num_rows:-1), is_marked_cached:false, 
> hms_parameters:{transient_lastDdlTime=1713342582, totalSize=4, 
> numFilesErasureCoded=0, numFiles=1}, num_blocks:1, total_file_size_bytes:4, 
> has_incremental_stats:false, write_id:0, db_name:default, tbl_name:my_part, 
> partition_name:p=106, 
> hdfs_storage_descriptor:THdfsStorageDescriptor(lineDelim:10, fieldDelim:1, 
> collectionDelim:1, mapKeyDelim:1, escapeChar:0, quoteChar:1, fileFormat:TEXT, 
> blockSize:0))
> at 
> com.google.common.base.Preconditions.checkState(Preconditions.java:512)
> at 
> org.apache.impala.catalog.ImpaladCatalog.addTable(ImpaladCatalog.java:523)
> at 
> org.apache.impala.catalog.ImpaladCatalog.addCatalogObject(ImpaladCatalog.java:334)
> at 
> org.apache.impala.catalog.ImpaladCatalog.updateCatalog(ImpaladCatalog.java:262)
> at 
> org.apache.impala.service.FeCatalogManager$CatalogdImpl.updateCatalogCache(FeCatalogManager.java:120)
> at 
> 

[jira] [Commented] (IMPALA-13058) TestRuntimeFilters.test_basic_filters failed on ARM builds

2024-05-06 Thread Wenzhe Zhou (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844132#comment-17844132
 ] 

Wenzhe Zhou commented on IMPALA-13058:
--

Thanks Riza to look in this.  The unit-test was very old test. It failed only 
for ARM builds, which were added to nightly builds since last week.

> TestRuntimeFilters.test_basic_filters failed on ARM builds
> --
>
> Key: IMPALA-13058
> URL: https://issues.apache.org/jira/browse/IMPALA-13058
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Reporter: Wenzhe Zhou
>Assignee: Riza Suminto
>Priority: Major
>
> Stacktrace:
> query_test/test_runtime_filters.py:98: in test_basic_filters
> self.run_test_case('QueryTest/runtime_filters', vector, 
> test_file_vars=vars)
> /data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/impala_test_suite.py:851:
>  in run_test_case
> update_section=pytest.config.option.update_results)
> /data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/test_result_verifier.py:701:
>  in verify_runtime_profile
> actual))
> E   AssertionError: Did not find matches for lines in runtime profile:
> E   EXPECTED LINES:
> E   row_regex: .*REMOTE.*ms.*ms.*true



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13009) Potential leak of partition deletions in the catalog topic

2024-05-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844099#comment-17844099
 ] 

ASF subversion and git services commented on IMPALA-13009:
--

Commit ee21427d26620b40d38c706b4944d2831f84f6f5 in impala's branch 
refs/heads/master from stiga-huang
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=ee21427d2 ]

IMPALA-13009: Fix catalogd not sending deletion updates for some dropped 
partitions

*Background*

Since IMPALA-3127, catalogd sends incremental partition updates based on
the last sent table snapshot ('maxSentPartitionId_' to be specific).
Dropped partitions since the last catalog update are tracked in
'droppedPartitions_' of HdfsTable. When catalogd collects the next
catalog update, they will be collected. HdfsTable then clears the set.
See details in CatalogServiceCatalog#addHdfsPartitionsToCatalogDelta().

If an HdfsTable is invalidated, it's replaced with an IncompleteTable
which doesn't track any partitions. The HdfsTable object is then added
to the deleteLog so catalogd can send deletion updates for all its
partitions. The same if the HdfsTable is dropped. However, the
previously dropped partitions are not collected in this case, which
results in a leak in the catalog topic if the partition name is not
reused anymore. Note that in the catalog topic, the key of a partition
update consists of the table name and the partition name. So if the
partition is added back to the table, the topic key will be reused then
resolves the leak.

The leak will be observed when a coordinator restarts. In the initial
catalog update sent from statestore, coordinator will find some
partition updates that are not referenced by the HdfsTable (assuming the
table is used again after the INVALIDATE). Then a Precondition check
fails and the table is not added to the coordinator.

*Overview of the patch*

This patch fixes the leak by also collecting the dropped partitions when
adding the HdfsTable to the deleteLog. A new field, dropped_partitions,
is added in THdfsTable to collect them. It's only used when catalogd
collects catalog updates.

Removes the Precondition check in coordinator and just reports the stale
partitions since IMPALA-12831 could also introduce them.

Also adds a log line in CatalogOpExecutor.alterTableDropPartition() to
show the dropped partition names for better diagnostics.

Tests
 - Added e2e tests

Change-Id: I12a68158dca18ee48c9564ea16b7484c9f5b5d21
Reviewed-on: http://gerrit.cloudera.org:8080/21326
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> Potential leak of partition deletions in the catalog topic
> --
>
> Key: IMPALA-13009
> URL: https://issues.apache.org/jira/browse/IMPALA-13009
> Project: IMPALA
>  Issue Type: Bug
>  Components: Catalog
>Affects Versions: Impala 4.0.0, Impala 4.1.0, Impala 4.2.0, Impala 4.1.1, 
> Impala 4.1.2, Impala 4.3.0
>Reporter: Quanlong Huang
>Assignee: Quanlong Huang
>Priority: Critical
>
> Catalogd might not send partition deletions to the catalog topic in the 
> following scenario:
> * Some partitions of a table are dropped.
> * The HdfsTable object is removed sequentially before catalogd collects the 
> dropped partitions.
> In such case, catalogd loses track of the dropped partitions so their updates 
> keep existing in the catalog topic, until the partition names are reused 
> again.
> Note that the HdfsTable object can be removed by commands like DropTable or 
> INVALIDATE.
> The leaked partitions will be detected when a coordinator restarts. An 
> IllegalStateException complaining stale partitions will be reported, causing 
> the table not being added to the catalog cache of coordinator.
> {noformat}
> E0417 16:41:22.317298 20746 ImpaladCatalog.java:264] Error adding catalog 
> object: Received stale partition in a statestore update: 
> THdfsPartition(partitionKeyExprs:[TExpr(nodes:[TExprNode(node_type:INT_LITERAL,
>  type:TColumnType(types:[TTypeNode(type:SCALAR, 
> scalar_type:TScalarType(type:INT))]), num_children:0, is_constant:true, 
> int_literal:TIntLiteral(value:106), is_codegen_disabled:false)])], 
> location:THdfsPartitionLocation(prefix_index:0, suffix:p=106), id:138, 
> file_desc:[THdfsFileDesc(file_desc_data:18 00 00 00 00 00 00 00 00 00 0E 00 
> 1C 00 18 00 10 00 00 00 08 00 04 00 0E 00 00 00 18 00 00 00 8B 0E 2D EB 8E 01 
> 00 00 04 00 00 00 00 00 00 00 0C 00 00 00 01 00 00 00 4C 00 00 00 36 00 00 00 
> 34 34 34 37 62 35 66 34 62 30 65 64 66 64 65 31 2D 32 33 33 61 64 62 38 35 30 
> 30 30 30 30 30 30 30 5F 36 36 34 31 30 39 33 37 33 5F 64 61 74 61 2E 30 2E 74 
> 78 74 00 00 0C 00 14 00 00 00 0C 00...)], access_level:READ_WRITE, 
> stats:TTableStats(num_rows:-1), is_marked_cached:false, 
> 

[jira] [Commented] (IMPALA-12831) HdfsTable.toMinimalTCatalogObject() should hold table read lock to generate incremental updates

2024-05-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844101#comment-17844101
 ] 

ASF subversion and git services commented on IMPALA-12831:
--

Commit ee21427d26620b40d38c706b4944d2831f84f6f5 in impala's branch 
refs/heads/master from stiga-huang
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=ee21427d2 ]

IMPALA-13009: Fix catalogd not sending deletion updates for some dropped 
partitions

*Background*

Since IMPALA-3127, catalogd sends incremental partition updates based on
the last sent table snapshot ('maxSentPartitionId_' to be specific).
Dropped partitions since the last catalog update are tracked in
'droppedPartitions_' of HdfsTable. When catalogd collects the next
catalog update, they will be collected. HdfsTable then clears the set.
See details in CatalogServiceCatalog#addHdfsPartitionsToCatalogDelta().

If an HdfsTable is invalidated, it's replaced with an IncompleteTable
which doesn't track any partitions. The HdfsTable object is then added
to the deleteLog so catalogd can send deletion updates for all its
partitions. The same if the HdfsTable is dropped. However, the
previously dropped partitions are not collected in this case, which
results in a leak in the catalog topic if the partition name is not
reused anymore. Note that in the catalog topic, the key of a partition
update consists of the table name and the partition name. So if the
partition is added back to the table, the topic key will be reused then
resolves the leak.

The leak will be observed when a coordinator restarts. In the initial
catalog update sent from statestore, coordinator will find some
partition updates that are not referenced by the HdfsTable (assuming the
table is used again after the INVALIDATE). Then a Precondition check
fails and the table is not added to the coordinator.

*Overview of the patch*

This patch fixes the leak by also collecting the dropped partitions when
adding the HdfsTable to the deleteLog. A new field, dropped_partitions,
is added in THdfsTable to collect them. It's only used when catalogd
collects catalog updates.

Removes the Precondition check in coordinator and just reports the stale
partitions since IMPALA-12831 could also introduce them.

Also adds a log line in CatalogOpExecutor.alterTableDropPartition() to
show the dropped partition names for better diagnostics.

Tests
 - Added e2e tests

Change-Id: I12a68158dca18ee48c9564ea16b7484c9f5b5d21
Reviewed-on: http://gerrit.cloudera.org:8080/21326
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> HdfsTable.toMinimalTCatalogObject() should hold table read lock to generate 
> incremental updates
> ---
>
> Key: IMPALA-12831
> URL: https://issues.apache.org/jira/browse/IMPALA-12831
> Project: IMPALA
>  Issue Type: Bug
>  Components: Catalog
>Affects Versions: Impala 4.0.0, Impala 4.1.0, Impala 4.2.0, Impala 4.1.1, 
> Impala 4.1.2, Impala 4.3.0
>Reporter: Quanlong Huang
>Assignee: Quanlong Huang
>Priority: Critical
> Fix For: Impala 4.4.0
>
>
> When enable_incremental_metadata_updates=true (default), catalogd sends 
> incremental partition updates to coordinators, which goes into 
> HdfsTable.toMinimalTCatalogObject():
> {code:java}
>   public TCatalogObject toMinimalTCatalogObject() {
> TCatalogObject catalogObject = super.toMinimalTCatalogObject();
> if (!BackendConfig.INSTANCE.isIncrementalMetadataUpdatesEnabled()) {
>   return catalogObject;
> }
> catalogObject.getTable().setTable_type(TTableType.HDFS_TABLE);
> THdfsTable hdfsTable = new THdfsTable(hdfsBaseDir_, getColumnNames(),
> nullPartitionKeyValue_, nullColumnValue_,
> /*idToPartition=*/ new HashMap<>(),
> /*prototypePartition=*/ new THdfsPartition());
> for (HdfsPartition part : partitionMap_.values()) {
>   hdfsTable.partitions.put(part.getId(), part.toMinimalTHdfsPartition());
> }
> hdfsTable.setHas_full_partitions(false);
> // The minimal catalog object of partitions contain the partition names.
> hdfsTable.setHas_partition_names(true);
> catalogObject.getTable().setHdfs_table(hdfsTable);
> return catalogObject;
>   }{code}
> Accessing table fields without holding the table read lock might be failed by 
> concurrent DDLs. All workloads that use this method (e.g. INVALIDATE 
> commands) could hit this issue. We've saw event-processor failed in 
> processing a RELOAD event that want to invalidates an HdfsTable:
> {noformat}
> E0216 16:23:44.283689   253 MetastoreEventsProcessor.java:899] Unexpected 
> exception received while processing event
> Java exception follows:
> java.util.ConcurrentModificationException
>   at 

[jira] [Commented] (IMPALA-3127) Decouple partitions from tables

2024-05-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-3127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844100#comment-17844100
 ] 

ASF subversion and git services commented on IMPALA-3127:
-

Commit ee21427d26620b40d38c706b4944d2831f84f6f5 in impala's branch 
refs/heads/master from stiga-huang
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=ee21427d2 ]

IMPALA-13009: Fix catalogd not sending deletion updates for some dropped 
partitions

*Background*

Since IMPALA-3127, catalogd sends incremental partition updates based on
the last sent table snapshot ('maxSentPartitionId_' to be specific).
Dropped partitions since the last catalog update are tracked in
'droppedPartitions_' of HdfsTable. When catalogd collects the next
catalog update, they will be collected. HdfsTable then clears the set.
See details in CatalogServiceCatalog#addHdfsPartitionsToCatalogDelta().

If an HdfsTable is invalidated, it's replaced with an IncompleteTable
which doesn't track any partitions. The HdfsTable object is then added
to the deleteLog so catalogd can send deletion updates for all its
partitions. The same if the HdfsTable is dropped. However, the
previously dropped partitions are not collected in this case, which
results in a leak in the catalog topic if the partition name is not
reused anymore. Note that in the catalog topic, the key of a partition
update consists of the table name and the partition name. So if the
partition is added back to the table, the topic key will be reused then
resolves the leak.

The leak will be observed when a coordinator restarts. In the initial
catalog update sent from statestore, coordinator will find some
partition updates that are not referenced by the HdfsTable (assuming the
table is used again after the INVALIDATE). Then a Precondition check
fails and the table is not added to the coordinator.

*Overview of the patch*

This patch fixes the leak by also collecting the dropped partitions when
adding the HdfsTable to the deleteLog. A new field, dropped_partitions,
is added in THdfsTable to collect them. It's only used when catalogd
collects catalog updates.

Removes the Precondition check in coordinator and just reports the stale
partitions since IMPALA-12831 could also introduce them.

Also adds a log line in CatalogOpExecutor.alterTableDropPartition() to
show the dropped partition names for better diagnostics.

Tests
 - Added e2e tests

Change-Id: I12a68158dca18ee48c9564ea16b7484c9f5b5d21
Reviewed-on: http://gerrit.cloudera.org:8080/21326
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


> Decouple partitions from tables
> ---
>
> Key: IMPALA-3127
> URL: https://issues.apache.org/jira/browse/IMPALA-3127
> Project: IMPALA
>  Issue Type: Improvement
>  Components: Catalog
>Affects Versions: Impala 2.2.4
>Reporter: Dimitris Tsirogiannis
>Assignee: Quanlong Huang
>Priority: Major
>  Labels: catalog-server, performance
> Fix For: Impala 4.0.0
>
>
> Currently, partitions are tightly integrated into the HdfsTable objects, 
> making incremental metadata updates difficult to perform. Furthermore, the 
> catalog transmits entire table metadata even when only few partitions change, 
> introducing significant latencies, wasting network bandwidth and CPU cycles 
> while updating table metadata at the receiving impalads. As a first step, we 
> should decouple partitions from tables and add them as a separate level in 
> the hierarchy of catalog entities (server-db-table-partition). Subsequently, 
> the catalog should transmit only entities that have changed after DDL/DML 
> statements.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13058) TestRuntimeFilters.test_basic_filters failed on ARM builds

2024-05-06 Thread Riza Suminto (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17844060#comment-17844060
 ] 

Riza Suminto commented on IMPALA-13058:
---

This seems unrelated to runtime filter. It is strange that Query Compilation: 
2.858ms, but Planning finished: 0.000ns.
{code:java}
E       Query Compilation: 2.858ms
E          - Metadata of all 2 tables cached: 274.516us (274.516us)
E          - Analysis finished: 543.727us (269.211us)
E          - Authorization finished (noop): 599.562us (55.835us)
E          - Value transfer graph computed: 665.553us (65.991us)
E          - Single node plan created: 1.105ms (439.572us)
E          - Runtime filters computed: 1.748ms (643.246us)
E          - Distributed plan created: 1.768ms (20.414us)
E          - Parallel plans created: 1.782ms (13.834us)
E          - Planning finished: 2.858ms (1.075ms)
E       Query Timeline: 50.001ms
E          - Query submitted: 0.000ns (0.000ns)
E          - Planning finished: 0.000ns (0.000ns)
E          - Submit for admission: 0.000ns (0.000ns)
E          - Completed admission: 0.000ns (0.000ns)
E          - Ready to start on 3 backends: 0.000ns (0.000ns)
E          - All 3 execution backends (13 fragment instances) started: 0.000ns 
(0.000ns)
E          - First dynamic filter received: 0.000ns (0.000ns)
E          - Rows available: 10.000ms (10.000ms)
E          - First row fetched: 50.001ms (40.000ms)
E          - Last row fetched: 50.001ms (0.000ns)
E          - Released admission control resources: 50.001ms (0.000ns)
E        - AdmissionControlTimeSinceLastUpdate: 26.000ms
E        - ComputeScanRangeAssignmentTimer: 0.000ns {code}
 

> TestRuntimeFilters.test_basic_filters failed on ARM builds
> --
>
> Key: IMPALA-13058
> URL: https://issues.apache.org/jira/browse/IMPALA-13058
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Reporter: Wenzhe Zhou
>Assignee: Riza Suminto
>Priority: Major
>
> Stacktrace:
> query_test/test_runtime_filters.py:98: in test_basic_filters
> self.run_test_case('QueryTest/runtime_filters', vector, 
> test_file_vars=vars)
> /data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/impala_test_suite.py:851:
>  in run_test_case
> update_section=pytest.config.option.update_results)
> /data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/test_result_verifier.py:701:
>  in verify_runtime_profile
> actual))
> E   AssertionError: Did not find matches for lines in runtime profile:
> E   EXPECTED LINES:
> E   row_regex: .*REMOTE.*ms.*ms.*true



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-13051) Speed up test_query_log test runs

2024-05-06 Thread Jason Fehr (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jason Fehr reassigned IMPALA-13051:
---

Assignee: Jason Fehr  (was: Michael Smith)

> Speed up test_query_log test runs
> -
>
> Key: IMPALA-13051
> URL: https://issues.apache.org/jira/browse/IMPALA-13051
> Project: IMPALA
>  Issue Type: Task
>Affects Versions: Impala 4.4.0
>Reporter: Michael Smith
>Assignee: Jason Fehr
>Priority: Minor
>
> test_query_log.py takes 11 minutes to run. Most of them use graceful 
> shutdown, and provide an unnecessary grace period. Optimize test_query_log 
> test runs, and do some other code cleanup around workload management.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-13061) Query Live table fails to load if default_transactional_type=insert_only set globally

2024-05-06 Thread Michael Smith (Jira)
Michael Smith created IMPALA-13061:
--

 Summary: Query Live table fails to load if 
default_transactional_type=insert_only set globally
 Key: IMPALA-13061
 URL: https://issues.apache.org/jira/browse/IMPALA-13061
 Project: IMPALA
  Issue Type: Bug
Reporter: Michael Smith


If transactional type defaults to insert_only for all queries via
{code}
--default_query_options=default_transactional_type=insert_only
{code}
the table definition for {{sys.impala_query_live}} is set to transactional, 
which causes an exception in catalogd
{code}
I0506 22:07:42.808758  3972 jni-util.cc:302] 4547b965aeebc5f0:8ba96c58] 
java.lang.IllegalStateException
at 
com.google.common.base.Preconditions.checkState(Preconditions.java:496)
at org.apache.impala.catalog.Table.getPartialInfo(Table.java:851)
at 
org.apache.impala.catalog.CatalogServiceCatalog.doGetPartialCatalogObject(CatalogServiceCatalog.java:3818)
at 
org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3714)
at 
org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3681)
at 
org.apache.impala.service.JniCatalog.lambda$getPartialCatalogObject$10(JniCatalog.java:431)
at 
org.apache.impala.service.JniCatalogOp.lambda$execAndSerialize$1(JniCatalogOp.java:90)
at org.apache.impala.service.JniCatalogOp.execOp(JniCatalogOp.java:58)
at 
org.apache.impala.service.JniCatalogOp.execAndSerialize(JniCatalogOp.java:89)
at 
org.apache.impala.service.JniCatalogOp.execAndSerializeSilentStartAndFinish(JniCatalogOp.java:109)
at 
org.apache.impala.service.JniCatalog.execAndSerializeSilentStartAndFinish(JniCatalog.java:253)
at 
org.apache.impala.service.JniCatalog.getPartialCatalogObject(JniCatalog.java:430)
{code}

We need to override that setting while creating {{sys.impala_query_live}}.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-13061) Query Live table fails to load if default_transactional_type=insert_only set globally

2024-05-06 Thread Michael Smith (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Smith reassigned IMPALA-13061:
--

Assignee: Michael Smith

> Query Live table fails to load if default_transactional_type=insert_only set 
> globally
> -
>
> Key: IMPALA-13061
> URL: https://issues.apache.org/jira/browse/IMPALA-13061
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Major
>
> If transactional type defaults to insert_only for all queries via
> {code}
> --default_query_options=default_transactional_type=insert_only
> {code}
> the table definition for {{sys.impala_query_live}} is set to transactional, 
> which causes an exception in catalogd
> {code}
> I0506 22:07:42.808758  3972 jni-util.cc:302] 
> 4547b965aeebc5f0:8ba96c58] java.lang.IllegalStateException
> at 
> com.google.common.base.Preconditions.checkState(Preconditions.java:496)
> at org.apache.impala.catalog.Table.getPartialInfo(Table.java:851)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.doGetPartialCatalogObject(CatalogServiceCatalog.java:3818)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3714)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3681)
> at 
> org.apache.impala.service.JniCatalog.lambda$getPartialCatalogObject$10(JniCatalog.java:431)
> at 
> org.apache.impala.service.JniCatalogOp.lambda$execAndSerialize$1(JniCatalogOp.java:90)
> at org.apache.impala.service.JniCatalogOp.execOp(JniCatalogOp.java:58)
> at 
> org.apache.impala.service.JniCatalogOp.execAndSerialize(JniCatalogOp.java:89)
> at 
> org.apache.impala.service.JniCatalogOp.execAndSerializeSilentStartAndFinish(JniCatalogOp.java:109)
> at 
> org.apache.impala.service.JniCatalog.execAndSerializeSilentStartAndFinish(JniCatalog.java:253)
> at 
> org.apache.impala.service.JniCatalog.getPartialCatalogObject(JniCatalog.java:430)
> {code}
> We need to override that setting while creating {{sys.impala_query_live}}.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-13061) Query Live table fails to load if default_transactional_type=insert_only set globally

2024-05-06 Thread Michael Smith (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Smith updated IMPALA-13061:
---
Epic Link: IMPALA-12427

> Query Live table fails to load if default_transactional_type=insert_only set 
> globally
> -
>
> Key: IMPALA-13061
> URL: https://issues.apache.org/jira/browse/IMPALA-13061
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Critical
>
> If transactional type defaults to insert_only for all queries via
> {code}
> --default_query_options=default_transactional_type=insert_only
> {code}
> the table definition for {{sys.impala_query_live}} is set to transactional, 
> which causes an exception in catalogd
> {code}
> I0506 22:07:42.808758  3972 jni-util.cc:302] 
> 4547b965aeebc5f0:8ba96c58] java.lang.IllegalStateException
> at 
> com.google.common.base.Preconditions.checkState(Preconditions.java:496)
> at org.apache.impala.catalog.Table.getPartialInfo(Table.java:851)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.doGetPartialCatalogObject(CatalogServiceCatalog.java:3818)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3714)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3681)
> at 
> org.apache.impala.service.JniCatalog.lambda$getPartialCatalogObject$10(JniCatalog.java:431)
> at 
> org.apache.impala.service.JniCatalogOp.lambda$execAndSerialize$1(JniCatalogOp.java:90)
> at org.apache.impala.service.JniCatalogOp.execOp(JniCatalogOp.java:58)
> at 
> org.apache.impala.service.JniCatalogOp.execAndSerialize(JniCatalogOp.java:89)
> at 
> org.apache.impala.service.JniCatalogOp.execAndSerializeSilentStartAndFinish(JniCatalogOp.java:109)
> at 
> org.apache.impala.service.JniCatalog.execAndSerializeSilentStartAndFinish(JniCatalog.java:253)
> at 
> org.apache.impala.service.JniCatalog.getPartialCatalogObject(JniCatalog.java:430)
> {code}
> We need to override that setting while creating {{sys.impala_query_live}}.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-13061) Query Live table fails to load if default_transactional_type=insert_only set globally

2024-05-06 Thread Michael Smith (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-13061 started by Michael Smith.
--
> Query Live table fails to load if default_transactional_type=insert_only set 
> globally
> -
>
> Key: IMPALA-13061
> URL: https://issues.apache.org/jira/browse/IMPALA-13061
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Critical
>
> If transactional type defaults to insert_only for all queries via
> {code}
> --default_query_options=default_transactional_type=insert_only
> {code}
> the table definition for {{sys.impala_query_live}} is set to transactional, 
> which causes an exception in catalogd
> {code}
> I0506 22:07:42.808758  3972 jni-util.cc:302] 
> 4547b965aeebc5f0:8ba96c58] java.lang.IllegalStateException
> at 
> com.google.common.base.Preconditions.checkState(Preconditions.java:496)
> at org.apache.impala.catalog.Table.getPartialInfo(Table.java:851)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.doGetPartialCatalogObject(CatalogServiceCatalog.java:3818)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3714)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3681)
> at 
> org.apache.impala.service.JniCatalog.lambda$getPartialCatalogObject$10(JniCatalog.java:431)
> at 
> org.apache.impala.service.JniCatalogOp.lambda$execAndSerialize$1(JniCatalogOp.java:90)
> at org.apache.impala.service.JniCatalogOp.execOp(JniCatalogOp.java:58)
> at 
> org.apache.impala.service.JniCatalogOp.execAndSerialize(JniCatalogOp.java:89)
> at 
> org.apache.impala.service.JniCatalogOp.execAndSerializeSilentStartAndFinish(JniCatalogOp.java:109)
> at 
> org.apache.impala.service.JniCatalog.execAndSerializeSilentStartAndFinish(JniCatalog.java:253)
> at 
> org.apache.impala.service.JniCatalog.getPartialCatalogObject(JniCatalog.java:430)
> {code}
> We need to override that setting while creating {{sys.impala_query_live}}.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-13061) Query Live table fails to load if default_transactional_type=insert_only set globally

2024-05-06 Thread Michael Smith (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13061?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Smith updated IMPALA-13061:
---
Priority: Critical  (was: Major)

> Query Live table fails to load if default_transactional_type=insert_only set 
> globally
> -
>
> Key: IMPALA-13061
> URL: https://issues.apache.org/jira/browse/IMPALA-13061
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Critical
>
> If transactional type defaults to insert_only for all queries via
> {code}
> --default_query_options=default_transactional_type=insert_only
> {code}
> the table definition for {{sys.impala_query_live}} is set to transactional, 
> which causes an exception in catalogd
> {code}
> I0506 22:07:42.808758  3972 jni-util.cc:302] 
> 4547b965aeebc5f0:8ba96c58] java.lang.IllegalStateException
> at 
> com.google.common.base.Preconditions.checkState(Preconditions.java:496)
> at org.apache.impala.catalog.Table.getPartialInfo(Table.java:851)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.doGetPartialCatalogObject(CatalogServiceCatalog.java:3818)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3714)
> at 
> org.apache.impala.catalog.CatalogServiceCatalog.getPartialCatalogObject(CatalogServiceCatalog.java:3681)
> at 
> org.apache.impala.service.JniCatalog.lambda$getPartialCatalogObject$10(JniCatalog.java:431)
> at 
> org.apache.impala.service.JniCatalogOp.lambda$execAndSerialize$1(JniCatalogOp.java:90)
> at org.apache.impala.service.JniCatalogOp.execOp(JniCatalogOp.java:58)
> at 
> org.apache.impala.service.JniCatalogOp.execAndSerialize(JniCatalogOp.java:89)
> at 
> org.apache.impala.service.JniCatalogOp.execAndSerializeSilentStartAndFinish(JniCatalogOp.java:109)
> at 
> org.apache.impala.service.JniCatalog.execAndSerializeSilentStartAndFinish(JniCatalog.java:253)
> at 
> org.apache.impala.service.JniCatalog.getPartialCatalogObject(JniCatalog.java:430)
> {code}
> We need to override that setting while creating {{sys.impala_query_live}}.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-13034) Add logs for slow HTTP requests dumping the profile

2024-05-06 Thread Quanlong Huang (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Quanlong Huang reassigned IMPALA-13034:
---

Assignee: Quanlong Huang

> Add logs for slow HTTP requests dumping the profile
> ---
>
> Key: IMPALA-13034
> URL: https://issues.apache.org/jira/browse/IMPALA-13034
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Reporter: Quanlong Huang
>Assignee: Quanlong Huang
>Priority: Critical
>
> There are several endpoints in WebUI that can dump a query profile: 
> /query_profile, /query_profile_encoded, /query_profile_plain_text, 
> /query_profile_json
> The HTTP handler thread goes into ImpalaServer::GetRuntimeProfileOutput() 
> which acquires lock of the ClientRequestState. This could blocks client 
> requests in fetching query results. We should add warning logs when such HTTP 
> requests run slow (e.g. when the profile is too large to download in a short 
> time). IP address and other info of such requests should also be logged.
> Related codes:
> https://github.com/apache/impala/blob/f620e5d5c0bbdb0fd97bac31c7b7439cd13c6d08/be/src/service/impala-server.cc#L736
> https://github.com/apache/impala/blob/f620e5d5c0bbdb0fd97bac31c7b7439cd13c6d08/be/src/service/impala-beeswax-server.cc#L601
> https://github.com/apache/impala/blob/f620e5d5c0bbdb0fd97bac31c7b7439cd13c6d08/be/src/service/impala-hs2-server.cc#L207



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-12795) TestWebPage.test_catalog_operation_fields is flaky

2024-05-06 Thread Quanlong Huang (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Quanlong Huang resolved IMPALA-12795.
-
Fix Version/s: Impala 4.4.0
   Resolution: Fixed

> TestWebPage.test_catalog_operation_fields is flaky
> --
>
> Key: IMPALA-12795
> URL: https://issues.apache.org/jira/browse/IMPALA-12795
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Quanlong Huang
>Assignee: Quanlong Huang
>Priority: Critical
> Fix For: Impala 4.4.0
>
>
> Saw the test failed in an internal job:
> {noformat}
> webserver/test_web_pages.py:942: in test_catalog_operation_fields
> assert matched
> E   assert False{noformat}
> That means the CREATE DATABASE statement was not found in the coordinator 
> webUI.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-12951) Investigate RETRY_FAILED_QUERIES with System Tables

2024-05-06 Thread Michael Smith (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843919#comment-17843919
 ] 

Michael Smith commented on IMPALA-12951:


Impala has a general query retry mechanism. It's not used for inserts. I 
believe this problem only applies to system tables, but not 100% sure.

> Investigate RETRY_FAILED_QUERIES with System Tables
> ---
>
> Key: IMPALA-12951
> URL: https://issues.apache.org/jira/browse/IMPALA-12951
> Project: IMPALA
>  Issue Type: Task
>Reporter: Michael Smith
>Priority: Major
>
> My theory is that {{RETRY_FAILED_QUERIES}} doesn't work well with querying 
> System Tables because coordinators don't get added to a blacklist when they 
> fail, so it reschedules them on the same set of nodes.
> Look into whether issues with {{RETRY_FAILED_QUERIES}} when a coordinator 
> restarts (without graceful shutdown) are unique to coordinators, and how to 
> improve it.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-12951) Investigate RETRY_FAILED_QUERIES with System Tables

2024-05-06 Thread Jason Fehr (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843918#comment-17843918
 ] 

Jason Fehr commented on IMPALA-12951:
-

Is this issue limited to clients when they are reading from the system tables 
or does it also cover situations where inserting into the sys.impala_query_log 
table fails?

> Investigate RETRY_FAILED_QUERIES with System Tables
> ---
>
> Key: IMPALA-12951
> URL: https://issues.apache.org/jira/browse/IMPALA-12951
> Project: IMPALA
>  Issue Type: Task
>Reporter: Michael Smith
>Priority: Major
>
> My theory is that {{RETRY_FAILED_QUERIES}} doesn't work well with querying 
> System Tables because coordinators don't get added to a blacklist when they 
> fail, so it reschedules them on the same set of nodes.
> Look into whether issues with {{RETRY_FAILED_QUERIES}} when a coordinator 
> restarts (without graceful shutdown) are unique to coordinators, and how to 
> improve it.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-13060) TestQueryLogTableBeeswax.test_query_log_table_query_select_dedicate_coordinator flaky

2024-05-06 Thread Wenzhe Zhou (Jira)
Wenzhe Zhou created IMPALA-13060:


 Summary: 
TestQueryLogTableBeeswax.test_query_log_table_query_select_dedicate_coordinator 
flaky
 Key: IMPALA-13060
 URL: https://issues.apache.org/jira/browse/IMPALA-13060
 Project: IMPALA
  Issue Type: Bug
Reporter: Wenzhe Zhou
Assignee: Jason Fehr


The unit-test failed since the expected message "'Query successfully 
unregistered: query_id=554dcf86d11dbd5f:0ea9f28d'" was not found in the 
log file. 

Stacktrace:
custom_cluster/test_query_log.py:414: in 
test_query_log_table_query_select_dedicate_coordinator
client = self.get_client(vector.get_value('protocol'))
custom_cluster/test_query_log.py:73: in get_client
self.assert_impalad_log_contains("INFO", finish_re)
common/impala_test_suite.py:1271: in assert_impalad_log_contains
"impalad", level, line_regex, expected_count, timeout_s)
common/impala_test_suite.py:1322: in assert_log_contains
(expected_count, log_file_path, line_regex, found, line)
E   AssertionError: Expected 1 lines in file 
/data0/jenkins/workspace/impala-asf-master-core-s3-data-cache/repos/Impala/logs/custom_cluster_tests/impalad.impala-ec2-centos79-m6i-4xlarge-xldisk-04d5.vpc.cloudera.com.jenkins.log.INFO.20240506-052112.12754
 matching regex 'Query successfully unregistered: 
query_id=554dcf86d11dbd5f:0ea9f28d', but found 0 lines. Last line was: 
E   I0506 05:21:32.810438 14218 TAcceptQueueServer.cpp:355] New connection to 
server beeswax-frontend from client 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-12977) add search and pagination to /hadoop-varz

2024-05-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843870#comment-17843870
 ] 

ASF subversion and git services commented on IMPALA-12977:
--

Commit 7c98ebb7b149a03a1fdb10f0da124c3fd2265f5d in impala's branch 
refs/heads/master from Saurabh Katiyal
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=7c98ebb7b ]

IMPALA-12977: add search and pagination to /hadoop-varz

Added search and pagination feature to /hadoop-varz

Change-Id: Ic8cac23b655fa58ce12d9857649705574614a5f0
Reviewed-on: http://gerrit.cloudera.org:8080/21329
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 


>  add search and pagination to /hadoop-varz
> --
>
> Key: IMPALA-12977
> URL: https://issues.apache.org/jira/browse/IMPALA-12977
> Project: IMPALA
>  Issue Type: Improvement
>Reporter: Saurabh Katiyal
>Assignee: Saurabh Katiyal
>Priority: Minor
>  Labels: newbie
>
> /hadoop-varz has 2000+ configurations , It'd be nice to have some of the 
> tools like search and pagination that we get on /varz (flags.tmpl)
> existing template to /hadoop-varz:
> [https://github.com/apache/impala/blob/ba4cb95b6251911fa9e057cea1cb37958d339fed/www/hadoop-varz.tmpl]
> existing template to /varz:
> [https://github.com/apache/impala/blob/ba4cb95b6251911fa9e057cea1cb37958d339fed/www/flags.tmpl]
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-13058) TestRuntimeFilters.test_basic_filters failed on ARM builds

2024-05-06 Thread Riza Suminto (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Riza Suminto reassigned IMPALA-13058:
-

Assignee: Riza Suminto

> TestRuntimeFilters.test_basic_filters failed on ARM builds
> --
>
> Key: IMPALA-13058
> URL: https://issues.apache.org/jira/browse/IMPALA-13058
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Reporter: Wenzhe Zhou
>Assignee: Riza Suminto
>Priority: Major
>
> Stacktrace:
> query_test/test_runtime_filters.py:98: in test_basic_filters
> self.run_test_case('QueryTest/runtime_filters', vector, 
> test_file_vars=vars)
> /data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/impala_test_suite.py:851:
>  in run_test_case
> update_section=pytest.config.option.update_results)
> /data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/test_result_verifier.py:701:
>  in verify_runtime_profile
> actual))
> E   AssertionError: Did not find matches for lines in runtime profile:
> E   EXPECTED LINES:
> E   row_regex: .*REMOTE.*ms.*ms.*true



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-12264) Add configuration to limit concurrent connections

2024-05-06 Thread Michael Smith (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Smith resolved IMPALA-12264.

Fix Version/s: Impala 4.4.0
   Resolution: Fixed

> Add configuration to limit concurrent connections
> -
>
> Key: IMPALA-12264
> URL: https://issues.apache.org/jira/browse/IMPALA-12264
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Backend
>Reporter: Quanlong Huang
>Assignee: Andrew Sherman
>Priority: Major
> Fix For: Impala 4.4.0
>
>
> To prevent a rogue application from repeatedly connecting to and monopolizing 
> Impala, we can add configurations to limit concurrent connections. E.g. limit 
> the number of concurrent connections per user, IP address, or the user and IP 
> address combination.
> CC [~mylogi...@gmail.com] 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13058) TestRuntimeFilters.test_basic_filters failed on ARM builds

2024-05-06 Thread Riza Suminto (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843833#comment-17843833
 ] 

Riza Suminto commented on IMPALA-13058:
---

Coordinator received filter update very fast within 2ms, but column "First 
arrived" and "Completed" stays "N/A" in "Final filter table".
{code:java}
E   Filter routing table: 
EID  Src. Node  Tgt. Node(s)  Target type  Partition filter  Pending 
(Expected)  First arrived  Completed  Enabled  Bloom Size   Est fpp  Min value  
Max value   In-list size
E   
-
E 0  2 0   REMOTE  true   1 
(3)N/AN/A true 1.00 MB  1.13e-16
E   Backend startup latencies: Count: 3, sum: 0, min / max: 0 / 0, 25th 
%-ile: 0, 50th %-ile: 0, 75th %-ile: 0, 90th %-ile: 0, 95th %-ile: 0, 99.9th 
%-ile: 0
E   Slowest backend to start up: 
E   Final filter table: 
EID  Src. Node  Tgt. Node(s)  Target type  Partition filter  Pending 
(Expected)  First arrived  Completed  Enabled  Bloom Size   Est fpp  Min value  
Max value   In-list size
E   
-
E 0  2 0   REMOTE  true   0 
(3)N/AN/A true 1.00 MB  1.13e-16 {code}
I think what happen is, the filter arrived so fast such that 
first_arrival_time() and completion_time() stays at 0 and printed as "N/A"

[https://github.com/apache/impala/blob/0d01f5e82967b84b182d40de1ce213367e0b6181/be/src/runtime/coordinator.cc#L675-L684]
 

> TestRuntimeFilters.test_basic_filters failed on ARM builds
> --
>
> Key: IMPALA-13058
> URL: https://issues.apache.org/jira/browse/IMPALA-13058
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Reporter: Wenzhe Zhou
>Priority: Major
>
> Stacktrace:
> query_test/test_runtime_filters.py:98: in test_basic_filters
> self.run_test_case('QueryTest/runtime_filters', vector, 
> test_file_vars=vars)
> /data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/impala_test_suite.py:851:
>  in run_test_case
> update_section=pytest.config.option.update_results)
> /data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/test_result_verifier.py:701:
>  in verify_runtime_profile
> actual))
> E   AssertionError: Did not find matches for lines in runtime profile:
> E   EXPECTED LINES:
> E   row_regex: .*REMOTE.*ms.*ms.*true



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-13059) Testcase fails at test_migrated_table_field_id_resolution due to unmatched column number or metadata file for version 2 is missing

2024-05-06 Thread Wenzhe Zhou (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wenzhe Zhou updated IMPALA-13059:
-
Summary: Testcase fails at test_migrated_table_field_id_resolution due to 
unmatched column number or metadata file for version 2 is missing  (was: 
Testcase fails at test_migrated_table_field_id_resolution due to unmatched 
column number)

> Testcase fails at test_migrated_table_field_id_resolution due to unmatched 
> column number or metadata file for version 2 is missing
> --
>
> Key: IMPALA-13059
> URL: https://issues.apache.org/jira/browse/IMPALA-13059
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Wenzhe Zhou
>Priority: Major
>
> Test failed due to the unmatched number of columns.
>  
> Stacktrace:
> query_test/test_iceberg.py:270: in test_migrated_table_field_id_resolution
> vector, unique_database)
> common/impala_test_suite.py:756: in run_test_case
> self.__verify_results_and_errors(vector, test_section, result, use_db)
> common/impala_test_suite.py:589: in __verify_results_and_errors
> replace_filenames_with_placeholder)
> common/test_result_verifier.py:443: in verify_raw_results
> verify_results(expected_types, actual_types, order_matters=True)
> common/test_result_verifier.py:339: in verify_results
> assert expected_results == actual_results
> E   assert ['INT', 'DOUBLE'] == ['INT', 'DOUBLE', 'DOUBLE']
> E Right contains more items, first extra item: 'DOUBLE'
> E Use -v to get the full diff



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13059) Testcase fails at test_migrated_table_field_id_resolution due to unmatched column number

2024-05-06 Thread Wenzhe Zhou (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843827#comment-17843827
 ] 

Wenzhe Zhou commented on IMPALA-13059:
--

In another test case, it failed due to metadata file for version 2 is missing

Stacktrace
query_test/test_iceberg.py:264: in test_migrated_table_field_id_resolution
"iceberg_migrated_complex_test", "parquet")
common/file_utils.py:72: in create_iceberg_table_from_directory
'True');""".format(qualified_table_name))
common/impala_connection.py:216: in execute
fetch_profile_after_close=fetch_profile_after_close)
beeswax/impala_beeswax.py:191: in execute
handle = self.__execute_query(query_string.strip(), user=user)
beeswax/impala_beeswax.py:384: in __execute_query
self.wait_for_finished(handle)
beeswax/impala_beeswax.py:405: in wait_for_finished
raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
E   ImpalaBeeswaxException: ImpalaBeeswaxException:
EQuery aborted:ImpalaRuntimeException: Failed to ALTER table 
'iceberg_migrated_complex_test': Metadata file for version 2 is missing

> Testcase fails at test_migrated_table_field_id_resolution due to unmatched 
> column number
> 
>
> Key: IMPALA-13059
> URL: https://issues.apache.org/jira/browse/IMPALA-13059
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Wenzhe Zhou
>Priority: Major
>
> Test failed due to the unmatched number of columns.
>  
> Stacktrace:
> query_test/test_iceberg.py:270: in test_migrated_table_field_id_resolution
> vector, unique_database)
> common/impala_test_suite.py:756: in run_test_case
> self.__verify_results_and_errors(vector, test_section, result, use_db)
> common/impala_test_suite.py:589: in __verify_results_and_errors
> replace_filenames_with_placeholder)
> common/test_result_verifier.py:443: in verify_raw_results
> verify_results(expected_types, actual_types, order_matters=True)
> common/test_result_verifier.py:339: in verify_results
> assert expected_results == actual_results
> E   assert ['INT', 'DOUBLE'] == ['INT', 'DOUBLE', 'DOUBLE']
> E Right contains more items, first extra item: 'DOUBLE'
> E Use -v to get the full diff



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-13059) Testcase fails at test_migrated_table_field_id_resolution due to unmatched column number

2024-05-06 Thread Wenzhe Zhou (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wenzhe Zhou updated IMPALA-13059:
-
Summary: Testcase fails at test_migrated_table_field_id_resolution due to 
unmatched column number  (was: Testcase fails at 
test_migrated_table_field_id_resolution due to not matched number of columns)

> Testcase fails at test_migrated_table_field_id_resolution due to unmatched 
> column number
> 
>
> Key: IMPALA-13059
> URL: https://issues.apache.org/jira/browse/IMPALA-13059
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Wenzhe Zhou
>Priority: Major
>
> Test failed due to the unmatched number of columns.
>  
> Stacktrace:
> query_test/test_iceberg.py:270: in test_migrated_table_field_id_resolution
> vector, unique_database)
> common/impala_test_suite.py:756: in run_test_case
> self.__verify_results_and_errors(vector, test_section, result, use_db)
> common/impala_test_suite.py:589: in __verify_results_and_errors
> replace_filenames_with_placeholder)
> common/test_result_verifier.py:443: in verify_raw_results
> verify_results(expected_types, actual_types, order_matters=True)
> common/test_result_verifier.py:339: in verify_results
> assert expected_results == actual_results
> E   assert ['INT', 'DOUBLE'] == ['INT', 'DOUBLE', 'DOUBLE']
> E Right contains more items, first extra item: 'DOUBLE'
> E Use -v to get the full diff



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-13059) Testcase fails at test_migrated_table_field_id_resolution due to not matched number of columns

2024-05-06 Thread Wenzhe Zhou (Jira)
Wenzhe Zhou created IMPALA-13059:


 Summary: Testcase fails at test_migrated_table_field_id_resolution 
due to not matched number of columns
 Key: IMPALA-13059
 URL: https://issues.apache.org/jira/browse/IMPALA-13059
 Project: IMPALA
  Issue Type: Bug
Reporter: Wenzhe Zhou


Test failed due to the unmatched number of columns.
 
Stacktrace:
query_test/test_iceberg.py:270: in test_migrated_table_field_id_resolution
vector, unique_database)
common/impala_test_suite.py:756: in run_test_case
self.__verify_results_and_errors(vector, test_section, result, use_db)
common/impala_test_suite.py:589: in __verify_results_and_errors
replace_filenames_with_placeholder)
common/test_result_verifier.py:443: in verify_raw_results
verify_results(expected_types, actual_types, order_matters=True)
common/test_result_verifier.py:339: in verify_results
assert expected_results == actual_results
E   assert ['INT', 'DOUBLE'] == ['INT', 'DOUBLE', 'DOUBLE']
E Right contains more items, first extra item: 'DOUBLE'
E Use -v to get the full diff



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-13058) TestRuntimeFilters.test_basic_filters failed on ARM builds

2024-05-06 Thread Wenzhe Zhou (Jira)
Wenzhe Zhou created IMPALA-13058:


 Summary: TestRuntimeFilters.test_basic_filters failed on ARM builds
 Key: IMPALA-13058
 URL: https://issues.apache.org/jira/browse/IMPALA-13058
 Project: IMPALA
  Issue Type: Bug
  Components: Backend
Reporter: Wenzhe Zhou


Stacktrace:

query_test/test_runtime_filters.py:98: in test_basic_filters
self.run_test_case('QueryTest/runtime_filters', vector, test_file_vars=vars)
/data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/impala_test_suite.py:851:
 in run_test_case
update_section=pytest.config.option.update_results)
/data/jenkins/workspace/impala-cdw-master-exhaustive-release-arm/repos/Impala/tests/common/test_result_verifier.py:701:
 in verify_runtime_profile
actual))
E   AssertionError: Did not find matches for lines in runtime profile:
E   EXPECTED LINES:
E   row_regex: .*REMOTE.*ms.*ms.*true




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-13057) Incorporate tuple/slot information into the tuple cache key

2024-05-06 Thread Joe McDonnell (Jira)
Joe McDonnell created IMPALA-13057:
--

 Summary: Incorporate tuple/slot information into the tuple cache 
key
 Key: IMPALA-13057
 URL: https://issues.apache.org/jira/browse/IMPALA-13057
 Project: IMPALA
  Issue Type: Bug
  Components: Frontend
Affects Versions: Impala 4.4.0
Reporter: Joe McDonnell
Assignee: Joe McDonnell


Since the tuple and slot information is kept separately in the descriptor 
table, it does not get incorporated into the PlanNode thrift used for the tuple 
cache key. This means that the tuple cache can't distinguish between these two 
queries:
{noformat}
select int_col1 from table;
select int_col2 from table;{noformat}
To solve this, the tuple/slot information needs to be incorporated into the 
cache key. PlanNode::initThrift() walks through each tuple, so this is a good 
place to serialize the TupleDescriptor/SlotDescriptors and incorporate it into 
the hash.

The tuple ids and slot ids are global ids, so the value is influenced by the 
entirety of the query. This is a problem for matching cache results across 
different queries. As part of incorporating the tuple/slot information, we 
should also add an ability to translate tuple/slot ids into ids local to a 
subtree.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13012) Completed queries write fails regularly under heavy load

2024-05-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843794#comment-17843794
 ] 

ASF subversion and git services commented on IMPALA-13012:
--

Commit 73f13f0e9f225400bb641c48da57a9871b5d8383 in impala's branch 
refs/heads/branch-4.4.0 from Michael Smith
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=73f13f0e9 ]

IMPALA-13012: Lower default query_log_max_queued

Sets the query_log_max_queued default such that

  query_log_max_queued * num_columns(49) < statement_expression_limit

to avoid triggering e.g.

  AnalysisException: Exceeded the statement expression limit (25)
  Statement has 370039 expressions.

Also increases statement_expression_limit for insertion to avoid an
error if query_log_max_queued is changed.

Logs time taken to write to the queries table for help with debugging
and adds histogram "impala-server.completed-queries.write-durations".

Fixes InternalServer so it uses 'default_query_options'.

Change-Id: I6535675307d88cb65ba7d908f3c692e0cf3259d7
Reviewed-on: http://gerrit.cloudera.org:8080/21351
Reviewed-by: Michael Smith 
Tested-by: Michael Smith 
Reviewed-by: Riza Suminto 
(cherry picked from commit ba32d70891fd68c5c1234ed543b74c51661bf272)


> Completed queries write fails regularly under heavy load
> 
>
> Key: IMPALA-13012
> URL: https://issues.apache.org/jira/browse/IMPALA-13012
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Affects Versions: Impala 4.4.0
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Critical
> Fix For: Impala 4.4.0
>
>
> Under heavy test load (running EE tests), Impala regularly fails to write 
> completed queries with errors like
> {code}
> W0411 19:11:07.764967 32713 workload-management.cc:435] failed to write 
> completed queries table="sys.impala_query_log" record_count="10001"
> W0411 19:11:07.764983 32713 workload-management.cc:437] AnalysisException: 
> Exceeded the statement expression limit (25)
> Statement has 370039 expressions.
> {code}
> After a few attempts, it floods logs with an error for each query that could 
> not be written
> {code}
> E0411 19:11:24.646953 32713 workload-management.cc:376] could not write 
> completed query table="sys.impala_query_log" 
> query_id="3142ceb1380b58e6:715b83d9"
> {code}
> This seems like poor default behavior. Options for addressing it:
> # Decrease the default for {{query_log_max_queued}}. Inserts are pretty 
> constant at 37 expressions per entry. I'm not sure why that isn't 49, since 
> that's the number of columns we have; maybe some fields are frequently 
> omitted. I would cap {{query_log_max_queued}} to {{statement_expression_limit 
> / number_of_columns ~ 5100}}.
> # Allow workload management to {{set statement_expression_limit}} higher 
> using a similar formula. This may be relatively safe as the expressions are 
> simple.
> # Ideally we would skip expression parsing and construct TExecRequest 
> directly, but that's a much larger effort.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13005) Query Live table not visible via metastore

2024-05-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843793#comment-17843793
 ] 

ASF subversion and git services commented on IMPALA-13005:
--

Commit 91a9dd81ccd38e2e4b975e4ddf49f97c3e01b051 in impala's branch 
refs/heads/branch-4.4.0 from Michael Smith
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=91a9dd81c ]

IMPALA-13005: Create Query Live table in HMS

Creates the 'sys.impala_query_live' table in HMS using a similar 'CREATE
TABLE' command to 'sys.impala_query_log'. Updates frontend to identify a
System Table based on the '__IMPALA_SYSTEM_TABLE' property. Tables
improperly marked with '__IMPALA_SYSTEM_TABLE' will error when
attempting to scan them because no relevant scanner will be available.

Creating the table in HMS simplifies supporting 'SHOW CREATE TABLE' and
'DESCRIBE EXTENDED', so allows them for parity with Query Log.
Explicitly disables 'COMPUTE STATS' on system tables as it doesn't work
correctly.

Makes System Tables work with local catalog mode, fixing

  LocalCatalogException: Unknown table type for table sys.impala_query_live

Updates workload management implementation to rely more on
SystemTables.thrift definition, and adds DCHECKs to verify completeness
and ordering.

Testing:
- adds additional test cases for changes to introspection commands
- passes existing test_query_live and test_query_log suites

Change-Id: Idf302ee54a819fdee2db0ae582a5eeddffe4a5b4
Reviewed-on: http://gerrit.cloudera.org:8080/21302
Reviewed-by: Riza Suminto 
Tested-by: Impala Public Jenkins 
(cherry picked from commit 73a9ef9c4c00419b5082f355d96961c5971634d6)


> Query Live table not visible via metastore
> --
>
> Key: IMPALA-13005
> URL: https://issues.apache.org/jira/browse/IMPALA-13005
> Project: IMPALA
>  Issue Type: Bug
>Affects Versions: Impala 4.4.0
>Reporter: Michael Smith
>Assignee: Michael Smith
>Priority: Major
> Fix For: Impala 4.4.0
>
>
> Tools that inspect Hive Metastore to view information about tables are unable 
> to see {{sys.impala_query_live}} when {{enable_workload_mgmt}} is enabled. 
> This presents an inconsistent view depending on whether inspecting tables via 
> Impala or Hive MetaStore.
> Register {{sys.impala_query_live}} with Hive MetaStore, similar to what 
> {{sys.impala_query_log}} does, to make table listing consistent.
> It also doesn't work with local catalog mode.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13024) Several tests timeout waiting for admission

2024-05-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13024?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843792#comment-17843792
 ] 

ASF subversion and git services commented on IMPALA-13024:
--

Commit 7ca9798949ffd31b703c610e4eca20d0e0181ab3 in impala's branch 
refs/heads/branch-4.4.0 from Riza Suminto
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=7ca979894 ]

IMPALA-13024: Ignore slots if using default pool and empty group

Slot based admission should not be enabled when using default pool.
There is a bug where coordinator-only query still does slot based
admission because executor group name set to
ClusterMembershipMgr::EMPTY_GROUP_NAME ("empty group (using coordinator
only)"). This patch adds check to recognize coordinator-only query in
default pool and skip slot based admission for it.

Testing:
- Add BE test AdmissionControllerTest.CanAdmitRequestSlotsDefault.
- In test_executor_groups.py, split test_coordinator_concurrency to
  test_coordinator_concurrency_default and
  test_coordinator_concurrency_two_exec_group_cluster to show the
  behavior change.
- Pass core tests in ASAN build.

Change-Id: I0b08dea7ba0c78ac6b98c7a0b148df8fb036c4d0
Reviewed-on: http://gerrit.cloudera.org:8080/21340
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
(cherry picked from commit 29e418679380591ea7b8e4bdb797ccdbb0964a3d)


> Several tests timeout waiting for admission
> ---
>
> Key: IMPALA-13024
> URL: https://issues.apache.org/jira/browse/IMPALA-13024
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Csaba Ringhofer
>Assignee: Riza Suminto
>Priority: Critical
> Fix For: Impala 4.4.0
>
>
> A bunch of seemingly unrelated tests failed with the following message:
> Example: 
> query_test.test_spilling.TestSpillingDebugActionDimensions.test_spilling_aggs[protocol:
>  beeswax | exec_option: {'mt_dop': 1, 'debug_action': None, 
> 'default_spillable_buffer_size': '256k'} | table_format: parquet/none] 
> {code}
> ImpalaBeeswaxException: EQuery aborted:Admission for query exceeded 
> timeout 6ms in pool default-pool. Queued reason: Not enough admission 
> control slots available on host ... . Needed 1 slots but 18/16 are already in 
> use. Additional Details: Not Applicable
> {code}
> This happened in an ASAN build. Another test also failed which may be related 
> to the cause:
> custom_cluster.test_admission_controller.TestAdmissionController.test_queue_reasons_slots
>  
> {code}
> Timeout: query 'e1410add778cd7b0:c40812b9' did not reach one of the 
> expected states [4], last known state 5
> {code}
> test_queue_reasons_slots seems to be know flaky test: IMPALA-10338



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13015) Dataload fails due to concurrency issue with test.jceks

2024-05-06 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843791#comment-17843791
 ] 

ASF subversion and git services commented on IMPALA-13015:
--

Commit 5045f19b5374678c10888376955f2ff5e360ae5b in impala's branch 
refs/heads/branch-4.4.0 from Abhishek Rawat
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=5045f19b5 ]

IMPALA-13015: Dataload fails due to concurrency issue with test.jceks

Move 'hadoop credential' command used for creating test.jceks to
testdata/bin/create-load-data.sh. Earlier it was in bin/load-data.py
which is called in parallel and was causing failures due to race
conditions.

Testing:
- Ran JniFrontendTest#testGetSecretFromKeyStore after data loading and
test ran clean.

Change-Id: I7fbeffc19f2b78c19fee9acf7f96466c8f4f9bcd
Reviewed-on: http://gerrit.cloudera.org:8080/21346
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
(cherry picked from commit f620e5d5c0bbdb0fd97bac31c7b7439cd13c6d08)


> Dataload fails due to concurrency issue with test.jceks
> ---
>
> Key: IMPALA-13015
> URL: https://issues.apache.org/jira/browse/IMPALA-13015
> Project: IMPALA
>  Issue Type: Bug
>  Components: Infrastructure
>Affects Versions: Impala 4.4.0
>Reporter: Joe McDonnell
>Assignee: Abhishek Rawat
>Priority: Major
>  Labels: flaky
>
> When doing dataload locally, it fails with this error:
> {noformat}
> Traceback (most recent call last):
>   File "/home/joemcdonnell/upstream/Impala/bin/load-data.py", line 523, in 
> 
>     if __name__ == "__main__": main()
>   File "/home/joemcdonnell/upstream/Impala/bin/load-data.py", line 322, in 
> main
>     os.remove(jceks_path)
> OSError: [Errno 2] No such file or directory: 
> '/home/joemcdonnell/upstream/Impala/testdata/jceks/test.jceks'
> Background task Loading functional-query data (pid 501094) failed.
> {noformat}
> testdata/bin/create-load-data.sh calls bin/load-data.py for functional, 
> TPC-H, and TPC-DS in parallel, so this logic has race conditions:
> {noformat}
>   jceks_path = TESTDATA_JCEKS_DIR + "/test.jceks"
>   if os.path.exists(jceks_path):
>     os.remove(jceks_path){noformat}
> I don't see a specific reason for this to be in bin/load-data.py. It should 
> be moved somewhere else that doesn't run in parallel. One possible location 
> is to add a step in testdata/bin/create-load-data.sh
> This was introduced in 
> [https://github.com/apache/impala/commit/9837637d9342a49288a13a421d4e749818da1432]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-13038) Support profile tab for imported query profiles

2024-05-06 Thread Surya Hebbar (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843707#comment-17843707
 ] 

Surya Hebbar commented on IMPALA-13038:
---

[Example Query Profile JSON | 
^json_profile_a34485359bfdfe1f_3ca8177b.json]
[Example Query Profile Text | 
^json_profile_a34485359bfdfe1f_3ca8177b.txt]

> Support profile tab for imported query profiles
> ---
>
> Key: IMPALA-13038
> URL: https://issues.apache.org/jira/browse/IMPALA-13038
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Surya Hebbar
>Assignee: Surya Hebbar
>Priority: Major
> Attachments: json_profile_a34485359bfdfe1f_3ca8177b.json, 
> json_profile_a34485359bfdfe1f_3ca8177b.txt
>
>
> Query profile imports currently support the following tabs.
>  - Query Statement
>  - Query Timeline
>  - Query Text Plan
> It would be helpful to support "Query Profile" tab for these imports.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-13038) Support profile tab for imported query profiles

2024-05-06 Thread Surya Hebbar (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Surya Hebbar updated IMPALA-13038:
--
Attachment: json_profile_a34485359bfdfe1f_3ca8177b.txt
json_profile_a34485359bfdfe1f_3ca8177b.json

> Support profile tab for imported query profiles
> ---
>
> Key: IMPALA-13038
> URL: https://issues.apache.org/jira/browse/IMPALA-13038
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Surya Hebbar
>Assignee: Surya Hebbar
>Priority: Major
> Attachments: json_profile_a34485359bfdfe1f_3ca8177b.json, 
> json_profile_a34485359bfdfe1f_3ca8177b.txt
>
>
> Query profile imports currently support the following tabs.
>  - Query Statement
>  - Query Timeline
>  - Query Text Plan
> It would be helpful to support "Query Profile" tab for these imports.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-13038) Support profile tab for imported query profiles

2024-05-06 Thread Surya Hebbar (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Surya Hebbar updated IMPALA-13038:
--
Summary: Support profile tab for imported query profiles  (was: Support 
Profile tab for Imported Query Profiles)

> Support profile tab for imported query profiles
> ---
>
> Key: IMPALA-13038
> URL: https://issues.apache.org/jira/browse/IMPALA-13038
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Surya Hebbar
>Assignee: Surya Hebbar
>Priority: Major
>
> Query profile imports currently support the following tabs.
>  - Query Statement
>  - Query Timeline
>  - Query Text Plan
> It would be helpful to support "Query Profile" tab for these imports.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org