maxgruber19 opened a new issue, #34992:
URL: https://github.com/apache/superset/issues/34992

   ### Bug description
   
   We found a very ugly issue when connecting superset (version 4.0.2, not 
selectable in the issue template 😄) to trino (476). A user has reported that 
superset is very slow and no query is processed sometimes. We found out that 
superset does not finish its queries when somebody selects a table (iceberg via 
trino) in the "SEE TABLE SCHEMA" dropdown. Combined with our resource pools 
(users can submit only 5 queries at a time, 6th will be queued) that's 
definitely a customer facing problem for us, so I'm seeking for help.
   
   <img width="928" height="426" alt="Image" 
src="https://github.com/user-attachments/assets/a71de6dd-c525-4899-ba46-0d3f5faa1245";
 />
   
   3 Queries to trino are fired from superset which are all the same. That's a 
thing I don't understand as well. Why 3 times the same query?
   ```
   SELECT * FROM default."tablename$partitions"
   ```
   
   <img width="1181" height="551" alt="Image" 
src="https://github.com/user-attachments/assets/def11925-22b2-4e1f-a6a3-8d6f99a3e4e8";
 />
   
   Superset already gets a result after couple of seconds and displays the 
table schema but the query stays in state "FINISHING" until a timeout of ~5mins 
is hit (which is the default query.client.timeout of trino). The query then is 
abandoned by trino itself.
   
   ```
   io.trino.spi.TrinoException: Query 20250828_082847_00409_ksycd was abandoned 
by the client, as it may have exited or stopped checking for query results. 
Query results have not been accessed since 2025-08-28T08:28:49.874Z: 
currentTime 2025-08-28T08:33:50.786Z
        at 
io.trino.execution.QueryTracker.failAbandonedQueries(QueryTracker.java:275)
        at io.trino.execution.QueryTracker.lambda$start$0(QueryTracker.java:83)
        at 
java.base/java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source)
        at java.base/java.util.concurrent.FutureTask.runAndReset(Unknown Source)
        at 
java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(Unknown
 Source)
        at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown 
Source)
        at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown 
Source)
        at java.base/java.lang.Thread.run(Unknown Source)
   ```
   
   The tables have a bigger amount of partitions, we didn't manage to optimize 
them yet. Some of the tables have ~10.000 partitions (there are ugly reasons 
for that). Maybe that's relevant, but I don't think so because superset 
displays the schema just in time - it feels like just a thing of non 
terminating queries. According to trino, superset fetches ~10000 rows and 5mb 
from those queries within seconds.
   
   We already checked with cilium-hubble for outgoing traffic from superset to 
trino while the query is still listed as FINISHING but it seems like the query 
becomes a zombie right after superset dissplays the table schema. 
   
   Please let me know if I can provide any further details
   
   ### Screenshots/recordings
   
   _No response_
   
   ### Superset version
   
   4.1.3
   
   ### Python version
   
   3.9
   
   ### Node version
   
   I don't know
   
   ### Browser
   
   Not applicable
   
   ### Additional context
   
   _No response_
   
   ### Checklist
   
   - [x] I have searched Superset docs and Slack and didn't find a solution to 
my problem.
   - [x] I have searched the GitHub issue tracker and didn't find a similar bug 
report.
   - [x] I have checked Superset's logs for errors and if I found a relevant 
Python stacktrace, I included it here as text in the "additional context" 
section.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to