surapuramakhil commented on issue #28167:
URL: https://github.com/apache/superset/issues/28167#issuecomment-2077783230

   > like this may be feasible for a user with a handful of tables, but 
at-scale it simply doesn't work
   
   This is the problem with Language Model. That's exactly why LLM choice is 
given to users. If the situation were the scale is high, the best they can with 
high context size model like Gemini pro-1.5. Thats a separate Data Science 
problem which Apache Superset doesn't need to solve. just leverage what is 
available.
   
   
---------------------------------------------------------------------------------------------------
   
   > Using RAG to pull relevant table metadata at prompt-time still led to 
unmanageable levels of LLM hallucination that only grows worse as the size of 
the warehouse being queried increases.
   
   This is a separate data science problem which Apache Superset doesn't need 
to solve, currently langchain community (quite popular in datascience) are 
solving this problem. we just leverage it. 
   
   this might protect from hallucination 
https://python.langchain.com/docs/use_cases/sql/query_checking/ 
   Prompting / RAG strategies while working at scale - 
https://python.langchain.com/docs/use_cases/sql/large_db/
   
   As both evolve (by time), Quality of Queries will become better & better.
   
   
------------------------------------------------------------------------------------------------------
   
   > a query that is 99% correct is functionally worthless if this is intended 
to be utilized by folks who don't have the skills necessary to parse through 
AI-generated SQL.
   I agree with you about this, this doesn't fully solve who doesn't necessary 
knowledge to understand AI generated SQL. It's a copilot instead of an auto 
pilot.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: notifications-unsubscr...@superset.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: notifications-unsubscr...@superset.apache.org
For additional commands, e-mail: notifications-h...@superset.apache.org

Reply via email to