surapuramakhil commented on issue #28167: URL: https://github.com/apache/superset/issues/28167#issuecomment-2077783230
> like this may be feasible for a user with a handful of tables, but at-scale it simply doesn't work

This is a limitation of language models, which is exactly why the choice of LLM is left to users. When the scale is large, the best they can do is use a model with a large context window, such as Gemini 1.5 Pro. That is a separate data science problem that Apache Superset doesn't need to solve; it can just leverage what is available.

---------------------------------------------------------------------------------------------------

> Using RAG to pull relevant table metadata at prompt-time still led to unmanageable levels of LLM hallucination that only grows worse as the size of the warehouse being queried increases.

This is a separate data science problem that Apache Superset doesn't need to solve. The LangChain community (quite popular in data science) is currently working on it, and we can just leverage that work.

Query checking, which might protect against hallucination: https://python.langchain.com/docs/use_cases/sql/query_checking/

Prompting / RAG strategies for working at scale: https://python.langchain.com/docs/use_cases/sql/large_db/

As both evolve over time, the quality of the generated queries will keep improving.

---------------------------------------------------------------------------------------------------

> a query that is 99% correct is functionally worthless if this is intended to be utilized by folks who don't have the skills necessary to parse through AI-generated SQL.

I agree with you on this: it doesn't fully solve the problem for users who lack the knowledge needed to understand AI-generated SQL. It's a copilot rather than an autopilot.

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.
To unsubscribe, e-mail: notifications-unsubscr...@superset.apache.org
For queries about this service, please contact Infrastructure at: us...@infra.apache.org
For additional commands, e-mail: notifications-h...@superset.apache.org