GitHub user gupta-sahil01 added a comment to the discussion: Ambient "operator 
recommender" — predictive next-operator suggestions on the canvas

1) Offline analysis
Three small text files we write once and update as the operator catalogue 
changes — no model training:

a short description of each operator (so the LLM knows what each one does)
a handful of example patterns ("after a Filter on text, suggest Sentiment") to 
guide ranking
a small test set of workflows to check ranking quality in CI

Adding a new operator just means adding one line to the descriptions file. CI 
catches it if you forget.
2) What's stored and where
Two places:

In the repo (versioned with code, reviewed in PRs): the three files above, plus 
the LLM prompt templates. That's the entire "knowledge base."
In memory at runtime (auto-evicting cache): recent suggestion results, and 
one-time analyses of any UDF source code we've seen. Caches a recent UDF 
summary so we don't pay the LLM to re-read the same Python twice.

No database, no model registry, no separate service. Four files in the repo + a 
cache.

GitHub link: 
https://github.com/apache/texera/discussions/5240#discussioncomment-17154815

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]

Reply via email to