Hi AsterixDB community,

My name is Muhammad Abdullah, a Computer Science student at Information
Technology University (ITU), Lahore, Pakistan (CGPA: 3.91/4.00). I have
submitted a GSoC 2026 proposal for the NL2SQL++ Intelligent Query Assistant
project.

A bit about my relevant background:
- Built Make-It-RAG: a natural language to SQL system using Google Gemini
with a React chat interface
- Built Klebbix: a production RAG system with hybrid search over Qdrant +
Azure OpenAI, processing 8+ file formats at 60+ req/min
- Strong Python, FastAPI, and LLM integration experience

To prepare, I ran AsterixDB locally via Docker, explored the TinySocial
sample dataset in SQL++, and reviewed the /query/service REST API. I also
just submitted PR #41 (https://github.com/apache/asterixdb/pull/41) fixing
a couple of typos in the README — a small first step to get familiar with
the contribution workflow.

My proposal covers:
1. A schema-aware RAG context engine that ingests AsterixDB metadata and
retrieves relevant schema fragments per query
2. A model-agnostic LLM bridge (OpenAI, Gemini, Ollama) with SQL++-specific
few-shot prompting
3. A query validator using AsterixDB's /query/service explain mode with a
self-correction loop
4. A REST service + minimal web UI, with full integration tests against
TinySocial and Yelp sample datasets

One specific question for the community: is there an existing SQL++ parser
test suite in the repo that would be the right target for validation test
cases in this project?

Thank you,
Muhammad Abdullah
GitHub: github.com/abdullahiftikharcode
Email: [email protected]

Reply via email to