Hi AsterixDB community, My name is Muhammad Abdullah, a Computer Science student at Information Technology University (ITU), Lahore, Pakistan (CGPA: 3.91/4.00). I have submitted a GSoC 2026 proposal for the NL2SQL++ Intelligent Query Assistant project.
A bit about my relevant background: - Built Make-It-RAG: a natural language to SQL system using Google Gemini with a React chat interface - Built Klebbix: a production RAG system with hybrid search over Qdrant + Azure OpenAI, processing 8+ file formats at 60+ req/min - Strong Python, FastAPI, and LLM integration experience To prepare, I ran AsterixDB locally via Docker, explored the TinySocial sample dataset in SQL++, and reviewed the /query/service REST API. I also just submitted PR #41 (https://github.com/apache/asterixdb/pull/41) fixing a couple of typos in the README — a small first step to get familiar with the contribution workflow. My proposal covers: 1. A schema-aware RAG context engine that ingests AsterixDB metadata and retrieves relevant schema fragments per query 2. A model-agnostic LLM bridge (OpenAI, Gemini, Ollama) with SQL++-specific few-shot prompting 3. A query validator using AsterixDB's /query/service explain mode with a self-correction loop 4. A REST service + minimal web UI, with full integration tests against TinySocial and Yelp sample datasets One specific question for the community: is there an existing SQL++ parser test suite in the repo that would be the right target for validation test cases in this project? Thank you, Muhammad Abdullah GitHub: github.com/abdullahiftikharcode Email: [email protected]
