Hi AsterixDB Team, My name is *Pratham Tomar*, a third-year B.Tech IT student at VIT Pune (CGPA: 9.19) and an incoming Summer Intern at Barclays. I am a *LeetCode Knight (~1950 rating)* with 600+ problems solved, and my core expertise includes *C++, Java, and backend development using Spring Boot*. I am also an active open-source contributor with *15+ Hacktoberfest contributions*, earning all badges including the *Super Contributor* badge.
To understand the project domain, I studied recent research on *Text-to-SQL systems* and set up *Apache AsterixDB locally*. I downloaded the binary distribution, cloned the repository, and tested several *SQL++ queries* to better understand the query engine and schema interaction. Based on this exploration, I drafted a *GSoC proposal and architecture design* for an *NL2SQL++ assistant for AsterixDB* using LangChain4j. The design incorporates ideas from *TRISQL and C3+DIN*, including schema linking, structured prompt generation, and an execution-based self-correction loop. I have shared a *Google Docs draft of my proposal along with the architecture diagram* for your feedback. Additionally, I have experience building *AI agent systems using LangGraph in Python*. One example is *AgentPlay*, an AI agent that performs *real-time speech translation between languages*: https://github.com/IEEE-SB-VIT-Pune/agentPlay I would greatly appreciate any feedback or suggestions on the proposal. Best regards, Pratham Tomar Research Papers i referred: 1. https://www.nature.com/articles/s41598-026-39128-9 2. https://www.scitepress.org/Papers/2024/125552/125552.pdf Draft : https://docs.google.com/document/d/1NbD7_rp1oO-QsblcaQ55ZifgNlAK0lX15mMuHnIazo4/
