Hi Blake,

As I understand LLMs, they don't need a parser at all: embedding models work directly on the text. There have been experiments with embedding AST nodes (code2vec, code2seq), but what people usually do is just treat code like any other piece of text.
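To make the "treat code as text" idea concrete, here is a minimal sketch of a parser-free chunker. It is in Python only for illustration, and the function name `chunk_lines`, the window size, and the overlap are all assumptions of mine, not anything from an existing tool. It splits a source file into overlapping windows of lines; each window would then go to whatever embedding model you use.

```python
def chunk_lines(source: str, max_lines: int = 40, overlap: int = 5) -> list[str]:
    """Split source text into overlapping windows of lines, without parsing it.

    max_lines and overlap are illustrative defaults, not tuned values.
    """
    if max_lines <= 0 or not 0 <= overlap < max_lines:
        raise ValueError("need max_lines > 0 and 0 <= overlap < max_lines")
    lines = source.splitlines()
    step = max_lines - overlap  # how far each new window advances
    chunks = []
    for start in range(0, len(lines), step):
        chunks.append("\n".join(lines[start:start + max_lines]))
        if start + max_lines >= len(lines):
            break  # this window already reached the end of the file
    return chunks


# Hypothetical Groovy input; any text works, since nothing is parsed.
sample = """class Greeter {
    String greet(String name) {
        "Hello, ${name}"
    }

    int add(int a, int b) {
        a + b
    }
}"""

for i, chunk in enumerate(chunk_lines(sample, max_lines=5, overlap=1)):
    print(f"--- chunk {i} ---")
    print(chunk)
```

The overlap is there so that a method cut by a window boundary still appears mostly intact in at least one chunk. The trade-off against per-method chunking is that chunk boundaries ignore the structure of the code.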
On Mon, 7 Apr 2025 at 01:49, Blake McBride <bl...@mcbridemail.com> wrote:

> Greetings,
>
> I am trying to write a RAG chunker for Groovy. This is used to
> (essentially) train an AI/LLM on my code base so that the AI/LLM can help
> me with my Groovy application.
>
> Essentially, what I need to do is read in a Groovy source file and do
> something (create embeddings) for each individual method. This was pretty
> trivial in Java because there are ready-made Java parsers. However, I have
> spent a long time trying to create a parser for Groovy but have so far been
> unsuccessful.
>
> I sure appreciate any suggestions.
>
> Thanks!
>
> Blake McBride