Hi Lucas! Thanks for your interest and for reaching out. Sounds like your background could provide you with a useful set of fresh eyes to view our codebase through!
This question comes up a lot (good starter issues) and I don't think our answers are ever all that satisfying. I will share a few thoughts though (all just my personal opinions): 1. Issues that familiarize you with parts of the codebase you're interested in are always useful to seek out. Sounds like a lot of your interest may gravitate towards the "core" module? Just a guess. 2. Looking at active PRs that are being iterated on is actually a really good way to get started. It will be slow going at first, but will force you to go understand parts of the codebase. Said differently, it can be useful to focus on providing feedback on changes other people are working on at first as opposed to setting off and making changes (although I know that's less fun a lot of the time). For something more concrete, Mike M. (long time community member) has a nice tool for searching issues and PRs. I added some filters to look for open issues that have had no comments and are not assigned to anyone (good sign they are not being worked on) along with a couple other filters. This could provide a good list to start with: https://githubsearch.mikemccandless .com/search.py?chg=page&text=&a1=1&a2=undefined&page=0&searcher=36672&sort=recentlyUpdated&format=list&id=vjf5tu0klway&dd=status%3AOpen&dd=issue_or_pr% 3AIssue&dd=comment_count%3A0&dd=issue_type%3Aenhancement&dd=assignee% 3AUnassigned&newText= Just scanning through that list briefly, here are some that jumped out as possibly good starting points (but I didn't look in detail so I would suggest asking on the issue if it's still relevant and check that nobody is working on it). * https://github.com/apache/lucene/issues/13207 * https://github.com/apache/lucene/issues/13598 * https://github.com/apache/lucene/issues/13084 * https://github.com/apache/lucene/issues/12919 Best of luck and have fun! Cheers, -Greg On Wed, Jul 24, 2024 at 2:56 AM Lucas Wolf <iamlucasw...@gmail.com> wrote: > Hi everyone, > > My name is Lucas and I am interested in contributing to Lucene. > > I have read through the issues list on GitHub but felt that I was lacking > a bit of context on what is achievable/impactful to tackle as a newcomer. > Perhaps someone here can help me out. :) > > My background is mostly in main-memory relational database (performance) > engineering in C++. However, I recently became interested in JVM/OpenJDK > internals and am looking for a project to put my knowledge to good use. > > I'm generally open to anything, except perhaps Vector Search, as that > would likely pose a conflict of interest with my day job. > > Thanks! > > Best, > Lucas Wolf >