Hey David,

Thank you for doing this! I don't have good ideas for new good first issues.

However, something that we need to periodically do and hasn't been done in
a long time now is taking a look at the slower Lucene tests and checking if
they are slow for a good reason or not. Some tests are slow for a good
reason (e.g. PostingsFormatTestCase and DocValuesFormatTestCase sub classes
usually), but often, they are not slow for a good reason but because they
do too much merging or index many documents with the SimpleText codec. For
instance I've seen TestPriorityQueue and TestLateInteractionRescorer often
take several seconds recently, I'm not sure if this is needed. There are
likely many other tests in this case. This could be a good opportunity to
get familiar with Lucene's tooling (tests.slowestTests, tests.profile,
etc.), how Lucene works (text analysis, indexing, merging, searching), and
getting a first change in. I wouldn't mind separate PRs for each slow test
that gets a speedup to help share this task across multiple participants to
the Hackathon.

On Wed, Sep 10, 2025 at 6:16 PM David Smiley <dsmi...@apache.org> wrote:

> I'll be running a hackathon tomorrow at Community-over-Code with Nazerke.
> I'll be sharing the "good first issue" issues as a potential list of items
> for new contributors:
> https://github.com/apache/lucene/issues?q=is%3Aissue%20state%3Aopen%20label%3A%22good%20first%20issue%22
>  (only 9 issues as I write this)
>
> You can help our project & contributors & hackathon simply by doing a
> little label grooming.  If you created an issue recently and think it's a
> "good first issue" for a contributor then add it!  Come to think of it,
> maybe even some bugs might be such.  Or maybe something on your mind hasn't
> been filed yet; please do!
>
> ~ David Smiley
> Apache Lucene/Solr Search Developer
> http://www.linkedin.com/in/davidwsmiley
>


-- 
Adrien

Reply via email to