Hi gents,
I have spent some time in the last year or so working on the Lucene More
Like This ( and related Solr components ) .

Initially I just wanted to improve it, adding BM25[1] but then I noted a
lot of areas of possible improvements.

I started then with a refactor of the functionality with these objectives
in mind :

1) make the MLT more readable
2) make the MLT more modular and easy to extend
3) make the MLT more tested

*This is just a start, I want to invest significant time with my company to
work on the functionality and contribute it back.*

I split my effort in small Pull Requests to make it easy a review and
possible contribution.

Unfortunately I didn't get much feedback so far.
The More Like This functionality seems mostly abandoned.
I tried also to contact one of the last committers that apparently got
involved in the developments ( Mark Harwood [email protected] ), but I
had no luck.

This is the current Jira Issue that start with a first small refactor +
tests :

https://issues.apache.org/jira/browse/SOLR-12299

I would love to contribute it and much more, but I need some feedback and
review ( unfortunately I am not a committer yet).

Let me know what can I do to speed up the process from my side.

Regards

[1] https://issues.apache.org/jira/browse/LUCENE-7498

--------------------------
Alessandro Benedetti
Search Consultant, R&D Software Engineer, Director
www.sease.io

Reply via email to