Automatic pre-processing of documents may be a good fit for an Update
Request Processor.
In Apache Solr I contributed a while back to the:
https://sease.io/2015/07/solr-document-classification-part-1-indexing-time.html


This update request processor internally uses the Apache Lucene document
classification module, that uses the More Like This internally.

I know that potentially you want your editor's feedback in a supervised
way, but if your direction is toward automatic enrichment, take a look as
it could be interesting as an inspiration for your use case.

Cheers
--------------------------
*Alessandro Benedetti*
Director @ Sease Ltd.
*Apache Lucene/Solr Committer*
*Apache Solr PMC Member*

e-mail: [email protected]


*Sease* - Information Retrieval Applied
Consulting | Training | Open Source

Website: Sease.io <http://sease.io/>
LinkedIn <https://linkedin.com/company/sease-ltd> | Twitter
<https://twitter.com/seaseltd> | Youtube
<https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github
<https://github.com/seaseltd>


On Thu, 8 Sept 2022 at 12:16, Batanun B <[email protected]> wrote:

> Hi,
>
> I'm evaluating if the MoreLikeThis (mlt) feature of solr can be useful for
> our editors when they are creating new content. We want to trigger this
> before the content has been inserted in the system, so there is no document
> in solr that we can use as a base for the mlt search. So we want to use the
> "externally supplied text" feature, where we provide the article text in
> the request body of the search. This works great when we use the mlt
> request handler (/mlt). But we also would like to get facets for this
> search, and bug SOLR-7883 is stopping us from doing that.
>
> Some people suggest that we use the mlt query parser instead, as part of
> our regular request parser (/select). But I can't get that to work together
> with the "externally supplied text". It gives me the error "Bad contentType
> for search handler :text/plain".
>
> So, does anyone know how to do a search that uses MoreLikeThis with
> externally supplied text, and facets at the same time?
>
> Regards
>
>

Reply via email to