Automatic pre-processing of documents may be a good fit for an Update Request Processor. A while back I contributed one to Apache Solr for document classification at indexing time: https://sease.io/2015/07/solr-document-classification-part-1-indexing-time.html
This update request processor internally uses the Apache Lucene document classification module, which in turn uses More Like This. I know you may want your editors' feedback in a supervised way, but if you are heading toward automatic enrichment, take a look: it could serve as inspiration for your use case.

Cheers
--------------------------
*Alessandro Benedetti*
Director @ Sease Ltd.
*Apache Lucene/Solr Committer*
*Apache Solr PMC Member*

e-mail: [email protected]

*Sease* - Information Retrieval Applied
Consulting | Training | Open Source

Website: Sease.io <http://sease.io/>
LinkedIn <https://linkedin.com/company/sease-ltd> | Twitter <https://twitter.com/seaseltd> | Youtube <https://www.youtube.com/channel/UCDx86ZKLYNpI3gzMercM7BQ> | Github <https://github.com/seaseltd>

On Thu, 8 Sept 2022 at 12:16, Batanun B <[email protected]> wrote:

> Hi,
>
> I'm evaluating if the MoreLikeThis (mlt) feature of solr can be useful for
> our editors when they are creating new content. We want to trigger this
> before the content has been inserted in the system, so there is no document
> in solr that we can use as a base for the mlt search. So we want to use the
> "externally supplied text" feature, where we provide the article text in
> the request body of the search. This works great when we use the mlt
> request handler (/mlt). But we also would like to get facets for this
> search, and bug SOLR-7883 is stopping us from doing that.
>
> Some people suggest that we use the mlt query parser instead, as part of
> our regular request parser (/select). But I can't get that to work together
> with the "externally supplied text". It gives me the error "Bad contentType
> for search handler :text/plain".
>
> So, does anyone know how to do a search that uses MoreLikeThis with
> externally supplied text, and facets at the same time?
>
> Regards
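
For readers following the thread, the working case described in the quoted message (article text posted as the request body to the /mlt handler) might look roughly like the sketch below. It only builds the request and does not send it. The host, the core name `articles`, and the field names `title` and `body` are hypothetical, and the `mlt.mintf`/`mlt.mindf` values are illustrative rather than recommended. It does not address the facet limitation tracked in SOLR-7883.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_mlt_request(base_url, core, text, fields, rows=5):
    """Build (but do not send) a POST to Solr's /mlt handler.

    The article text travels as a text/plain request body, i.e. the
    "externally supplied text" case from the quoted message.
    All argument values used by callers here are assumptions.
    """
    params = urlencode({
        "mlt.fl": ",".join(fields),  # fields to mine for interesting terms
        "mlt.mintf": 1,              # illustrative; tune for your data
        "mlt.mindf": 1,              # illustrative; tune for your data
        "rows": rows,
        "wt": "json",
    })
    url = f"{base_url}/solr/{core}/mlt?{params}"
    return Request(
        url,
        data=text.encode("utf-8"),
        headers={"Content-Type": "text/plain; charset=utf-8"},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it to a running Solr instance that has the /mlt handler configured.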
