//me: always pitching own stuff https://github.com/mkhludnev/solr-flexible-qparser might give a full control over query preprocessing.
On Thu, May 30, 2024 at 12:20 PM Jan Høydahl <jan....@cominvent.com> wrote: > I wrote this little plugin years ago > https://github.com/cominvent/request-sanitizer-component > That would be useful if you don't have control over the client doing the > requests. > If you control the client I'd add some sanitation there. > > Jan > > > 29. mai 2024 kl. 16:11 skrev Dmitri Maziuk <dmitri.maz...@gmail.com>: > > > > Hi all, > > > > our website has a search box that essentially passes its contents to > Solr without any massaging. This works fine 99% of the time, the other 1% > is when a misbehaving bot hits it and tries stuffing all sorts of crap in > there. > > > > Then bad things happen: Java's overly verbose exception stack traces > fill up the disk faster than the logs are rotated, CPU load spikes, etc. > > > > So, question: does anyone know of a validator/sanitizer we can use clean > up the terms before passing them on to Solr? -- My google-fu fails to find > one. > > > > TIA > > Dima > > -- Sincerely yours Mikhail Khludnev