Hi Jochen,

There is a restriction in FVH (FastVectorHighlighter): it cannot deal with variable gram sizes. That is, minGramSize must equal maxGramSize in your NGramFilterFactory setting.
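As a rough illustration of that restriction, a field type with a fixed gram size might look like the sketch below. The name string_parts_fixed and the gram size of 3 are placeholders, not taken from the original setup. The n-gram step in the query analyzer is an untested assumption: without it, a query term longer than the gram size would presumably no longer match any indexed gram.

```xml
<!-- Sketch only: hypothetical field type name, arbitrary gram size. -->
<fieldType name="string_parts_fixed" class="solr.TextField" positionIncrementGap="100" omitNorms="true">
  <analyzer type="index">
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- minGramSize equals maxGramSize, as the FVH restriction requires -->
    <filter class="solr.NGramFilterFactory" minGramSize="3" maxGramSize="3"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- Assumption: the query is split into grams of the same size -->
    <filter class="solr.NGramFilterFactory" minGramSize="3" maxGramSize="3"/>
  </analyzer>
</fieldType>
```

With a single fixed gram size, search terms shorter than that size would likely stop matching, so the size has to be weighed against the shortest query you need to support.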
koji
-- 
http://soleami.com/blog/lucene-4-is-super-convenient-for-developing-nlp-tools.html

(13/03/18 22:17), Jochen Just wrote:
Hi list,

I have the following field type defined in my schema.xml in order to be able to do in-word search.

<fieldType name="string_parts_back" class="solr.TextField" positionIncrementGap="100" omitNorms="true">
  <analyzer type="index">
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.NGramFilterFactory" minGramSize="1" maxGramSize="1000"/>
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.KeywordTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>

Searching itself works as expected, but highlighting is causing me headaches. At first I did not use the FastVectorHighlighter, which meant highlighting did not work at all for fields of this type. Since I switched to the FastVectorHighlighter, highlighting works most of the time, but sometimes it doesn't.

Given a document containing the word 'Superkalifragilistischexpialligetisch' and a search for 'uperkalifragilistische', I would expect the result 'S<em>uperkalifragilistische</em>xpialligetisch', but I get 'S<em>uperkalifragilist</em>ischexpialligetisch'. So 'ische' is missing from the highlighted part.

Sadly, I am not able to create a simple setup to reproduce this; it only happens in our in-house live system. However, if I remove some fields from the qf attribute of the edismax parser in solrconfig.xml, it stops behaving like that. Some of those removed fields have the fieldType string_parts_back.

Does anyone have a clue what's going on?

Thanks in advance,
Jochen

-- 
Jochen Just           Fon:   (++49) 711/28 07 57-193
avono AG              Mobil: (++49) 172/73 85 387
Breite Straße 2       Mail:  jochen.j...@avono.de
70173 Stuttgart       WWW:   http://www.avono.de