Re: [VOTE] Release PyLucene 9.7.0-rc1

2023-07-12 Thread Michael McCandless
+1 I ran the same exciting smoke test -- indexing first 100K enwiki docs, running a few political searches, force merging, searching again. Everything ran fine! Arch Linux kernel 6.3.2, Java 17.0.7+7, Python 3.11.3. Sorry for the delay! Mike On Sun, Jul 9, 2023 at 3:28 PM Dawid Weiss wrote:

Re: How to retain % sign against numbers in lucene indexing/ search

2023-07-12 Thread Mikhail Khludnev
Hello Amitesh. If StandardTokenizer does so (but it's worth to doublecheck on Solr Admin Analysis screen), you can experiment with WhitespaceTokenizer. On Wed, Jul 12, 2023 at 3:33 PM Amitesh Kumar wrote: > Hi Group, > > I am facing a requirement change to get % sign retained in searches. e.g >

How to retain % sign against numbers in lucene indexing/ search

2023-07-12 Thread Amitesh Kumar
Hi Group, I am facing a requirement change to get % sign retained in searches. e.g Sample search docs: 1. Number of boys 50 2. My score was 50% 3. 40-50% for pass score Search query: 50% Expected results: Doc-2, Doc-3 i.e. My score was 50% 40-50% for pass score Actual result: All 4 documents

Re: Benefits of using bytes vector for HNSW

2023-07-12 Thread Alessandro Benedetti
Hi ! You are on the spot, you deal with data 4 times smaller (of course also able to represent 1/4 of the information). But if you are ok with that you may achieve a lighter memory footprint (not 4 times lighter as there are a lot of boilerplate structures as well, but still a decent improvement).