Use absolute term position for scoring

2015-08-31 Thread aurelien . mazoyer

Hi all,

I want to take into account the absolute position of the term for the 
score calculation.
I found many threads that deal with this issue, and the answer is often: 
"use SpanFirstQuery".
The problem with this approach is that it is too "boolean" for me (the 
document matches the spanfirstquery or it does not, so the boost is 
either 0 or 1) and you cannot do exciting calculation with log of the 
position or other funny functions.
If the position was not there, we could put it in the payload of the 
term and then reuse it with a customscorer. But we already have the 
position in the term, so why can't we use the same approach? Is it a 
problem of interface? Or maybe it would have a huge impact on 
performance?


Thank you for your advice,

Regards,

Aurelien



-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



Indexing a binary field

2015-08-31 Thread Dan Smith
What's the best way to index binary data in Lucene? I'm adding a Lucene
index to a key value store, and I want to be able to delete documents based
on a binary key.

As far as I can tell the Lucene API does not support indexing binary data,
and I was looking into Base64 encoding the key. Is there a better way?

-Dan


Re: Indexing a binary field

2015-08-31 Thread Michael McCandless
StringField now also takes a BytesRef value to index, so you can index
a single binary token that way.  Does that work?

Mike McCandless

http://blog.mikemccandless.com


On Mon, Aug 31, 2015 at 12:19 PM, Dan Smith  wrote:
> What's the best way to index binary data in Lucene? I'm adding a Lucene
> index to a key value store, and I want to be able to delete documents based
> on a binary key.
>
> As far as I can tell the Lucene API does not support indexing binary data,
> and I was looking into Base64 encoding the key. Is there a better way?
>
> -Dan

-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org



Re: Indexing a binary field

2015-08-31 Thread Dan Smith
Aha! My version of Lucene was out of date. That should work perfectly. 

Thanks, 
-Dan

 Original message 
From: Michael McCandless  
Date:08/31/2015  12:57 PM  (GMT-08:00) 
To: Lucene Users , dsm...@pivotal.io 
Cc:  
Subject: Re: Indexing a binary field 

StringField now also takes a BytesRef value to index, so you can index
a single binary token that way.  Does that work?

Mike McCandless

http://blog.mikemccandless.com


On Mon, Aug 31, 2015 at 12:19 PM, Dan Smith  wrote:
> What's the best way to index binary data in Lucene? I'm adding a Lucene
> index to a key value store, and I want to be able to delete documents based
> on a binary key.
>
> As far as I can tell the Lucene API does not support indexing binary data,
> and I was looking into Base64 encoding the key. Is there a better way?
>
> -Dan


IndexWriter is not closing the FDs (deleted files)

2015-08-31 Thread Marcio Napoli
Hey! :)

It seems IndexWriter is not closing the descriptors of the removed files,
see the log below.

Thanks,
Napoli

[root@server01 log]# ls -l /proc/59491/fd  | grep index
l-wx--. 1 wildfly wildfly 64 Ago 31 11:26 429 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/write.lock
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 529 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_4.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 530 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_3.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 531 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_2.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 532 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_1.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 533 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_0.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 535 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_a.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 536 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_9.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 537 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_8.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 538 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_7.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 539 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_6.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 540 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_5_Lucene41_0.doc
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 541 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_5_Lucene41_0.pos
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 542 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_5_Lucene41_0.tim
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 543 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_5.nvd (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 544 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_5.fdt (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 545 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_5_Lucene410_0.dvd
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 619 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_o.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 676 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_k.cfs (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 677 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_j_Lucene41_0.doc
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 678 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_j_Lucene41_0.pos
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 679 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_j_Lucene41_0.tim
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 680 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_j.nvd (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 681 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_j.fdt (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 682 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_j_Lucene410_0.dvd
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 685 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_n_Lucene41_0.doc
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 686 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_n_Lucene41_0.pos
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 687 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_n_Lucene41_0.tim
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 688 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_n.nvd (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 689 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_n.fdt (deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 690 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_n_Lucene410_0.dvd
(deleted)
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 730 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_15.cfs
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 737 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_r_Lucene41_0.doc
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 738 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_r_Lucene41_0.pos
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 739 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_r_Lucene41_0.tim
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 740 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_r.nvd
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 741 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_r.fdt
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 742 ->
/usr/local/wildfly-2.0/standalone/data/index/cidadao/_r_Lucene410_0.dvd
lr-x--. 1 wildfly wildfly 64 Ago 31 20:25

Re: IndexWriter is not closing the FDs (deleted files)

2015-08-31 Thread Anton Zenkov
Are you sure you are not holding open readers somewhere?

On Mon, Aug 31, 2015 at 7:46 PM, Marcio Napoli 
wrote:

> Hey! :)
>
> It seems IndexWriter is not closing the descriptors of the removed files,
> see the log below.
>
> Thanks,
> Napoli
>
> [root@server01 log]# ls -l /proc/59491/fd  | grep index
> l-wx--. 1 wildfly wildfly 64 Ago 31 11:26 429 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/write.lock
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 529 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_4.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 530 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_3.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 531 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_2.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 532 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_1.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 533 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_0.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 535 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_a.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 536 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_9.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 537 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_8.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 538 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_7.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 539 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_6.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 540 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_5_Lucene41_0.doc
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 541 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_5_Lucene41_0.pos
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 542 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_5_Lucene41_0.tim
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 543 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_5.nvd (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 544 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_5.fdt (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 11:26 545 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_5_Lucene410_0.dvd
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 619 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_o.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 676 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_k.cfs (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 677 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_j_Lucene41_0.doc
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 678 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_j_Lucene41_0.pos
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 679 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_j_Lucene41_0.tim
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 680 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_j.nvd (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 681 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_j.fdt (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 682 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_j_Lucene410_0.dvd
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 685 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_n_Lucene41_0.doc
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 686 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_n_Lucene41_0.pos
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 687 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_n_Lucene41_0.tim
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 688 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_n.nvd (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 689 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_n.fdt (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 690 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_n_Lucene410_0.dvd
> (deleted)
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 730 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_15.cfs
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 737 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_r_Lucene41_0.doc
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 738 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_r_Lucene41_0.pos
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 739 ->
> /usr/local/wildfly-2.0/standalone/data/index/cidadao/_r_Lucene41_0.tim
> lr-x--. 1 wildfly wildfly 64 Ago 31 20:25 740 ->
> /usr/local/wildfly-2.0/standalone/data/index/c