Bug? "TokenStream contract violation: close() call missing", but close() call was not actually missing

2015-05-25 Thread Trejkaz
Hi all. I found a bug where Tokenizer seems to be complaining about close() not being called, but on further investigation it looks like we're not the ones who are opening it, but rather it's being opened by Lucene. Firstly, the tests rely on this method, which may or may not be correct, but I tr

Optimal FS block size for "small" documents?

2015-05-25 Thread Gili Nachum
Hi, What FS block size to use? I have an RAID-5 of SSD drives currently configured with a 128KB block size. Can I expect better indexing/query time performance with a smaller block size (say 8K)? Considering my documents are almost always smaller than 8K. I assume all stored fields would fit into