Kristian Hermsdorf wrote:
We're using pdftotext as well, because PDFBox is really slow. If your
application has to work under Windows, you will probably experience some
mysterious Java VM crashes while executing external processes in batch mode.
(This is because of a bug in Windows-VM... we
fields. I haven't touched the
default operator, but the queries A AND -B and A AND NOT B give the same
conflicting overlap in the result set.
Thanks in advance,
Christiaan Fluit
Aduna.biz
--
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
OK, I feel a bit stupid now ;) Turns out this issue was discussed a
while ago on both mailing lists, and I even participated in one of
those threads... shame on me.
The problem is indeed in how MultiFieldQueryParser (MFQP) parses my query:
the query A -B becomes:
(text:A -text:B) (title:A -title:B) (path:A -path:B)
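To make the overlap concrete, here is a minimal plain-Java sketch of that per-field expansion. It does not call Lucene at all: the class and method names are made up for illustration, a document is modelled as a map from field name to its set of terms, and the matching rule (any one field clause may match on its own) is my reading of the expanded query above, not Lucene's actual scoring code.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; none of these names come from Lucene.
class MultiFieldExpansionSketch {

    // Builds the per-field form of "term -excluded", one clause per field.
    static String expand(String term, String excluded, List<String> fields) {
        StringBuilder sb = new StringBuilder();
        for (String field : fields) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append('(').append(field).append(':').append(term)
              .append(" -").append(field).append(':').append(excluded).append(')');
        }
        return sb.toString();
    }

    // Expanded semantics: the document matches if ANY single field contains
    // the term and that same field does not contain the excluded term.
    static boolean matchesExpanded(Map<String, Set<String>> doc, String term, String excluded) {
        for (Set<String> terms : doc.values()) {
            if (terms.contains(term) && !terms.contains(excluded)) {
                return true;
            }
        }
        return false;
    }

    // Intended semantics: the term occurs in some field and the excluded
    // term occurs in no field at all.
    static boolean matchesIntended(Map<String, Set<String>> doc, String term, String excluded) {
        boolean hasTerm = false;
        for (Set<String> terms : doc.values()) {
            if (terms.contains(excluded)) {
                return false;
            }
            if (terms.contains(term)) {
                hasTerm = true;
            }
        }
        return hasTerm;
    }

    public static void main(String[] args) {
        System.out.println(expand("A", "B", Arrays.asList("text", "title", "path")));

        // A document with A in its text and B only in its title.
        Map<String, Set<String>> doc = new HashMap<>();
        doc.put("text", new HashSet<>(Arrays.asList("A")));
        doc.put("title", new HashSet<>(Arrays.asList("B")));
        doc.put("path", new HashSet<>());

        System.out.println("expanded query matches: " + matchesExpanded(doc, "A", "B"));
        System.out.println("intended query matches: " + matchesIntended(doc, "A", "B"));
    }
}
```

The sample document has A in its text and B only in its title. The (text:A -text:B) clause accepts it even though B is present elsewhere, which is the kind of document that ends up in both result sets.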
use we have an attractive offering (as judged by the
universities using it!).
Regards,
Christiaan Fluit
Aduna.biz
Christoph Kiehl wrote:
I'm curious about your strategy to backup indexes based on FSDirectory.
If I do a file based copy I suspect I will get corrupted data because of
concurrent write access.
My current favorite is to create an empty index and use
IndexWriter.addIndexes() to copy the current
Erik Hatcher wrote:
Where did you get 'i'? Keep in mind that using Hits.doc(n) intends 'n'
to be a document *id*, not the iteration through the Hits collection.
This is a very common mistake, and I'm guessing one you've made here.
I believe the Javadoc (as well as my own experience) tells
We invoke the following code in a static initializer that simply
disables log4j's output entirely.
static {
    // Needs java.util.Properties; a threshold of OFF silences all log4j output.
    Properties props = new Properties();
    props.put("log4j.threshold", "OFF");
    org.apache.log4j.PropertyConfigurator.configure(props);
}