First of all, it seems the mailing list is having some trouble. Some of
my posts end up in the wrong thread (even new threads I start), I don't
receive them in my mail, and they appear only in the 'date archive' of
http://www.mail-archive.com, not in the 'thread' one. I'm also not
receiving some of other people's posts in my mail. The problems started
last week, I think.
Secondly, Chris, thanks for all the useful answers; everything is much
clearer now. This info should be added to the wiki, I think; should I do
it? I'm still a little disappointed that I can't change the OR/AND
parsing just by changing a parameter (as I can for the number of
results returned, for example). Adding an OR between each word in the
text I want to compare sounds suboptimal, but I'll probably do it that
way. It's a very minor nitpick; Solr is awesome, as I said before.
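
For the record, here is roughly what I have in mind for the server-side
queries: a minimal Python sketch, not tested against Solr. The function
name and the optional field argument are made up for illustration, and I
am assuming that wrapping each word in double quotes is enough to keep
reserved words (AND, OR, NOT) and special characters from being parsed
as query syntax.

```python
def build_or_query(text, field=None):
    """Join the words of `text` into one explicit OR query string."""
    clauses = []
    for word in text.split():
        # Quote each word; inside quotes only backslash and the double
        # quote itself need escaping (assumption about the query syntax).
        quoted = '"' + word.replace('\\', '\\\\').replace('"', '\\"') + '"'
        clauses.append(f"{field}:{quoted}" if field else quoted)
    return " OR ".join(clauses)


print(build_or_query("solr faceted search"))
# prints: "solr" OR "faceted" OR "search"
```

The default operator for user queries stays AND; only the machine
generated query string spells out its ORs.
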
@ Brian Lucas: Don't worry, solrPHP was still 99.9% functional, great
work. Sending one doc at a time was partly my fault; I was following
the exact sequence (add to array, submit) shown in the docs. The only
thing that could be added is a big "//TODO: change this code" comment
before the sections you have to change to make it work for a particular
schema. I'm pretty sure the custom header curl submit works for everyone
but me; I'm on a Windows test box with WAMP on it, so that may be the
cause. I'll send you the changes I made to the code tomorrow anyway; as
I said, nothing major.
Chris Hostetter wrote:
: - What is the loadFactor variable of HashDocSet? Should I optimize it too?
this is the same as the loadFactor in a HashMap constructor -- but i don't
think it has much effect on performance since the HashDocSets never
"grow".
I personally have never tuned the loadFactor :)
: - What are the units of the size value of the caches? Megs, number of
: queries, kilobytes? Not described anywhere.
"entries" ... the number of items allowed in the cache.
: - Any way to programmatically change the OR/AND preference of the query
: parser? I set it to AND by default for user queries, but I'd like to set
: it to OR for some server-side queries I must do (find related articles,
: order by score).
you mean using StandardRequestHandler? ... not that i can think of off the
top of my head, but typically it makes sense to just configure what you
want for your "users" in the schema, and then make any machine generated
queries be explicit.
: - What's the difference between the 2 commit types, blocking and
: non-blocking? I didn't see any difference at all; tried both.
do you mean the waitFlush and waitSearcher options?
if either of those is true, you shouldn't get a response back from the
server until they have finished. if they are false, then the server
should respond instantly even if it takes several seconds (or maybe even
minutes) to complete the operation (optimizes can take a while in some
cases -- as can opening newSearchers if you have a lot of cache warming
configured)
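
A small Python sketch of how the two flags might be set on an update
message, assuming they are passed as attributes on the XML command
element. The helper name and its keyword arguments are made up for
illustration; only waitFlush and waitSearcher come from the discussion
above.

```python
import xml.etree.ElementTree as ET


def update_command(name, wait_flush=True, wait_searcher=True):
    """Build a <commit/> or <optimize/> message with both wait flags."""
    element = ET.Element(name)
    # "true" should block the response until the step has finished;
    # "false" should let the server answer right away.
    element.set("waitFlush", "true" if wait_flush else "false")
    element.set("waitSearcher", "true" if wait_searcher else "false")
    return ET.tostring(element, encoding="unicode")


# Non-blocking commit: the response should come back immediately.
print(update_command("commit", wait_flush=False, wait_searcher=False))
```

With a small index and little cache warming both variants return almost
at once, which would explain not seeing any difference between them.
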
: - Every time I do an <optimize> command, I get the following in my
: catalina logs - should I do anything about it?
the optimize command needs to be well formed XML, try "<optimize/>"
instead of just "<optimize>"
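
A quick way to check a command string before posting it, sketched in
Python with the standard library XML parser (the helper name is made up
for illustration):

```python
import xml.etree.ElementTree as ET


def is_well_formed(xml_text):
    """Return True if xml_text parses as a complete XML document."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False


print(is_well_formed("<optimize>"))   # prints False: the tag is never closed
print(is_well_formed("<optimize/>"))  # prints True: self-closing element
```
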
: - Any benefit to setting the allowed memory for Tomcat higher? Right
: now I'm allocating 384 megs.
the more memory you've got, the more caching you can support .. but if
your index changes so frequently compared to the rate of *unique*
queries you get that your caches never fill up, it may not matter.
-Hoss
--
Michael Imbeault
CHUL Research Center (CHUQ)
2705 boul. Laurier
Ste-Foy, QC, Canada, G1V 4G2
Tel: (418) 654-2705, Fax: (418) 654-2212