Re: ANNOUNCE: Welcome Patrick O'Leary as Contrib Committer

2009-01-16 Thread patrick o'leary
mors my user name "pjaol" is actually an acronym of my full name, which is only ever used by my mother when I'm in trouble :-) It will be a pleasure to continue working with all of you, and thank you again for this honor. Thanks Patrick O'Leary > On Jan 16, 2009, at 1:54 P

Re: Words that need protection from stemming, i.e., protwords.txt

2009-01-16 Thread patrick o'leary
Porter is a little outdated; I've found KStem much better: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters/Kstem You'll still need a good protected word list, but KStem is just a little nicer. On Fri, Jan 16, 2009 at 6:20 PM, David Woodward wrote: > Hi. > > Any good protwords.txt out t

Committed revision 735928.

2009-01-19 Thread patrick o'leary
/whoweare.html (working copy) @@ -285,6 +285,9 @@ Wolfgang Hoschek (whosc...@...) +Patrick O'Leary (pj...@...) + + Uwe Schindler (uschind...@...) @@ -300,7 +303,7 @@ - + Emeritus Committers scootie:site pjaol$ svn diff src/documentation/content/xdocs/whoweare.xml

Re: similarity function

2009-03-05 Thread patrick o'leary
Sounds like your most difficult part will be the question parser using POS. This is kind of old school, but use something like the AliceBot AIML library (http://en.wikipedia.org/wiki/AIML), where the subjective terms can be extracted from the questions and indexed separately. Or as Grant and others

Re: Autonomy search technology

2009-04-03 Thread patrick o'leary
I think you need to ask the question: what do you want? A person asked me once which is better, a gold fountain pen or a plastic Bic pen. The answer: it depends. If you want the most fluid writing instrument, which gives you a certain level of accomplishment as you use it, and looks superb then th

Re: How to customize score according to field value?

2009-04-07 Thread patrick o'leary
You might want to play with both boosting and multiple sorting. Also look at something like Solr's boost queries or boost functions: http://wiki.apache.org/solr/DisMaxRequestHandler#head-6862070cf279d9a09bdab971309135c7aea22fb3 Or if you want to go down the path of a custom score, most

Re: A Challenge!: Combining 2 searches into a single resultset?

2009-04-17 Thread patrick o'leary
I'd start by doing some research on the question rather than asking for a solution. What you're asking for can be considered 'Federated Search': http://en.wikipedia.org/wiki/Federated_search And it can be conceived in as many ways as you have document types. Any answer will probably end up customize

Re: A Challenge!: Combining 2 searches into a single resultset?

2009-04-17 Thread patrick o'leary
Why not put the keywords into the same document as another field and search both fields at once? You can then use Lucene syntax to give a boost to the keyword field, e.g. body: A good game last night by the redskins keyword: redskins Query = body:(game OR redskins) keyword:(game OR redskins)^10
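As a rough illustration of the suggestion above, the boosted two-field query string can be assembled mechanically from the search terms. This is only a sketch: the class and method names (BoostedFieldQuery, fieldClause, build) are hypothetical, the field names body and keyword and the boost of 10 come from the example in the message, and the terms are assumed to contain no characters that need escaping in Lucene query syntax.

```java
import java.util.Arrays;
import java.util.List;
import java.util.StringJoiner;

// Hypothetical helper, not part of Lucene: it only builds the query string.
public class BoostedFieldQuery {

    /** Joins the terms with OR and scopes the group to one field, e.g. body:(game OR redskins). */
    public static String fieldClause(String field, List<String> terms) {
        StringJoiner joined = new StringJoiner(" OR ", field + ":(", ")");
        for (String term : terms) {
            joined.add(term); // assumption: terms need no query-syntax escaping
        }
        return joined.toString();
    }

    /** Searches both fields at once and boosts matches in the keyword field. */
    public static String build(List<String> terms, int keywordBoost) {
        return fieldClause("body", terms) + " "
                + fieldClause("keyword", terms) + "^" + keywordBoost;
    }

    public static void main(String[] args) {
        // prints: body:(game OR redskins) keyword:(game OR redskins)^10
        System.out.println(build(Arrays.asList("game", "redskins"), 10));
    }
}
```

The resulting string would then be handed to whatever query parser the application already uses; a document matching in the keyword field scores higher than one matching only in body.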

Re: LocalLucene/Lucene Spatial

2009-04-19 Thread patrick o'leary
1) The only reason for ConstantScoreQuery is because it lets me convert the LocalLucene filter to a query. 2) TierRangeQuery: not at the moment, we're not really doing a range search, we're pulling predefined IDs (e.g. a shape) from the index using the term enumerator. Average time taken for a bo

Re: LocalLucene/Lucene Spatial

2009-04-20 Thread patrick o'leary
Uwe Schindler > H.-H.-Meier-Allee 63, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > -Original Message- > > From: patrick o'leary [mailto:pj...@pjaol.com] > > Sent: Monday, April 20, 2009 3:33 AM > > To: java-user@lucene.apach

Re: Servlets Sharing Resources

2009-04-21 Thread patrick o'leary
Why not have 1 servlet and, based on a parameter / URL, serve 2 different outputs? if (request.getParameter("asXML") != null) showXML(); else showOtherStuff(); Save yourself the hassle of dealing with JNDI / contexts / Spring or singletons. On Tue, Apr 21, 2009 at 12:01 PM, David Seltzer wrote:
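A minimal sketch of the single-entry-point idea above, written without the servlet API so it runs on its own. The Map of parameters is a stand-in (an assumption for illustration) for the request parameters a real doGet() would read via request.getParameter("asXML"); the class name and the placeholder outputs are hypothetical.

```java
import java.util.Collections;
import java.util.Map;

// Hypothetical stand-in for one servlet that serves two output formats.
public class SingleEntryPoint {

    // Placeholder renderers; a real servlet would write to the response instead.
    static String showXML() {
        return "<results/>";
    }

    static String showOtherStuff() {
        return "results";
    }

    /** Picks the output format from the presence of the asXML parameter. */
    public static String handle(Map<String, String> params) {
        if (params.get("asXML") != null) {
            return showXML();
        }
        return showOtherStuff();
    }

    public static void main(String[] args) {
        System.out.println(handle(Collections.singletonMap("asXML", "1")));
        System.out.println(handle(Collections.<String, String>emptyMap()));
    }
}
```

Because both outputs are produced inside one class, any shared resource (an index searcher, for example) can simply be a field of that class, which is what avoids the JNDI, context, or singleton plumbing.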

Re: Servlets Sharing Resources

2009-04-21 Thread patrick o'leary
everything hit the same > doGet(). > > -Original Message- > From: patrick o'leary [mailto:pj...@pjaol.com] > Sent: Tuesday, April 21, 2009 12:51 PM > To: java-user@lucene.apache.org > Subject: Re: Servlets Sharing Resources > > Why not have 1 servlet and base

Re: Lucene Index Encryption

2009-05-08 Thread patrick o'leary
There will always be levels at which data is insecurely available, most notably within the memory of an application once it's running, unless you want to go down the path of encrypting and decrypting each and every string. At which point you lose dictionary functionality and well any useful e