Re: Lucene MBean service for JBoss

2003-09-25 Thread Otis Gospodnetic
Thanks, I'm finally including this on the Contributions page.

Otis

--- Andy Scholz [EMAIL PROTECTED] wrote:
 Hi All,
 
 For those who may be interested, I have written a full-text indexing
 service for the JBoss application server that uses Lucene as its engine.
 It allows Lucene to be used as a service rather than a standalone app,
 with thread pooling, access synchronization, management, etc. Index and
 search interfaces are accessible via JNDI and remotely via session EJBs.
 
 Additionally, I have provided content filters for common formats like
 HTML, MSWord, MSExcel, XML, etc. (with some help from other projects).
 A simple interface also allows you to write your own filters for
 different formats.
 
 It is available under an LGPL license, and source code, binaries, and
 info are available here:
 http://ejindex.sourceforge.net
 
 I'd love to get some feedback, so if you're interested, please let me
 know your comments or suggestions ;)
 
 regards,
 Andy Scholz
 
 
 -
 To unsubscribe, e-mail: [EMAIL PROTECTED]
 For additional commands, e-mail: [EMAIL PROTECTED]
 





wildcards in fields?

2003-09-25 Thread lucene
Hi!

I search in a field called url. url:www.blah.com returns hits, while
url:blah.com does not. So I tried url:*blah.com, but this even throws a
ParseException.

What am I doing wrong?

Timo





stop words in index

2003-09-25 Thread lucene
Hi!

I use a GermanAnalyzer for indexing and searching. A search for der (the)
does not return any hits, but examining the index with Luke shows der as
the top-ranked word. Other words that are probably stop words as well
(zum) do return hits.

bug?
Timo





Re: wildcards in fields?

2003-09-25 Thread Materna, Wolf-Dietrich (empolis B)
Hello,
 From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]
 
 I search in a field called url. url:www.blah.com returns hits, while
 url:blah.com does not. So I tried url:*blah.com, but this even throws a
 ParseException.
 
 What am I doing wrong?
Wildcards at the beginning of a term are not supported. Lucene uses the first
characters of a term to speed up the search for matching terms (== words), so
the query parser doesn't accept your query. (Otherwise Lucene would have to
examine all terms, and this would be very slow.)
You may use a second field to store the URL as a reversed string, e.g.
invurl:moc.halb.www, and search for invurl:moc.halb.* instead.
Check the archives for both mailing lists ([EMAIL PROTECTED]
and [EMAIL PROTECTED]). IIRC, other proposals for working around this
limit were discussed there some time ago.
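
The reversed-field idea can be sketched in plain Python (this is not Lucene
code). A sorted list stands in for the term dictionary, and the helper names
reverse_term, build_reversed_index and suffix_search are made up for this
illustration. The point is only that a suffix match on the original URL
becomes a prefix match on the reversed one, which a sorted term list can
answer without scanning every term.

```python
import bisect


def reverse_term(term):
    """Return the term with its characters in reverse order."""
    return term[::-1]


def build_reversed_index(urls):
    """Build a sorted list of (reversed_url, original_url) pairs.

    The sorted list is only a stand-in for a sorted term dictionary.
    """
    return sorted((reverse_term(u), u) for u in urls)


def suffix_search(index, suffix):
    """Find URLs ending in `suffix` via a prefix scan over reversed terms."""
    prefix = reverse_term(suffix)
    # Jump straight to the first entry that could start with the prefix.
    start = bisect.bisect_left(index, (prefix,))
    hits = []
    for reversed_url, original in index[start:]:
        if not reversed_url.startswith(prefix):
            break  # sorted order: no later entry can match
        hits.append(original)
    return hits


index = build_reversed_index(["www.blah.com", "blah.com", "www.example.org"])
print(suffix_search(index, "blah.com"))
```

Note that a pure suffix match would also accept a host such as
notblah.com; if that matters, include the separating dot in the suffix
being searched for.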
Regards,
Wolf-Dietrich
-- 
Wolf-Dietrich Materna
Development
 
empolis GmbH -  arvato knowledge management 

phone: +49 - 30 - 34 35 34 95 
mailto:[EMAIL PROTECTED] http://www.empolis.com 




Re: Lucene MBean service for JBoss

2003-09-25 Thread Andy Scholz
Thanks Otis...

With any luck my current employer will also chip in a few bucks to help 
maintain the project (I'm working on it)...

cheers
-andy
Otis Gospodnetic wrote:

Thanks, I'm finally including this on the Contributions page.

Otis



French texts

2003-09-25 Thread Gayo Diallo
Hi,
I just want to know if it's possible to use Lucene for French documents.
Is there an analyzer for this language?
Best regards,
Gayo Diallo


Re: French texts

2003-09-25 Thread Bruce Ritchie
Yes, you can use Lucene to search French documents. The Snowball stemmers contribution contains a
French stemmer; you'll find it at http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/

Regards,

Bruce Ritchie

Gayo Diallo wrote:

Hi,
I just want to know if it's possible to use Lucene for French documents.
Is there an analyzer for this language?
Best regards,
Gayo Diallo

