RE: Any tools to detect language of document

2003-06-17 Thread Neil Couture
look for Ted Dunning algorithm on the web.


-neil

-Original Message-
From: Randy Darling [mailto:[EMAIL PROTECTED]
Sent: 17 juin, 2003 16:41
To: Lucene Users List
Subject: Any tools to detect language of document



I am attempting to come up with an automated way to
select which language analyzer to use on a document.

Anyone know of any algorithms available to detect
what language the document may be written in?

Are there any special Analyzers that attempt to support
multiple languages?


Thanks,
Randy

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Any tools to detect language of document

2003-06-17 Thread Randy Darling

I am attempting to come up with an automated way to
select which language analyzer to use on a document.

Anyone know of any algorithms available to detect
what language the document may be written in?

Are there any special Analyzers that attempt to support
multiple languages?


Thanks,
Randy

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



RE: Weighted Search by Field using MultiFieldQueryParser

2003-06-17 Thread Otis Gospodnetic
Ah!  Sorry about confusion.

Otis

--- "Kevin L. Cobb" <[EMAIL PROTECTED]> wrote:
> Yes . . . but you suggested that I post it to the User group (as
> opposed
> to the Dev group).
> 
>  
> ~
> Kevin L Cobb, MT(ASCP), SCJP2
> Senior Software Engineer
> Emergint, Inc.
> 501 E Broadway, Suite 250
> Louisville, KY 40202
> [EMAIL PROTECTED]
> 502.852.1423
> 502.852.1589 fax
>  
> The preceding e-mail message and attachments, if any, contains
> information that is confidential and qualifies as such under the
> Definition of Confidential Information in the Non-Disclosure
> Agreement,
> and may constitute non-public information and trade secrets. It is
> intended to be conveyed only to the designated recipient(s). If you
> are
> not an intended recipient of this message, please notify the sender
> at
> 502-852-1605. 
>  
> 
> -Original Message-
> From: Otis Gospodnetic [mailto:[EMAIL PROTECTED] 
> Sent: Tuesday, June 17, 2003 9:05 AM
> To: Lucene Users List
> Subject: Re: Weighted Search by Field using MultiFieldQueryParser
> 
> Didn't I reply to this yesterday?
> 
> --- "Kevin L. Cobb" <[EMAIL PROTECTED]> wrote:
> > I have an index that has three fields in it. When I do a search
> using
> > MultiFieldQueryParser, the search applies the same importance
> > (weight)
> > to each of the fields. BUT, what if I want to apply a different
> > weight
> > to each field, i.e. I want to consider found terms from certain
> > fields
> > as less "important" than others. I have applied an algorithm to
> help
> > me
> > do this, which involves searching each field separately and then
> > recombining the results into a single collection, but hate to
> > reinvent
> > the wheel if I don't have to.
> > 
> >  
> > 
> >  
> > 
> > ~
> > 
> > Kevin L Cobb, MT(ASCP), SCJP2
> > 
> > Senior Software Engineer
> > 
> > Emergint, Inc.
> > 
> > 501 E Broadway, Suite 250
> > 
> > Louisville, KY 40202
> > 
> > [EMAIL PROTECTED]
> > 
> > 502.852.1423
> > 
> > 502.852.1589 fax
> > 
> >  
> > 
> >  
> > 
> > The preceding e-mail message and attachments, if any, contains
> > information that is confidential and qualifies as such under the
> > Definition of Confidential Information in the Non-Disclosure
> > Agreement,
> > and may constitute non-public information and trade secrets. It is
> > intended to be conveyed only to the designated recipient(s). If you
> > are
> > not an intended recipient of this message, please notify the sender
> > at
> > 502-852-1605. 
> > 
> >  
> > 
> > 
> 
> 
> __
> Do you Yahoo!?
> SBC Yahoo! DSL - Now only $29.95 per month!
> http://sbc.yahoo.com
> 
> -
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> 
> -
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 


__
Do you Yahoo!?
SBC Yahoo! DSL - Now only $29.95 per month!
http://sbc.yahoo.com

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Weighted Search by Field using MultiFieldQueryParser

2003-06-17 Thread Tatu Saloranta
On Tuesday 17 June 2003 05:43, Kevin L. Cobb wrote:
> I have an index that has three fields in it. When I do a search using
> MultiFieldQueryParser, the search applies the same importance (weight)
> to each of the fields. BUT, what if I want to apply a different weight
> to each field, i.e. I want to consider found terms from certain fields
> as less "important" than others. I have applied an algorithm to help me
> do this, which involves searching each field separately and then
> recombining the results into a single collection, but hate to reinvent
> the wheel if I don't have to.

Have you looked at MultiFieldQueryParser source? It's a very simple class, and
modifying it (making a new class) should be easy; pass in not only field names 
but also weights to apply?
(as a sidenote, MultiFieldQueryParser does some unnecessary work as is... it 
seems to re-parse same query once for each field, could just clone it)

-+ Tatu +-



-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



RE: Weighted Search by Field using MultiFieldQueryParser

2003-06-17 Thread Kevin L. Cobb
Yes . . . but you suggested that I post it to the User group (as opposed
to the Dev group).

 
~
Kevin L Cobb, MT(ASCP), SCJP2
Senior Software Engineer
Emergint, Inc.
501 E Broadway, Suite 250
Louisville, KY 40202
[EMAIL PROTECTED]
502.852.1423
502.852.1589 fax
 
The preceding e-mail message and attachments, if any, contains
information that is confidential and qualifies as such under the
Definition of Confidential Information in the Non-Disclosure Agreement,
and may constitute non-public information and trade secrets. It is
intended to be conveyed only to the designated recipient(s). If you are
not an intended recipient of this message, please notify the sender at
502-852-1605. 
 

-Original Message-
From: Otis Gospodnetic [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, June 17, 2003 9:05 AM
To: Lucene Users List
Subject: Re: Weighted Search by Field using MultiFieldQueryParser

Didn't I reply to this yesterday?

--- "Kevin L. Cobb" <[EMAIL PROTECTED]> wrote:
> I have an index that has three fields in it. When I do a search using
> MultiFieldQueryParser, the search applies the same importance
> (weight)
> to each of the fields. BUT, what if I want to apply a different
> weight
> to each field, i.e. I want to consider found terms from certain
> fields
> as less "important" than others. I have applied an algorithm to help
> me
> do this, which involves searching each field separately and then
> recombining the results into a single collection, but hate to
> reinvent
> the wheel if I don't have to.
> 
>  
> 
>  
> 
> ~
> 
> Kevin L Cobb, MT(ASCP), SCJP2
> 
> Senior Software Engineer
> 
> Emergint, Inc.
> 
> 501 E Broadway, Suite 250
> 
> Louisville, KY 40202
> 
> [EMAIL PROTECTED]
> 
> 502.852.1423
> 
> 502.852.1589 fax
> 
>  
> 
>  
> 
> The preceding e-mail message and attachments, if any, contains
> information that is confidential and qualifies as such under the
> Definition of Confidential Information in the Non-Disclosure
> Agreement,
> and may constitute non-public information and trade secrets. It is
> intended to be conveyed only to the designated recipient(s). If you
> are
> not an intended recipient of this message, please notify the sender
> at
> 502-852-1605. 
> 
>  
> 
> 


__
Do you Yahoo!?
SBC Yahoo! DSL - Now only $29.95 per month!
http://sbc.yahoo.com

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Re: Weighted Search by Field using MultiFieldQueryParser

2003-06-17 Thread Otis Gospodnetic
Didn't I reply to this yesterday?

--- "Kevin L. Cobb" <[EMAIL PROTECTED]> wrote:
> I have an index that has three fields in it. When I do a search using
> MultiFieldQueryParser, the search applies the same importance
> (weight)
> to each of the fields. BUT, what if I want to apply a different
> weight
> to each field, i.e. I want to consider found terms from certain
> fields
> as less "important" than others. I have applied an algorithm to help
> me
> do this, which involves searching each field separately and then
> recombining the results into a single collection, but hate to
> reinvent
> the wheel if I don't have to.
> 
>  
> 
>  
> 
> ~
> 
> Kevin L Cobb, MT(ASCP), SCJP2
> 
> Senior Software Engineer
> 
> Emergint, Inc.
> 
> 501 E Broadway, Suite 250
> 
> Louisville, KY 40202
> 
> [EMAIL PROTECTED]
> 
> 502.852.1423
> 
> 502.852.1589 fax
> 
>  
> 
>  
> 
> The preceding e-mail message and attachments, if any, contains
> information that is confidential and qualifies as such under the
> Definition of Confidential Information in the Non-Disclosure
> Agreement,
> and may constitute non-public information and trade secrets. It is
> intended to be conveyed only to the designated recipient(s). If you
> are
> not an intended recipient of this message, please notify the sender
> at
> 502-852-1605. 
> 
>  
> 
> 


__
Do you Yahoo!?
SBC Yahoo! DSL - Now only $29.95 per month!
http://sbc.yahoo.com

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



Weighted Search by Field using MultiFieldQueryParser

2003-06-17 Thread Kevin L. Cobb
I have an index that has three fields in it. When I do a search using
MultiFieldQueryParser, the search applies the same importance (weight)
to each of the fields. BUT, what if I want to apply a different weight
to each field, i.e. I want to consider found terms from certain fields
as less "important" than others. I have applied an algorithm to help me
do this, which involves searching each field separately and then
recombining the results into a single collection, but hate to reinvent
the wheel if I don't have to.

 

 

~

Kevin L Cobb, MT(ASCP), SCJP2

Senior Software Engineer

Emergint, Inc.

501 E Broadway, Suite 250

Louisville, KY 40202

[EMAIL PROTECTED]

502.852.1423

502.852.1589 fax

 

 

The preceding e-mail message and attachments, if any, contains
information that is confidential and qualifies as such under the
Definition of Confidential Information in the Non-Disclosure Agreement,
and may constitute non-public information and trade secrets. It is
intended to be conveyed only to the designated recipient(s). If you are
not an intended recipient of this message, please notify the sender at
502-852-1605. 

 



Re: newbie lucene demo

2003-06-17 Thread Maurice Coyle






hi,
this isn't a lucene problem.  if there's a path on your classpath with spaces in it, you need to put double quotes ("..") around it, so your classpath would look like:
 
"C:\Program Files\s1studio_jdk\j2sdk1.4.1_02\lib\lucene-1.2.jar";"C:\Program Files\s1studio_jdk\j2sdk1.4.1_02\lib\lucene-demos-1.2.jar"
 
---Original Message---
 

From: Lucene Users List
Date: 17 June 2003 10:20:01
To: [EMAIL PROTECTED]
Subject: newbie lucene demo
 Hi.I have build Lucene successfully and now I'm trying to use Lucene demo. But I get error when I want to build a index.The classpath is set to:C:\Program Files\s1studio_jdk\j2sdk1.4.1_02\lib\lucene-1.2.jar;C:\Program Files\s1studio_jdk\j2sdk1.4.1_02\lib\lucene-demos-1.2.jarI have put the lucene-1.2.jar and lucene-demos-1.2.jar files in:C:\Program Files\s1studio_jdk\j2sdk1.4.1_02\libI run the command from:C:\Program Files\lucene-1.2-srcThe command is:java org.apache.lucene.demo.IndexFiles C:\Program Files\lucene-1.2-src/srcI get the error:adding C:\Programcaught a class java.io.FileNotFoundExceptionwith message: C:\Program (The system cannot find the file specified)-This mail sent through IMP: http://horde.org/imp/-To unsubscribe, e-mail: [EMAIL PROTECTED]For additional commands, e-mail: [EMAIL PROTECTED]. 







  IncrediMail - Email has finally evolved - Click Here


newbie lucene demo

2003-06-17 Thread di99mwo
Hi.

I have build Lucene successfully and now I'm trying to use Lucene demo. But I 
get error when I want to build a index.


The classpath is set to:
C:\Program Files\s1studio_jdk\j2sdk1.4.1_02\lib\lucene-1.2.jar;
C:\Program Files\s1studio_jdk\j2sdk1.4.1_02\lib\lucene-demos-1.2.jar

I have put the lucene-1.2.jar and lucene-demos-1.2.jar files in:
C:\Program Files\s1studio_jdk\j2sdk1.4.1_02\lib

I run the command from:
C:\Program Files\lucene-1.2-src

The command is:
java org.apache.lucene.demo.IndexFiles C:\Program Files\lucene-1.2-src/src

I get the error:
adding C:\Program
 caught a class java.io.FileNotFoundException
 with message: C:\Program (The system cannot find the file specified)




-
This mail sent through IMP: http://horde.org/imp/

-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]