Hi all,
my name is Laura and I'm a new member of this list. I'm a long-time
user of Tomcat and I'm also a member of the Tomcat user list.
Yesterday, looking at the Jakarta menu, I saw Lucene and I said: What is
this?
Reading lucene home page I understood that Lucene is a very interesting
and
example of link extraction.
Try http://www.quiotix.com/opensource/html-parser
It's easy to write a Visitor which extracts the links; it should take about ten
lines of code.
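As a rough illustration of the link-extraction step, here is a minimal Java sketch. It is a stand-in and not the approach described above: it does not use the Quiotix parser or its Visitor API (those classes are not shown in this thread) but a plain regular expression from the standard library. The class name LinkExtractor is made up for this example. A regex will miss or misread some real-world HTML, so treat it as a sketch only.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration; not part of any parser library.
public class LinkExtractor {
    // Matches href="..." or href='...' inside an <a ...> tag, case-insensitively.
    private static final Pattern HREF = Pattern.compile(
        "<a\\b[^>]*?\\bhref\\s*=\\s*([\"'])(.*?)\\1",
        Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Returns the href values in document order.
    public static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(2)); // group 2 is the text between the quotes
        }
        return links;
    }

    public static void main(String[] args) {
        String html = "<p><a href=\"http://example.org/a\">A</a> <A HREF='b.html'>B</A></p>";
        System.out.println(extractLinks(html));
    }
}
```

A real parser with a visitor is the more robust choice, since it also handles unquoted attributes, comments, and malformed markup.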
--
Brian Goetz
Quiotix Corporation
[EMAIL PROTECTED] Tel: 650-843-1300Fax
/
Otis
--- [EMAIL PROTECTED] [EMAIL PROTECTED] wrote:
Hi Otis,
thanks for your reply. I have been looking for Spindle and Mojo for
2 hours but I haven't found anything.
Can you help me? Where can I find something?
Thanks for your help and time
Laura
PROTECTED] death is the
Vantaa, .fi last dance eternal
--
To unsubscribe, e-mail: mailto:lucene-user-
[EMAIL PROTECTED]
For additional commands, e-mail: mailto:lucene-user-
[EMAIL PROTECTED]
-
word list to run statistics
on a page :-) ?!?
On Wednesday 24 April 2002 11:02, [EMAIL PROTECTED] wrote:
Hi all,
I'm using Jobo for spidering web sites and Lucene for indexing. The
problem is that I'd like to spider only Italian web sites.
How can I discover the country
11:02:32 +0200
From: [EMAIL PROTECTED] [EMAIL PROTECTED]
Subject: Italian web sites
To: [EMAIL PROTECTED]
Send reply to:Lucene Users List lucene-
[EMAIL PROTECTED]
Hi all,
I'm using Jobo for spidering web sites and lucene
Received your mail we will get back to you shortly
I also have a question regarding my application:
what would be the ideal memory allocation for Lucene,
considering my application will serve at least 20 transactions per second?
tia
--buics
On Fri, 3 Sep 2004 15:20:45 +0200, [EMAIL PROTECTED]
[EMAIL PROTECTED] wrote:
Terence,
I still had
java.io.IOException: Lock obtain timed out
I was trying to create two instances of IndexSearcher with different index files.
Is there something I've missed?
tia,
buics
Hi all,
I use PDFBox to parse PDF files into Lucene documents. When I parse Chinese
PDF files, PDFBox does not always succeed.
Does anyone have any advice?
It is not about the analyzer; I need to read the text from the PDF file first.
- Original Message -
From: Chandan Tamrakar [EMAIL PROTECTED]
To: Lucene Users List [EMAIL PROTECTED]
Sent: Wednesday, September 08, 2004 4:15 PM
Subject: Re: pdf in Chinese
Which analyzer are you using to index
after popularity (a field) and
not by anything else. How can I do this? What classes and methods do I have
to change?
thanks,
William
in
Similarity. Should I do anything about it, or doesn't it matter?
/William
You need your own Similarity implementation and you need to set it as
shown in this javadoc:
http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/search/Similarity.html
Otis
--- [EMAIL PROTECTED] [EMAIL PROTECTED] wrote:
Hi,
I know this is probably a common question and I've found a couple of
posts
about it in the archive
house254.1266
house144.1942
house037.5
I'm sorry, friends. I put the title incorrectly two times.
numbers, and then it all goes crazy for me...
I need to solve this search:
number: -10
range: -50 TO 5
I need help.
I can't find anything using Google.
Thanks,
d2clon
Hi Morus and company,
On Thursday 18 November 2004 12:49, Morus Walter wrote:
[EMAIL PROTECTED] writes:
I need to solve this search:
number: -10
range: -50 TO 5
I need help.
I can't find anything using Google.
If your numbers are in the interval MIN/MAX and MIN < 0, you can shift
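The shift mentioned here can be sketched roughly as follows. This is only an illustration under assumptions: the bounds MIN = -1000 and MAX = 1000 are invented for the example, the class name ShiftedNumber is hypothetical, and the zero-padding step is an addition based on the fact that range queries compare terms as strings. Subtracting MIN makes every value non-negative, and padding to a fixed width makes string order agree with numeric order.

```java
import java.util.Locale;

// Hypothetical helper for illustration; the bounds are assumed, not from the thread.
public class ShiftedNumber {
    public static final int MIN = -1000;
    public static final int MAX = 1000;
    // Digits needed for the largest shifted value, so all terms have equal length.
    public static final int WIDTH = String.valueOf(MAX - MIN).length();

    // Shift into [0, MAX - MIN] and left-pad with zeros.
    public static String encode(int n) {
        if (n < MIN || n > MAX) {
            throw new IllegalArgumentException("out of range: " + n);
        }
        return String.format(Locale.ROOT, "%0" + WIDTH + "d", n - MIN);
    }

    // Reverse the shift to recover the original number.
    public static int decode(String term) {
        return Integer.parseInt(term) + MIN;
    }

    public static void main(String[] args) {
        String lower = encode(-50);
        String upper = encode(5);
        String value = encode(-10);
        // Plain string comparison now gives the numeric answer.
        boolean inRange = lower.compareTo(value) <= 0 && value.compareTo(upper) <= 0;
        System.out.println(value + " in [" + lower + " TO " + upper + "]: " + inRange);
    }
}
```

With these assumed bounds, -10 is indexed as 0990 and the range -50 TO 5 becomes 0950 TO 1005. The same encoding has to be applied both when indexing and when building the range query.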
());
}
}
}
to calling
fsWriter.close() to check the number of docs ... that won't work for the
same reason.
-Hoss
Hi Otis
I did try, here's what I get:
[EMAIL PROTECTED] tmp]# time java MemoryVsDisk 1 1 10 -r
Docs in the RAM index: 1
Docs in the FS index: 0
Total time: 142 ms
real 0m0.322s
user 0m0.268s
sys 0m0.033s
I tried other combinations but they don't seem to affect the outcome
either.
technically.
Otis
--- [EMAIL PROTECTED] [EMAIL PROTECTED] wrote:
Here's probably a silly question, very newbish, but I had to ask.
Since I have mysql documents that contain over 30 fields each and
most of them
are added to the index, is it a common practice to add fields to the
index with
empty values
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Friday, December 10, 2004 2:59 PM
To: Lucene Users List
Subject: Re: No of docs using IndexSearcher
numDocs()
http://jakarta.apache.org/lucene/docs/api/org/apache/lucene/index/IndexReader.html#numDocs()
Ravi said
supported indexable filetype-collection (XML, HTML, PDF,
MSWord-DOC, RTF, Plaintext).
WBR,
Tom.
the information of the DB of
forum (for example MySQL)
book charge. Amazon.com are quoting shipping in 24hrs. Is this a new 'Boston Tea Party'?
cheers
David
for the particular document before creating the
analyzer.
regards
Bernhard
[EMAIL PROTECTED] schrieb:
Greetings everyone
I wonder whether there is a solution for analyzing both English and French
documents using the same analyzer.
The reason is that we have predominantly English documents
Morus Walter said the following on 1/21/2005 2:14 AM:
No. You could do a ( ( french-query ) or ( english-query ) ) construct using
one query. So query construction would be a bit more complex, but querying
itself wouldn't change.
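The ( ( french-query ) or ( english-query ) ) shape can be sketched as a small query-string builder. This is a sketch under assumptions: the field names body_fr and body_en and the class name BilingualQuery are hypothetical, and it only assembles text in Lucene's query syntax (field:term, parentheses, OR). In a real setup each half would presumably be analyzed with its own language's analyzer before being combined.

```java
// Hypothetical helper for illustration; field names are assumed, not from the thread.
public class BilingualQuery {
    // Builds "field:term field:term ..." for one language.
    // No escaping of query-syntax characters is attempted here.
    public static String clause(String field, String... terms) {
        StringBuilder sb = new StringBuilder();
        for (String term : terms) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(field).append(':').append(term);
        }
        return sb.toString();
    }

    // Wraps the two per-language queries in a single OR construct.
    public static String either(String frenchQuery, String englishQuery) {
        return "((" + frenchQuery + ") OR (" + englishQuery + "))";
    }

    public static void main(String[] args) {
        String q = either(clause("body_fr", "maison"), clause("body_en", "house"));
        System.out.println(q);
    }
}
```

The searching code stays the same; only the construction of the query text gains this extra step.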
The first thing I'd do in your case would be to look at the
similar comments. But I'm a bit surprised there's not
a bit more in terms of use of the official java extension to php.
Thanks for the great package!
Owen
indexed.
You could also try the extreme case and set that max value to the max
Integer.
Otis
--- [EMAIL PROTECTED] [EMAIL PROTECTED] wrote:
Hi everyone
I'm having a bizarre problem with a few of the documents here that do
not seem to get indexed entirely.
I use textmining WordExtractor to convert M
Thanks Andrzej and Pasha for your prompt replies and suggestions.
I will try everything you have suggested and report back on the findings!
regards
-pedja
Pasha Bizhan said the following on 2/25/2005 6:32 PM:
Hi,
whole document was indexed or not.
Luke can help you to give an answer the