Please ask these types of questions on java-user. Java-dev is
reserved for Lucene internal development
However, it is your responsibility to do the crawling. You might look
into Nutch or Aperture or some other crawler that is aware of these
kinds of contexts. Lucene is agnostic of the content. It is up to
your application to tell Lucene how to index your content.
On Dec 28, 2007, at 2:23 PM, Jesiel Trevisan wrote:
Thanks Steven,
I saw this link, it´s nice, but... for me, it still does not clear
how to do
a Indexing my webSite.
For example, I have in my website this struct below:
<home> -
Index.jsp
<products>
tv.jsp
dvd.jsp
<services>
service01.jsp
service02.jsp
The Lucene can make a search in <home> and get all de context of all
JSP
files and then, put in the lucene´s index files ?
For example, if, into the Index.jsp there are many liks to other JSP
pages,
is the Lucene will navegate into these links an then, get all its
context
informations ?
And, if I just will want a <Title> of the pages in the JSP, how I
can create
this filter ?
Thanks for help and sorry about these baby´s questions ;-(
On Dec 28, 2007 1:46 PM, Steven A Rowe <[EMAIL PROTECTED]> wrote:
Hi Jesiel,
Here's a good place to start:
http://wiki.apache.org/lucene-java/Resources
Steve
On 12/28/2007 at 8:27 AM, Jesiel Trevisan wrote:
Hi everyone,
I just have a little doubts with Lucene 2.2.0
I got the web example of Lucene Search, but, it does not have
nothing about
indexing file example, there is only the Searcher Function example.
I just would like to see or read something about How Can I do/make
the
indexing lucene files.
If somebody have any example or tutorial about how to
indexing Lucene files
by searching html files, it´ll be helpfull to me so much.
Thanks.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
--
_______________________________________________________
Jesiel A.S. Trevisan
Email: [EMAIL PROTECTED]
MSN: [EMAIL PROTECTED]
Skype & AIM: jesieltrevisan
YahooMessager: jesiel.trevisan
ICQ:: 46527510
_______________________________________________________
CONFIDENTIALITY NOTICE: This e-mail message, including any
attachments, is
for the sole use of the intended recipient(s) and may contain
confidential
and privileged information or otherwise be protected by law. Any
unauthorized review, use, disclosure or distribution is prohibited.
If you
are not the intended recipient, please contact the sender by reply e-
mail
and destroy all copies of the original message.
--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com
Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]