Please ask these types of questions on java-user. The java-dev list is reserved for Lucene internal development.

However, it is your responsibility to do the crawling. You might look into Nutch or Aperture or some other crawler that is aware of these kinds of contexts. Lucene is agnostic about the content: it does not fetch pages or follow links, so it is up to your application to tell Lucene how to index your content.
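
As a rough illustration of the application-side work described above, here is a minimal sketch, not a definitive implementation. It pulls the <title> text and the link targets out of one page's HTML, which is what a crawler would do before handing the title to Lucene as a field and queuing the links for fetching. The class name PageFields and both method names are hypothetical, regular expressions are a simplification that will miss malformed markup, and the sketch assumes you already hold the rendered HTML of each JSP (fetched over HTTP, not the .jsp source). The Lucene calls themselves are left out so the example stays self-contained.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper: extracts the pieces a crawler would pass on for indexing.
class PageFields {
    // Matches the text between <title> and </title>, ignoring case and line breaks.
    private static final Pattern TITLE = Pattern.compile(
            "<title[^>]*>(.*?)</title>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Matches the quoted href value of an <a ...> tag (single or double quotes).
    private static final Pattern HREF = Pattern.compile(
            "<a\\s[^>]*?href\\s*=\\s*[\"']([^\"']+)[\"']", Pattern.CASE_INSENSITIVE);

    // Returns the page title with whitespace collapsed, or "" if there is none.
    static String extractTitle(String html) {
        Matcher m = TITLE.matcher(html);
        return m.find() ? m.group(1).trim().replaceAll("\\s+", " ") : "";
    }

    // Returns the href targets in document order; the caller decides which to fetch next.
    static List<String> extractLinks(String html) {
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            links.add(m.group(1));
        }
        return links;
    }

    public static void main(String[] args) {
        String html = "<html><head><title>Home</title></head><body>"
                + "<a href=\"products/tv.jsp\">TV</a> "
                + "<a href='products/dvd.jsp'>DVD</a></body></html>";
        System.out.println(extractTitle(html));  // prints Home
        System.out.println(extractLinks(html));  // prints [products/tv.jsp, products/dvd.jsp]
    }
}
```

In a real application the extracted title would go into a Lucene Document as its own field, and the links would feed the crawl queue. A dedicated HTML parser or a crawler such as Nutch is likely a better fit than regular expressions for anything beyond a small site.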


On Dec 28, 2007, at 2:23 PM, Jesiel Trevisan wrote:

Thanks Steven,

I saw this link; it's nice, but it is still not clear to me how to
index my website.

For example, my website has the structure below:

<home> -
         Index.jsp
    <products>
                     tv.jsp
                     dvd.jsp
    <services>
                     service01.jsp
                     service02.jsp

Can Lucene search in <home>, read the content of all the JSP files,
and put it into Lucene's index files?
For example, if Index.jsp contains many links to other JSP pages, will
Lucene follow these links and pick up the content of those pages too?

And if I want only the <Title> of each JSP page, how can I create
such a filter?

Thanks for the help, and sorry about these beginner questions ;-(




On Dec 28, 2007 1:46 PM, Steven A Rowe <[EMAIL PROTECTED]> wrote:

Hi Jesiel,

Here's a good place to start:

 http://wiki.apache.org/lucene-java/Resources

Steve

On 12/28/2007 at 8:27 AM, Jesiel Trevisan wrote:
Hi everyone,

I just have a few questions about Lucene 2.2.0.

I got the Lucene web search example, but it has nothing about
indexing files; it contains only the searcher example.

I would just like to see or read something about how to build the
Lucene index files.

If somebody has an example or tutorial on how to build a Lucene index
from HTML files, it would be very helpful to me.

Thanks.





---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




--
_______________________________________________________
Jesiel A.S. Trevisan
Email: [EMAIL PROTECTED]
MSN: [EMAIL PROTECTED]
Skype & AIM: jesieltrevisan
YahooMessager: jesiel.trevisan
ICQ:: 46527510
_______________________________________________________

--------------------------
Grant Ingersoll
http://lucene.grantingersoll.com

Lucene Helpful Hints:
http://wiki.apache.org/lucene-java/BasicsOfPerformance
http://wiki.apache.org/lucene-java/LuceneFAQ



