Hi Ernesto!
Meta tags are custom tags that you add to your web page, more precisely
inside the <head></head> element, to describe the contents of the
web page to search engine indexes. For example, you can add meta tags to
describe the author of the page, keywords, cache behavior, and so on. What you can
Hi Lourival
Thanks, I see, I understand it now. I know meta tags in HTML, but I can't use
them, because I want to crawl pages from other sites. I think I will categorize
the pages by URL, with regular expressions.
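
A minimal sketch of that idea, assuming a small hand-written rule list: each
rule pairs a regular expression with a category, and the first rule that
matches the URL wins. The patterns, category names, and the categorize()
helper are hypothetical and only illustrate the approach; this is not Nutch
code.

```python
import re

# Hypothetical rules: (compiled pattern, category). The first match wins,
# so more specific patterns should come before more general ones.
CATEGORY_RULES = [
    (re.compile(r"^https?://[^/]+/sports?/"), "sports"),
    (re.compile(r"^https?://news\."), "news"),
    (re.compile(r"/blog/"), "blog"),
]


def categorize(url, default="uncategorized"):
    """Return the category of the first rule whose pattern is found in url."""
    for pattern, category in CATEGORY_RULES:
        if pattern.search(url):
            return category
    return default
```

Keeping the rules in a list rather than a dict makes the matching order
explicit, which matters when a URL could satisfy more than one pattern.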
Thank you very much! See you later...
;)
Ernesto.
Lourival Júnior [EMAIL PROTECTED] wrote:
You probably still want to write a plugin. You can use whatever
algorithms you like to identify a site category, then add that as a
field in the index.
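
As a rough illustration of "classify, then add a field", the sketch below
uses a plain dict as a stand-in for an index document and a hypothetical
host lookup table as the classifier. None of these names come from Nutch; in
a real setup this step would live inside the plugin, and any classification
algorithm could be passed in instead of classify_by_host().

```python
from urllib.parse import urlparse

# Hypothetical host-to-category table; any algorithm could replace it.
HOST_CATEGORIES = {
    "news.example.org": "news",
    "shop.example.com": "shopping",
}


def classify_by_host(url):
    """Look up the URL's host name in the table; 'unknown' if absent."""
    host = urlparse(url).hostname or ""
    return HOST_CATEGORIES.get(host, "unknown")


def add_category_field(doc, classify=classify_by_host):
    """Return a copy of doc with a 'category' field derived from doc['url']."""
    enriched = dict(doc)  # leave the caller's document untouched
    enriched["category"] = classify(doc["url"])
    return enriched
```

Passing the classifier as a parameter keeps the field-adding step independent
of how the category is actually computed.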
Ernesto De Santis wrote:
Hi Lourival
Thanks, I see, I understand it now. I know meta tags in HTML, but I can't use
them, because I want to
Hi All
Please, can somebody answer my questions?
I'm a Nutch beginner, so I hope that my questions/doubts are easy... ;)
Or if my email is wrong, tell me. Or confirm that I'm on the right track.
Thanks a lot!
Ernesto.
Ernesto De Santis wrote:
Hi
I'm new to Nutch; I started yesterday.
But I have
Hi Ernesto!
I know what you mean. Sometimes I get no answers either. Unfortunately, I'm new
to Nutch and Lucene and I can't help you. Keep trying; the community will
help you :).
On 8/22/06, Ernesto De Santis [EMAIL PROTECTED] wrote:
Hi All
Please, can somebody answer my questions?
I'm a
You can't do it unless you write a plugin to parse a custom meta tag
called category.
I'm trying to do something like this now, but the plugin documentation
is horrible.
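
To make the parsing step concrete, here is a minimal standalone sketch that
pulls the content of a meta tag named "category" out of an HTML page using
Python's standard html.parser module. It is not a Nutch plugin and does not
use any Nutch API; the class and function names are hypothetical, and the
tag name "category" is taken from the message above.

```python
from html.parser import HTMLParser


class CategoryMetaParser(HTMLParser):
    """Remember the content of the first <meta name="category"> tag seen."""

    def __init__(self):
        super().__init__()
        self.category = None

    def handle_starttag(self, tag, attrs):
        # Tag and attribute names arrive lowercased from HTMLParser.
        if tag != "meta" or self.category is not None:
            return
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() == "category":
            self.category = attr_map.get("content", "").strip() or None


def extract_category(html):
    """Return the category meta value from html, or None if absent."""
    parser = CategoryMetaParser()
    parser.feed(html)
    return parser.category
```

A page without such a tag yields None, so the caller can fall back to
another way of assigning a category.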
Lourival Júnior wrote:
Hi Ernesto!
I know what you mean. Sometimes I get no answers either. Unfortunately,
I'm new
to
You're right that it's not well documented, but it's actually
pretty simple to do.
You need to write an indexing filter and a query filter. For the indexing
filter, I would copy the index-more plugin, and change names, dirs, and
build files appropriately. The main thing you'll change is the