Yes, that is an option we are certainly considering, but we would rather have a start page and forget about it. Cheers, Fr
On 1/20/06, Neal Whitley <[EMAIL PROTECTED]> wrote: > Franz, > > Someone else will need to confirm this... > > FYI...why not simply inject the urls directly into Nutch? > > ./nutch inject db/ -urlfile seeds.txt > > > At 03:49 PM 1/20/2006, you wrote: > > >Thank you, but if I do that will the page be read for urls? > >Cheers, Frank > > > >On 1/20/06, Neal Whitley <[EMAIL PROTECTED]> wrote: > > > Franz, > > > > > > I 'think' you could use the regex url filter to not index this page > > > (regex-urlfilter.txt). > > > > > > Something like: -^http://([a-z0-9]*\.)*tripod.com/ > > > > > > I am new to Nutch so I make no guarantee... :-) > > > > > > Neal > > > > > > > > > > > > At 05:23 AM 1/20/2006, you wrote: > > > > > > >Hello, > > > > > > > >We are trying to implement Nutch on an intranet and have setup a > > > >special page which has links to all the other pages of the site, since > > > >many are not linked together. > > > >We will start with this special page and then go from there to all the > > > >other pages, but we would like to not index the first page (so that it > > > >doesn't show up in search results), just use it for its links. > > > >Is it possible easily? > > > > > > > >Thank you. > > > > > > > >
