Which is better for overall performance: parsing during fetching, or afterward?
On Thu, Sep 18, 2008 at 4:01 PM, Andrzej Bialecki <[EMAIL PROTECTED]> wrote:

> Kevin MacDonald wrote:
>
>> I'm sure it's just my ignorance of some basics of nutch. The way I read that
>> code it said to me "if I'm not supposed to parse, go ahead and parse".
>
> "If I'm not supposed to parse during fetching, go ahead and parse it after
> I'm done with fetching, because I only have unparsed content".
>
> You still need parsing in order to get the outlinks.
>
> --
> Best regards,
> Andrzej Bialecki <><
> ___. ___ ___ ___ _ _ __________________________________
> [__ || __|__/|__||\/| Information Retrieval, Semantic Web
> ___|||__|| \| || | Embedded Unix, System Integration
> http://www.sigram.com Contact: info at sigram dot com
