Hello,

Just what to be sure about link inversion.

When I run
bin/nutch invertlinks linkdb some_segment
it adds specified segment to linkdb preserving existing data, running
bin/nutch invertlinks linkdb -dir some_parent_segment_dir
adds all segments do linkdb preserving data.

When I want to create fresh new linkdb the -dir argument does the job (it
will fill linkdb from all segments) and there is no need to run invertlinks
for each segment separately. Is that correct?

Btw. is there a way to use wildcards with readlinkdb -url argument? Sth
like that:
bin/nutch readlinkdb linkdb -url http://www.xyz.com/* to get all inlinks
pointing to xyz.com

Regards,
Tomasz

Reply via email to