Uroš Gruber wrote:
Hi,
Could someone point me how to get CrawlDatum data from key url in
ParseOutputFormat.write [83].
I would like to add data to link urls but this data depend on data of
url being crawled.
You can't, because that instance of CrawlDatum is not available at this
place. Either you need to provide it on the input to the map/reduce job
(but then you will have to change input and output formats), or you
should prepare this information in advance during parsing, and put it
into ParseData.metadata.
I hope I was clear enough about my problem.
I hope so too ;)
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com