[General] Webboard: In/out links and fetching time for each page + xpath

2013-12-05 Thread bar
Author: Alexander Barkov Email: b...@mnogosearch.org Message: Hi, Hi there, Is it possible to obtain these informations after having crawled a website : - Fetching / downloading time of each page - Total in and out links (from the website structure itself) This is possible in

[General] Webboard: In/out links and fetching time for each page + xpath

2013-12-05 Thread bar
Author: Alexander Barkov Email: b...@mnogosearch.org Message: skip I guess you need this is for XML files. XPath is currently not possible. We could take advantage of libxml2 to add XPath support. But this needs some development efforts. Btw, simple extraction from a given XML tag is