Author: Alexander Barkov
Email: b...@mnogosearch.org
Message:
Many thanks
I use XPath every day to find content in XHTML pages and it works pretty
well.
XHTML is valid XML, so XPath should work.
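
To illustrate the point that XHTML can be queried like any other XML, here is a minimal sketch in Python using only the standard library. It is not mnogosearch functionality: the sample page, the `content` class name and the `extract_content` helper are all made up for the example, and `xml.etree.ElementTree` supports only a limited XPath subset.

```python
import xml.etree.ElementTree as ET

# Hypothetical XHTML page; a well-formed XHTML document parses as plain XML.
XHTML = """<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Sample page</title></head>
  <body>
    <div class="content"><p>First paragraph.</p><p>Second paragraph.</p></div>
    <div class="footer"><p>Footer text.</p></div>
  </body>
</html>"""

# XHTML elements live in this namespace, so the XPath steps need a prefix.
NS = {"x": "http://www.w3.org/1999/xhtml"}


def extract_content(xhtml):
    """Return the text of every <p> inside <div class="content">."""
    root = ET.fromstring(xhtml)
    nodes = root.findall(".//x:div[@class='content']/x:p", NS)
    return ["".join(p.itertext()).strip() for p in nodes]


if __name__ == "__main__":
    for text in extract_content(XHTML):
        print(text)
```

Note that this only works when the page really is well-formed XML; ordinary HTML with unclosed tags would make the parser raise an error.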
Thank you so much for your answers.
Any idea when 3.4 could be released?
Author: Mamadoo
Email: fohoi...@gmail.com
Message:
For fetching time, OK, thanks! Great news!
For the in/out links per page, any chance you'll add this one day?
For XPath, thanks, but no, it's not for XML parsing.
I would need it, for example, to scrape specific content from my pages.
Reply:
Author: Alexander Barkov
Email: b...@mnogosearch.org
Message:
For fetching time, OK, thanks! Great news!
For the in/out links per page, any chance you'll add this one day?
As I said in the previous message, in 3.3.4
*ALL* in/out links can be collected into the table links.
It's trivial to
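
Once links are stored in a table, per-page in/out totals come down to two GROUP BY queries. The following is only a rough sketch under stated assumptions: it uses an in-memory SQLite database, and the column names `src` and `dst` are hypothetical, not the actual layout of the mnogosearch `links` table, so the queries would need adapting to the real schema.

```python
import sqlite3


def link_counts(conn):
    """Return {page: (in_links, out_links)} from a links(src, dst) table.

    The src/dst column names are assumptions made for this example.
    """
    counts = {}
    # Outgoing links: group by the page the link starts from.
    for page, n in conn.execute("SELECT src, COUNT(*) FROM links GROUP BY src"):
        counts[page] = (0, n)
    # Incoming links: group by the page the link points to.
    for page, n in conn.execute("SELECT dst, COUNT(*) FROM links GROUP BY dst"):
        _, out_links = counts.get(page, (0, 0))
        counts[page] = (n, out_links)
    return counts


def demo():
    """Build a tiny hypothetical link graph and count links per page."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE links (src TEXT, dst TEXT)")
    conn.executemany(
        "INSERT INTO links VALUES (?, ?)",
        [("/", "/a"), ("/", "/b"), ("/a", "/b"), ("/b", "/")],
    )
    return link_counts(conn)


if __name__ == "__main__":
    for page, (n_in, n_out) in sorted(demo().items()):
        print(page, "in:", n_in, "out:", n_out)
```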
Author: Mamadoo
Email: fohoi...@gmail.com
Message:
Many thanks
I use XPath every day to find content in XHTML pages and it works pretty well.
Thank you so much for your answers.
Any idea when 3.4 could be released?
Reply: http://www.mnogosearch.org/board/message.php?id=21602
Author: Alexander Barkov
Email: b...@mnogosearch.org
Message:
Hi,
Hi there,
Is it possible to obtain the following information after crawling a website:
- Fetching / downloading time of each page
- Total in and out links (from the website structure itself)
This is possible in
Author: Alexander Barkov
Email: b...@mnogosearch.org
Message:
skip
I guess you need this for XML files.
XPath is currently not supported. We could take advantage
of libxml2 to add XPath support. But this needs some
development efforts.
Btw, simple extraction from a given XML tag is
Author: Mamadoo
Email: fohoi...@gmail.com
Message:
Hi there,
Is it possible to obtain the following information after crawling a website:
- Fetching / downloading time of each page
- Total in and out links (from the website structure itself)
Would it be possible to add xpath support instead of