Author: rafikCyc
Email: rafikothm...@gmail.com
Message:
Hello,

I would like to crawl the whole html code for each url.

Is there anyway to do this ?

I've tried this in the indexer.conf but it doesn't work :

Section headhtml               25 2058 "<head([^>]*)>(*.)</head>" $2
Section bodyhtml               26 2058 "<body([^>]*)>(*.)</body>" $2
Section htmlcode                25 2058 "<html([^>]*)>(*.)</html>" $2

Section body                    1       2018    afterheaders    html
gets the body but with all htlm tags stripped out :(


Thank you for your help


Reply: <http://www.mnogosearch.org/board/message.php?id=21772>

_______________________________________________
General mailing list
General@mnogosearch.org
http://lists.mnogosearch.org/listinfo/general

Reply via email to