Author: rafikCyc

I would like to crawl the whole html code for each url.

Is there anyway to do this ?

I've tried this in the indexer.conf but it doesn't work :

Section headhtml               25 2058 "<head([^>]*)>(*.)</head>" $2
Section bodyhtml               26 2058 "<body([^>]*)>(*.)</body>" $2
Section htmlcode                25 2058 "<html([^>]*)>(*.)</html>" $2

Section body                    1       2018    afterheaders    html
gets the body but with all htlm tags stripped out :(

Thank you for your help

Reply: <>

General mailing list

Reply via email to