URL: <https://savannah.gnu.org/bugs/?55603>
Summary: Naive comment type in use by webdevs not recognized
Project: GNU Wget
Submitted by: None
Submitted on: Tue 29 Jan 2019 03:56:13 PM UTC
Category: Program Logic
Severity: 3 - Normal
Priority: 5 - Normal
Status: None
Privacy: Public
Assigned to: None
Originator Name:
Originator Email:
Open/Closed: Open
Discussion Lock: Any
Release: 1.20
Operating System: Microsoft Windows
Reproducibility: Every Time
Fixed Release: None
Planned Release: None
Regression: None
Work Required: None
Patch Included: No

_______________________________________________________

Details:

Some webdevs write comments like this:

<!--comment comment--!>

It displays correctly in web browsers (Chrome and Firefox checked). Wget does not recognize the "--!>" termination.

I invoke Wget with:

    start wget -nd -r -l 0 --reject-regex sort=nam -I /neopets http://lost.quiggle.org/

Using "--strict-comments" fixes it for this particular website. I don't know how Wget sees the comments in this mode; is it "<!--comment>"?

The only occurrence of a "-->" is towards the end of the webpage, and that is what throws naive comment parsing off. Maybe a strict HTML comment just can't be pieced together on this website.

I also suspect that naive comments mode parses this block differently from Chrome, though I am not sure:

<script type="text/javascript"><!--
google_ad_client = "ca-pub-2676160449557179";
/* indexbase */
google_ad_slot = "6483197392";
google_ad_width = 300;
google_ad_height = 250;
//-->
</script>

Chrome does not see an HTML comment there.

log.txt attached

_______________________________________________________

File Attachments:

-------------------------------------------------------
Date: Tue 29 Jan 2019 03:56:13 PM UTC  Name: log.txt  Size: 49KiB  By: None

<http://savannah.gnu.org/bugs/download.php?file_id=46125>

_______________________________________________________

Reply to this item at:

  <https://savannah.gnu.org/bugs/?55603>

_______________________________________________
  Message sent via Savannah
  https://savannah.gnu.org/
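
To illustrate the behavior the report asks for, here is a minimal sketch of a lenient comment scanner that accepts both "-->" and "--!>" as terminators. The HTML Standard's tokenizer also treats "--!>" as the end of a comment, flagging it as a parse error, which would explain why browsers render the page correctly. This is not Wget's code: the names `skip_comment` and `strip_comments` are hypothetical, and the sketch does not special-case `<script>` content, which browsers treat as raw text rather than as markup.

```python
import re

# Hypothetical helper, not taken from Wget: either "-->" or "--!>"
# is accepted as the end of a comment.
_COMMENT_END = re.compile(r"--!?>")


def skip_comment(html: str, start: int) -> int:
    """Return the index just past the comment that opens at `start`.

    `start` must point at "<!--". If no terminator is found, the rest
    of the input is treated as comment text and len(html) is returned.
    """
    if not html.startswith("<!--", start):
        raise ValueError("no comment opener at this offset")
    match = _COMMENT_END.search(html, start + 4)
    return match.end() if match else len(html)


def strip_comments(html: str) -> str:
    """Return `html` with every comment removed (assumed lenient rules)."""
    out = []
    pos = 0
    while True:
        opener = html.find("<!--", pos)
        if opener == -1:
            out.append(html[pos:])  # no more comments: keep the tail
            break
        out.append(html[pos:opener])  # keep text before the comment
        pos = skip_comment(html, opener)
    return "".join(out)
```

With a scanner along these lines, links that follow a "<!--comment comment--!>" comment stay visible to the link extractor instead of being swallowed until the next "-->" further down the page.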