RE: Character encoding

2005-04-05 Thread Alan Hunter
Niksic [mailto:[EMAIL PROTECTED] Sent: March 31, 2005 3:19 PM To: Alan Hunter Cc: 'wget@sunsite.dk' Subject: Re: Character encoding I'm not sure what causes this problem, but I suspect it does not come from Wget doing something wrong. That Notepad opens the file correctly is indicative enough. Maybe

RE: Character encoding

2005-04-01 Thread Alan Hunter
Message- From: Hrvoje Niksic [mailto:[EMAIL PROTECTED] Sent: March 31, 2005 3:19 PM To: Alan Hunter Cc: 'wget@sunsite.dk' Subject: Re: Character encoding I'm not sure what causes this problem, but I suspect it does not come from Wget doing something wrong. That Notepad opens the file

Character encoding

2005-03-31 Thread Alan Hunter
Hi, I have a webpage that has some html textthat has been pasted from MS Word and the quote char ' is a special "type", ie not the ascii one. This char displays fine in IE/Firefox. However, when I spider the page with Wget (windows) it encodes this character in a funny way e.g. areaĆ¢(tm)s =

RE: Character encoding

2005-03-31 Thread Alan Hunter
Hi, Thanks for the reply. It is the page text that is the problem. When I started to investigate it further I found that it actually only happens when the page being wgot is a .aspx (.net asp) file. I made 3 identical files (as below), one with .html ext, 1 with .aspx ext and one with .zzz