Substituting %3E for ">" does not work either.  
  ----- Original Message ----- 
  From: Alan Thomas 
  To: wget@sunsite.dk 
  Sent: Saturday, March 31, 2007 3:23 AM
  Subject: Special characters in http


      I am using wget 1.10.2 on a Windows 98 machine. I would like to 
non-interactively query the U.S. patent database.  I am using the following 
wget command:

  wget --convert-links --directory-prefix="C:\Program Files\wget\perimeter" 
--no-clobber 
http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=/netahtml/PTO/search-adv.html&r=0&p=1&f=S&l=50&Query=isd/1/1/2000->1/1/2010+and+(aclm/(software+and+hardware)+or+ttl/(software+and+hardware))&d=PTXT

  However, this query is seen by the server as:

  isd/1/1/2000-/1/2010 and (aclm/(software and hardware) or ttl/(software and 
hardware))

  So, the ">" character is being translated as "/", which the server does not 
like (no matches returned).  

          However, if I open the link below directly in my browser, it works 
fine:

  
http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=/netahtml/PTO/search-adv.html&r=0&p=1&f=S&l=50&Query=isd/1/1/2000->1/1/2010+and+(aclm/(software+and+hardware)+or+ttl/(software+and+hardware))&d=PTXT

          What is happening?  Is the problem due to the fact that the ">" 
character is reserved in HTML?  Is there something that I should do 
differently?  

          I am still a novice to wget and http.  I have looked in the wget 
manual (probably not strictly a wget question) and on the web, but have not 
found where this is discussed.  

  Thanks, Alan



Reply via email to