I am using wget 1.10.2 on a Windows 98 machine. I would like to non-interactively query the U.S. patent database. I am using the following wget command:
wget --convert-links --directory-prefix="C:\Program Files\wget\perimeter" --no-clobber http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=/netahtml/PTO/search-adv.html&r=0&p=1&f=S&l=50&Query=isd/1/1/2000->1/1/2010+and+(aclm/(software+and+hardware)+or+ttl/(software+and+hardware))&d=PTXT However, this query is seen by the server as: isd/1/1/2000-/1/2010 and (aclm/(software and hardware) or ttl/(software and hardware)) So, the ">" character is being translated as "/", which the server does not like (no matches returned). However, if I open the link below directly in my browser, it works fine: http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&u=/netahtml/PTO/search-adv.html&r=0&p=1&f=S&l=50&Query=isd/1/1/2000->1/1/2010+and+(aclm/(software+and+hardware)+or+ttl/(software+and+hardware))&d=PTXT What is happening? Is the problem due to the fact that the ">" character is reserved in HTML? Is there something that I should do differently? I am still a novice to wget and http. I have looked in the wget manual (probably not strictly a wget question) and on the web, but have not found where this is discussed. Thanks, Alan