Hi, No idea how I get into these subtle issues that seem to defy logic...
I grabbed pdf files with: $ wget -r -nd -A \*pdf http://www.ncrealtors.org/sampleforms.cfm When I re-issue the command, hoping for just updated files, wget re-downloads everything as filename.1, then deletes that file. Any idea how to prevent these useless downloads? Thanks, Pierre Here's the console output... $ wget -r -nd -A \*pdf http://www.ncrealtors.org/sampleforms.cfm --2014-03-16 18:02:38-- http://www.ncrealtors.org/sampleforms.cfm Resolving www.ncrealtors.org (www.ncrealtors.org)... 208.106.182.41 Connecting to www.ncrealtors.org (www.ncrealtors.org)|208.106.182.41|:80... connected. HTTP request sent, awaiting response... 200 OK Length: unspecified [text/html] Saving to: ‘sampleforms.cfm’ [ <=> ] 78,572 126KB/s in 0.6s 2014-03-16 18:02:39 (126 KB/s) - ‘sampleforms.cfm’ saved [78572] Loading robots.txt; please ignore errors. --2014-03-16 18:02:39-- http://www.ncrealtors.org/robots.txt Connecting to www.ncrealtors.org (www.ncrealtors.org)|208.106.182.41|:80... connected. HTTP request sent, awaiting response... 200 OK Length: 380 [text/plain] Saving to: ‘robots.txt.1’ 100%[==================================================>] 380 --.-K/s in 0s 2014-03-16 18:02:39 (57.1 MB/s) - ‘robots.txt.1’ saved [380/380] Removing sampleforms.cfm since it should be rejected. --2014-03-16 18:02:39-- http://www.ncrealtors.org/uploads/formspolicy.pdf Reusing existing connection to www.ncrealtors.org:80. HTTP request sent, awaiting response... 200 OK Length: 169919 (166K) [application/pdf] Saving to: ‘formspolicy.pdf.1’ 100%[===================================================>] 169,919 233KB/s in 0.7s 2014-03-16 18:02:40 (233 KB/s) - ‘formspolicy.pdf.1’ saved [169919/169919] Removing formspolicy.pdf.1 since it should be rejected. [...]
