Retrieving multiple files
I would like to ftp four files from a host. The files are in a large directory, and are different such that wild cards won't do it. If I write the following routine: WGET -N -i Files.txt where Files.txt is: ftp://ftp.f-prot.is/pub/fp-def.zip ftp://ftp.f-prot.is/pub/fp-def.asc ftp://ftp.f-prot.is/pub/macrdef2.zip ftp://ftp.f-prot.is/pub/nomacro.def the process disconnects from the host after each transfer, and reconnects for the next transfer. Is there a syntax such that I can connect to the host once, transfer the four files, and then disconnect? If the host isn't busy it probably doesn't make any difference, but if the host is busy, one doesn't want to lose an established connection. Thanks, Fred Holmes [EMAIL PROTECTED]
[±¤°í] Æí¸®ÇÑ Ãë¾÷»çÀÌÆ® Àâ´º½ºÀÔ´Ï´Ù
Title: Àâ´º½º - ¼ÖÁ÷ÇÑ ÇÁ·ÎÆ÷ÁîÀÇ ±â¾÷°ú ÀÎÀç°¡ ¸¸³ª´Â ±¸ÀÎ/±¸Á÷ Àü¹® À¥ ¸®Å©·çÆûçÀÌÆ®! ¾È³çÇϼ¼¿ä Ãë¾÷Àü¹®»çÀÌÆ® Àâ´º½ºÀÔ´Ï´Ù. º» ¸ÞÀÏÀº Àâ´º½ºÀÇ »õ·Î¿î °³Æí ¼Ò½ÄÀ» ¾Ë·Áµå¸®±â À§ÇØ º¸³»µå¸° ¸ÞÀÏÀÔ´Ï´Ù. º» ¸ÞÀÏÀº 2~3°³¿ù¿¡ 1ȸ ¹ß¼ÛµÇ¾îÁý´Ï´Ù. ¼ö½ÅÀ» ¿øÄ¡ ¾ÊÀ¸½Ã¸é ¾Æ·¡ÀÇ ¼ö½Å°ÅºÎ¸¦ Ŭ¸¯ÇØÁÖ¼¼¿ä. ¡¡ °³ÀΠȸ¿øÀ» À§ÇÑ My vision ¡¤ Æí¸®ÇÑ ³ªÀÇ À̷¼ °ü¸® À̷¼¸¦ 10°³ ±îÁö ´Ù¸¥ ³»¿ëÀ¸·Î µî·ÏÇÏ½Ç ¼ö ÀÖ½À´Ï´Ù. ¶ÇÇÑ ¸ðµç ä¿ë°ø°í¿¡ À̸ÞÀÏ ÀÔ»çÁö¿ø ±â´ÉÀÌ ÀÖ¾î °£ÆíÇÏ°Ô ¿øÇÏ´Â ±â¾÷¿¡ ¿øÇϽô À̷¼·Î Áö¿øÇÏ½Ç ¼ö ÀÖ½À´Ï´Ù. ¿Â¶óÀÎ/À̸ÞÀÏÁö¿ø ȸ»ç ¸®½ºÆ®°ü¸®´Â ±âº».. ¡¤ À̷¼ ¿öµåÆÄÀÏ ÀúÀå ¡¤ ¿Â¶óÀÎ/À̸ÞÀÏ ÀÔ»çÁö¿ø ±â¾÷ ȸ¿øÀ» À§ÇÑ ±â¾÷ä¿ë¼ºñ½º ¡¤ Æí¸®ÇÑ ¿Â¶óÀΠä¿ë¼ºñ½º ¹«·á·Î ä¿ë°ø°í¸¦ Á÷Á¾¼ö Á¦ÇѾøÀÌ µî·ÏÇÏ½Ç ¼ö ÀÖ½À´Ï´Ù. ¸ÞÀÎ ¹è³ÊÇü ä¿ë°ø°í ¶ÇÇÑ Ãë¾÷»çÀÌÆ®Áß °¡Àå Àú·ÅÇÑ ¿ä±ÝÀ¸·Î ¼ºñ½ºÇص帮°í ÀÖ½À´Ï´Ù. ±× ¿Ü ´Ù¾çÇÑ ÀÎÀçÆÄ°ß / ä¿ë´ëÇà ¼ºñ½ºµµ ÀÌ¿ëÇØ º¸¼¼¿ä. ¡¤ ä¿ë´ëÇà / ÀÎÀçÆÄ°ß ¼ºñ½º ¡¤ Àâ´º½º ÇìµåÇåÆà ¼ºñ½º ¿©¼ºÀ¯¸Á Á÷Á¾Á¤º¸ Ãë¾÷¿¡ µµ¿òÀÌ µÇ´Â ÀÚ°ÝÁõ Á¤º¸ ´ç´çÇÑ Àâ´º½º ¿©¼º ȸ¿ø¸¸À» À§ÇÑ 70 ¿©°³ÀÇ À¯¸Á Á÷Á¾Á¤º¸. 21C ÃֽŠÁ÷¾÷Á¤º¸.. ÀúÈñ Àâ´º½º¿Í ÇÔ²² Ä¿¸®¾î ¿ì¸ÕÀÇ ²ÞÀ» ÀÌ·ç¾î º¸¼¼¿ä Æò¼Ò ±Ã±ÝÇß´ø °¢ ºÐ¾ßº° ÀÚ°ÝÁõ¿¡ °üÇÑ »ó¼¼Á¤º¸¸¦ ¸ð¾Ò¾î¿ä. ÀÚ°ÝÁõ¿¡ ´ëÇÑ ¼öÇàÁ÷¹«, Ãëµæ¹æ¹ý, Ãë¾÷Àü¸ÁÁ¤º¸. ÀÚ°ÝÁõÀ¸·Î Ãë¾÷ ½±°Ô ÇÏÀÚ±¸¿ä. ¸ÚÁø À̷¼ ÀÛ¼ºÇÏ´Â ¹æ¹ý ¾Ë¾ÆµÎ¸é µçµçÇÑ ¸éÁ¢ ¿ä·É ³ª¿¡°Ô µü ¸Â´Â ¸ÚÁø À̷¼¿Í ÀÚ±â¼Ò°³¼¸¦ ÀÛ¼ºÇÏ´Â ¹æ¹ý. ³ª¸¦ Ç¥ÇöÇÏ´Â À̷¼, ´ëÃæ´ëÃæ ÀÛ¼ºÇؼ± ¸éÁ¢º¸±âµµ Èûµé¾î¿ä. À̷¼ ÀÛ¼º ³ëÇϿ츦 ¾Ë·Áµå·Á¿ä ¸éÁ¢ÀÇ Á¾·ù¿¡¼ºÎÅÍ ÃÖ±Ù ¸éÁ¢µ¿Çâ±îÁö ´Ù¾çÇÑ ¸éÁ¢ Á¤º¸. °¡½¿ÀÌ µÎ±ÙµÎ±Ù.. ¶³¸®´Â ¸éÁ¢½Ã°£. ¸éÁ¢½Ã ²À ¾Ë¾ÆµÎ¾î¾ß ÇÒ ¿ä·ÉµéÀ» ¾Ë·Áµå·Á¿ä º» ¸ÞÀÏÀº Á¤º¸Åë½ÅºÎ ±Ç°í »çÇ׿¡ ÀÇ°Å Á¦¸ñ¿¡ [±¤°í]¶ó Ç¥½ÃµÈ ±¤°í ¸ÞÀÏÀÔ´Ï´Ù. ´õ ÀÌ»ó ¸ÞÀÏÀ» ¼ö½ÅÇÏ°í ½ÍÁö ¾ÊÀ¸½Ã¸é [¼ö½Å °ÅºÎ]¸¦ Ŭ¸¯ÇØ ÁֽʽÿÀ. ¡¡
[±¤°í] Æí¸®ÇÑ Ãë¾÷»çÀÌÆ® Àâ´º½ºÀÔ´Ï´Ù
Title: Àâ´º½º - ¼ÖÁ÷ÇÑ ÇÁ·ÎÆ÷ÁîÀÇ ±â¾÷°ú ÀÎÀç°¡ ¸¸³ª´Â ±¸ÀÎ/±¸Á÷ Àü¹® À¥ ¸®Å©·çÆûçÀÌÆ®! ¾È³çÇϼ¼¿ä Ãë¾÷Àü¹®»çÀÌÆ® Àâ´º½ºÀÔ´Ï´Ù. º» ¸ÞÀÏÀº Àâ´º½ºÀÇ »õ·Î¿î °³Æí ¼Ò½ÄÀ» ¾Ë·Áµå¸®±â À§ÇØ º¸³»µå¸° ¸ÞÀÏÀÔ´Ï´Ù. º» ¸ÞÀÏÀº 2~3°³¿ù¿¡ 1ȸ ¹ß¼ÛµÇ¾îÁý´Ï´Ù. ¼ö½ÅÀ» ¿øÄ¡ ¾ÊÀ¸½Ã¸é ¾Æ·¡ÀÇ ¼ö½Å°ÅºÎ¸¦ Ŭ¸¯ÇØÁÖ¼¼¿ä. ¡¡ °³ÀΠȸ¿øÀ» À§ÇÑ My vision ¡¤ Æí¸®ÇÑ ³ªÀÇ À̷¼ °ü¸® À̷¼¸¦ 10°³ ±îÁö ´Ù¸¥ ³»¿ëÀ¸·Î µî·ÏÇÏ½Ç ¼ö ÀÖ½À´Ï´Ù. ¶ÇÇÑ ¸ðµç ä¿ë°ø°í¿¡ À̸ÞÀÏ ÀÔ»çÁö¿ø ±â´ÉÀÌ ÀÖ¾î °£ÆíÇÏ°Ô ¿øÇÏ´Â ±â¾÷¿¡ ¿øÇϽô À̷¼·Î Áö¿øÇÏ½Ç ¼ö ÀÖ½À´Ï´Ù. ¿Â¶óÀÎ/À̸ÞÀÏÁö¿ø ȸ»ç ¸®½ºÆ®°ü¸®´Â ±âº».. ¡¤ À̷¼ ¿öµåÆÄÀÏ ÀúÀå ¡¤ ¿Â¶óÀÎ/À̸ÞÀÏ ÀÔ»çÁö¿ø ±â¾÷ ȸ¿øÀ» À§ÇÑ ±â¾÷ä¿ë¼ºñ½º ¡¤ Æí¸®ÇÑ ¿Â¶óÀΠä¿ë¼ºñ½º ¹«·á·Î ä¿ë°ø°í¸¦ Á÷Á¾¼ö Á¦ÇѾøÀÌ µî·ÏÇÏ½Ç ¼ö ÀÖ½À´Ï´Ù. ¸ÞÀÎ ¹è³ÊÇü ä¿ë°ø°í ¶ÇÇÑ Ãë¾÷»çÀÌÆ®Áß °¡Àå Àú·ÅÇÑ ¿ä±ÝÀ¸·Î ¼ºñ½ºÇص帮°í ÀÖ½À´Ï´Ù. ±× ¿Ü ´Ù¾çÇÑ ÀÎÀçÆÄ°ß / ä¿ë´ëÇà ¼ºñ½ºµµ ÀÌ¿ëÇØ º¸¼¼¿ä. ¡¤ ä¿ë´ëÇà / ÀÎÀçÆÄ°ß ¼ºñ½º ¡¤ Àâ´º½º ÇìµåÇåÆà ¼ºñ½º ¿©¼ºÀ¯¸Á Á÷Á¾Á¤º¸ Ãë¾÷¿¡ µµ¿òÀÌ µÇ´Â ÀÚ°ÝÁõ Á¤º¸ ´ç´çÇÑ Àâ´º½º ¿©¼º ȸ¿ø¸¸À» À§ÇÑ 70 ¿©°³ÀÇ À¯¸Á Á÷Á¾Á¤º¸. 21C ÃֽŠÁ÷¾÷Á¤º¸.. ÀúÈñ Àâ´º½º¿Í ÇÔ²² Ä¿¸®¾î ¿ì¸ÕÀÇ ²ÞÀ» ÀÌ·ç¾î º¸¼¼¿ä Æò¼Ò ±Ã±ÝÇß´ø °¢ ºÐ¾ßº° ÀÚ°ÝÁõ¿¡ °üÇÑ »ó¼¼Á¤º¸¸¦ ¸ð¾Ò¾î¿ä. ÀÚ°ÝÁõ¿¡ ´ëÇÑ ¼öÇàÁ÷¹«, Ãëµæ¹æ¹ý, Ãë¾÷Àü¸ÁÁ¤º¸. ÀÚ°ÝÁõÀ¸·Î Ãë¾÷ ½±°Ô ÇÏÀÚ±¸¿ä. ¸ÚÁø À̷¼ ÀÛ¼ºÇÏ´Â ¹æ¹ý ¾Ë¾ÆµÎ¸é µçµçÇÑ ¸éÁ¢ ¿ä·É ³ª¿¡°Ô µü ¸Â´Â ¸ÚÁø À̷¼¿Í ÀÚ±â¼Ò°³¼¸¦ ÀÛ¼ºÇÏ´Â ¹æ¹ý. ³ª¸¦ Ç¥ÇöÇÏ´Â À̷¼, ´ëÃæ´ëÃæ ÀÛ¼ºÇؼ± ¸éÁ¢º¸±âµµ Èûµé¾î¿ä. À̷¼ ÀÛ¼º ³ëÇϿ츦 ¾Ë·Áµå·Á¿ä ¸éÁ¢ÀÇ Á¾·ù¿¡¼ºÎÅÍ ÃÖ±Ù ¸éÁ¢µ¿Çâ±îÁö ´Ù¾çÇÑ ¸éÁ¢ Á¤º¸. °¡½¿ÀÌ µÎ±ÙµÎ±Ù.. ¶³¸®´Â ¸éÁ¢½Ã°£. ¸éÁ¢½Ã ²À ¾Ë¾ÆµÎ¾î¾ß ÇÒ ¿ä·ÉµéÀ» ¾Ë·Áµå·Á¿ä º» ¸ÞÀÏÀº Á¤º¸Åë½ÅºÎ ±Ç°í »çÇ׿¡ ÀÇ°Å Á¦¸ñ¿¡ [±¤°í]¶ó Ç¥½ÃµÈ ±¤°í ¸ÞÀÏÀÔ´Ï´Ù. ´õ ÀÌ»ó ¸ÞÀÏÀ» ¼ö½ÅÇÏ°í ½ÍÁö ¾ÊÀ¸½Ã¸é [¼ö½Å °ÅºÎ]¸¦ Ŭ¸¯ÇØ ÁֽʽÿÀ. ¡¡
Re: Wget 1.8.1 BUG
Ryan Daniels [EMAIL PROTECTED] writes: The following command line causes a Segfault on my system: wget -spider http://www.yahoo.com Note that the correct syntax is `--spider', and that this (currently defunct) option does not accept arguments. But the bug you've uncovered is real: you can repeat it with: wget -r I'll try to fix it.
Re: SSL site mirroring
Thomas Reinke [EMAIL PROTECTED] writes: Ok, either I've completely misread wget, or it has a problem mirroring SSL sites. It appears that it is deciding that the https:// scheme is something that is not to be followed. That's a bug. Your patch is close to how it should be fixed, with two pending issues: * SCHEME_HTTPS is undefined when SSL is not used. The identifier shouldn't be used unconditionally. * Should we have a scheme_http_like property and test for it instead of using u-scheme == SCHEME_HTTP || u-scheme == SCHEME_HTTPS?
Re: Feature wish for wget
Herold Heiko [EMAIL PROTECTED] writes: But that's not the real issue here - why -i for input but not for others ? A consistent interface should allow something like --file-char=@ -@Rfilename -@Aotherfilename ecc., i.e. accept a filename everywhere a option is allowed. This is a neat idea, but traditional Unix utilities have at best been flaky at supporting this sort of thing. Getopt for one does not support this kind of option mangling. Also, pursuing this route would raise more questions about the format of the files -- newline or comma-separated fields would hardly be sufficient in all cases.
Re: -k bug
Peter Gucwa @ IIS-RTP [EMAIL PROTECTED] writes: option -k does not work in following call: wget -k -r -l 1 http://www.softcomputer.com/cgi/jobs.cgi What version of Wget are you using? How exactly does it not work? What did you expect to happen, and what happened instead?
Re: How to save what I see (2)
Robin B. Lake [EMAIL PROTECTED] writes: In a prior posting, I asked about saving an image from a Web page instead of just saving the information necessary to re-retrieve that image. I was advised to try -p -k --html-extension Using wget-1.8.1-pre2, I still don't see the image data saved in the .html file wget creates. `wget -p -k --html-extension URL' is supposed to do these things: 1. download the HTML file under URL 1a. rename the file to `.html' if it doesn't already have that extension. 2. download the inline images and style sheets needed to display that HTML file, provided they are on the same host. 3. convert the links in the HTML file so that they point either locally (in case of downloaded images) or to the remote location (in case of links to external resources). Which of these is not happening when you run it? (To make Wget access other hosts, use `--include-domains b.mktw.net'.) ./wget -nH -p -k --html-extension -q -O /QoI/NDX/working/CHARTS/$myday+NDX.html 'http://bigcharts.marketwatch.com/quickchart/quickchart.asp?symb=%24NDXsid=0o_symb=%24NDXx=60y=15freq=9time=1' -O will not work with -p. This is a limitation in how -p currently works.
Re: Bug? (not following 302's, or following them incorrectly)
Brendan Ragan [EMAIL PROTECTED] writes: This is the problem i'm having with an older wget (1.5.3) when i enter the url 'http://www.tranceaddict.com/cgi-bin/songout.php?id=1217-dirty_dirtymonth=dec' it goes Connecting to www.tranceaddict.com:80... connected! HTTP request sent, awaiting response... 302 Found Location: http://vid1.tranceaddict.com/mp3/singles/dec/Dirty-Dirty-(Original_Mix)-XP-www_tranceaddict_com.mp3 [following] and dutifully retrieves the file. On a newer version (1.7) it goes Connecting to www.tranceaddict.com:80... connected! HTTP request sent, awaiting response... 302 Moved Temporarily Location: http://www.yahoo.com [following] www.tranceaddict.com does not appear to be available at the moment, so I can't repeat this myself. Here are several hints how to provide more data: * Try the same URL with the latest version of Wget, which is 1.8.1. Many bugs have been fixed; perhaps this is one of them. * Use the `-d' switch to provide the debug output for both the successful 1.5.3 run and the unsuccessful 1.8.1 run, and mail both outputs here. That way we'll have a chance to determine what 1.8.1 is doing wrong.
Re: wget does not treat urls starting with // correctly
Ian Abbott [EMAIL PROTECTED] writes: On 4 Jan 2002 at 12:22, Bastiaan Stougie wrote: wget -P $LOCALDIR -m -np -nH -p --cut-dirs=2 http://host/dir1/dir2/ This works fine, except that wget does not follow all the urls. It skips urls like: A HREF=//host/dir1/dir2/filetext/A Wow, I had no idea things like this were used. src/ChangeLog entry: 2002-01-07 Ian Abbott [EMAIL PROTECTED] * url.c (uri_merge_1): Deal with net path relative URL (one that starts with //). [...] Thanks for the patch; I've now applied it (with minor changes in examples and comments.)
Re: SSL site mirroring
Hrvoje Niksic wrote: Thomas Reinke [EMAIL PROTECTED] writes: Ok, either I've completely misread wget, or it has a problem mirroring SSL sites. It appears that it is deciding that the https:// scheme is something that is not to be followed. That's a bug. Your patch is close to how it should be fixed, with two pending issues: * SCHEME_HTTPS is undefined when SSL is not used. The identifier shouldn't be used unconditionally. * Should we have a scheme_http_like property and test for it instead of using u-scheme == SCHEME_HTTP || u-scheme == SCHEME_HTTPS? Does raise an interesting question re a bug introduced by the patch - is https://HOST the the same host when it is referred to as http://HOST? Had found another problem introduced by the patch where u-scheme was tested against parent-scheme without any reference to what scheme is being used (download_child_p, section 7.) The following fixed it, assumes that the two are different. If you decide the two are the same, then a single scheme as you propose works. If the two are different, then you probably need to propogate tests for the two different schemes throughout the code as I did originally. /* 7. */ if ((u-scheme == parent-scheme) || (u-scheme==SCHEME_HTTP parent-scheme==SCHEME_HTTPS) || (u-scheme==SCHEME_HTTPS parent-scheme==SCHEME_HTTP)) if (!opt.spanhost 0 != strcasecmp (parent-host, u-host)) { DEBUGP ((This is not the same hostname as the parent's (%s and %s).\n, u-host, parent-host)); goto out; } Thomas -- E-Soft Inc. http://www.e-softinc.com Publishers of SecuritySpace http://www.securityspace.com Tel: 1-905-331-2260 Fax: 1-905-331-2504 Tollfree in North America: 1-800-799-4831