Hello,
I'm trying to use the --directory-prefix=prefix option for wget on a
Windows system. My prefix has spaces in the path directories. Wget
appears to terminate the path at the first space encountered. In other
words, if my prefix is: c:/my prefix/ then wget copies files to c:/my.
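If the Windows shell is splitting the argument at the space before wget
ever sees it, quoting the whole option usually helps. A minimal sketch;
the URL is just a placeholder:

  wget --directory-prefix="c:/my prefix/" http://example.com/file.html

The short form -P "c:/my prefix/" behaves the same way.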
Frank McCown wrote:
It would be great if wget had a way of limiting the amount of time it
took to run, so it won't accidentally hammer on someone's web server for
an indefinite amount of time. I often need to let a crawler run for a
while on an unknown site, and I have to manually kill wget.
I think that a combination of --limit-rate and --wait parameters makes
this type of enhancement unnecessary, given that his stated purpose was
to not "hammer" a particular site.
Mark Post
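A sketch of that combination (the numbers are arbitrary; tune them for
the site):

  wget --limit-rate=20k --wait=2 -r http://example.com/

This caps transfer at roughly 20 KB/s and sleeps two seconds between
retrievals, so even a long crawl stays gentle on the server.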
From what I understand, killing wget processes may result in resource
leaks. If wget could self-terminate, it could do so more gracefully and
note the termination in its log file. Plus, writing scripts to launch
and kill processes is certainly not trivial.
Frank
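For what it's worth, the wrapper can be a one-liner on systems that ship
GNU coreutils' timeout(1) (an assumption; it isn't available everywhere):

  timeout 2h wget -r -o crawl.log http://example.com/

wget is still killed by a signal rather than stopping gracefully,
though, so this doesn't address the log-file point.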
Using these parameters will only slow the process of crawling a site.
The purpose is not only to avoid wasting a web server's resources but
also to quit when it appears the site is taking too long to download.
Since wget doesn't appear to contain any logic for detecting crawler
traps, a timed shutdown would at least bound how long a crawl can run.
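Until something like that exists, the current options can at least
bound a crawl by depth and by volume, if not by wall-clock time; a
sketch with arbitrary limits:

  wget -r -l 5 -Q 100m http://example.com/

Here -l caps the recursion depth and -Q makes wget stop retrieving once
the download quota is exceeded.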
I'm trying to use wget to do the following:
1. retrieve a single page
2. convert the links in the retrieved page to their full, absolute
addresses.
3. save the page with a file name that I specify
I thought this would do it:
wget -k -O test.html http://www.google.com
However, it doesn't convert the links.
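If the culprit is -k interacting badly with -O (some wget versions skip
link conversion when -O is given, because conversion is keyed to each
file's natural local name), a workaround sketch is to let wget pick the
name and rename afterwards:

  wget -k http://www.google.com/
  mv index.html test.html

This assumes the page is saved as index.html, which is what wget does
for a bare / URL.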
Frank McCown <[EMAIL PROTECTED]> writes:
> From what I understand, killing wget processes may result in resource
> leaks.
Really? What kind of resource leaks are you referring to? Wget does
not create temporary files, nor does it allocate external resources
other than dynamically allocated memory and network connections, both
of which will be released by the operating system when the process exits.
Hello. I would like to have a database within wget. The database
would let wget know what it has downloaded earlier. Wget could
download only new and changed files, and could continue the download
without keeping the old downloads on my disk.
The database would also be accessible to other programs.
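Part of this already exists as wget's timestamping: with -N it
re-downloads only files newer than the local copies, e.g.

  wget -N -r http://example.com/

It is not the database described above, though; -N needs the old files
still on disk to compare against.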
> 1. retrieve a single page
That worked.
> 2. convert the links in the retrieved page to their full, absolute
> addresses.
My "wget -h" output (Wget 1.10.2a1) says:
-k, --convert-links make links in downloaded HTML point to local files.
Wget 1.9.1e says:
-k, --convert-links
Steven M. Schweda wrote:
Not anything about converting relative links to absolute. I don't see
an option to do this automatically.
From the wget man page for --convert-links:
...if a linked file was downloaded, the link will refer to its local
name; if it was not downloaded, the link will refer to its full
Internet address rather than presenting a broken link.
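If the manual is right about that, fetching just the single page with -k
should leave every link absolute, since nothing else was downloaded. A
quick way to check (URL and pattern are only illustrative):

  wget -k http://www.google.com/
  grep -o 'href="[^"]*"' index.html | head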