Islon Scherer wrote:
> Hi, I'm using wget to recursively download content from a bunch os sites.
> The command line is "wget -x -r -l1 [url]"
> I have a problem with one url:
> http://olhardigital.uol.com.br/ultimas_noticias/1
> If I execute wget with my parameters in this url it gives me lots of 
> "No such file or directory"
> for every file inside the
> 'olhardigital.uol.com.br/produtos/digital_news/' directory
> because 'olhardigital.uol.com.br/produtos/digital_news' is a file too
> (html) saved by wget
> previously so it can't create the 'digital_news' directory in the file
> system.
> I can't remove the directory sctructure (-x option) because I have to
> know the url of the downloaded
> files for further processing.
> Is there a way to circunvent the file/dir with the same name problem?
> Or a way to
> retrieve the original url of the file without using the directory
> structure?
>
> Reproduce the problem executing: wget -x -r -l1
> http://olhardigital.uol.com.br/ultimas_noticias/1
>
> Regards.
> Islon Scherer

Adding the -E option seems to skip the problem.


Reply via email to