On 17.12.22 20:32, Paolo Dista via Primary discussion list for GNU Wget wrote:
Hello,

I am trying to download all the radio programmes from this page: 
https://www.radiofrance.fr/personnes/gilles-deleuze with:

wget -r -l 1 -H -nd -np -A '*.mp3' -D media.radiofrance-podcast.net \
     https://www.radiofrance.fr/personnes/gilles-deleuze

In vain.

The terminal prints this:

--2022-12-17 20:06:36-- https://www.radiofrance.fr/personnes/gilles-deleuze
Resolving www.radiofrance.fr (www.radiofrance.fr)... 23.210.120.113, 
2a02:26f0:300:192::3658, 2a02:26f0:300:1a7::3658
Connecting to www.radiofrance.fr (www.radiofrance.fr)|23.210.120.113|:443... 
connected.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [text/html]
Saving to: ‘gilles-deleuze.tmp’

gilles-deleuze.tmp [ <=> ] 157.09K --.-KB/s in 0.1s

2022-12-17 20:06:36 (1.21 MB/s) - ‘gilles-deleuze.tmp’ saved [160862]

Removing gilles-deleuze.tmp since it should be rejected.

FINISHED --2022-12-17 20:06:36--
Total wall clock time: 0.3s
Downloaded: 1 files, 157K in 0.1s (1.21 MB/s)

Could you please help me see what I am missing?

As random.coder wrote, the MP3 URLs are embedded in JavaScript, which wget
does not parse, so the recursive crawl finds no links to follow.
But you can still grep the URLs out of the page source.
You can do this in several (very hacky) steps:

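# 1. Fetch the page itself (saved as 'gilles-deleuze')
wget https://www.radiofrance.fr/personnes/gilles-deleuze

# 2. Put each URL on its own line, cut it at the first backslash, keep the mp3 ones
#    (the '\n' in the replacement needs GNU sed)
sed 's/http/\nhttp/g' gilles-deleuze | grep ^http | cut -d'\' -f1 | grep mp3 >mp3.list

# 3. Download everything in the list
wget -i mp3.list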
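
If you want something a bit less fragile, the same extraction can be wrapped
in a small shell function. This is only a sketch: the name extract_mp3_urls
is made up for illustration, it assumes the URLs may be written with
JSON-style escaped slashes ("\/"), and it relies on grep -o, which GNU and BSD
grep support but POSIX does not require. I have not checked it against the
live page.

```shell
#!/bin/sh
# Hypothetical helper: read page source on stdin and print each unique
# http(s) URL that ends in ".mp3", one per line.
extract_mp3_urls() {
    # Turn escaped slashes ("\/") into plain "/" (assumption: the page may
    # embed its URLs inside JSON strings).
    sed 's#\\/#/#g' |
    # -o prints only the matched text; a URL stops at a quote, backslash,
    # space or angle bracket, and must end in ".mp3".
    grep -oE "https?://[^\"'\\ <>]+\.mp3" |
    # Drop duplicates so wget does not fetch the same file twice.
    sort -u
}
```

It would then be used roughly like this:

wget -q -O - https://www.radiofrance.fr/personnes/gilles-deleuze | extract_mp3_urls >mp3.list
wget -i mp3.list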

Regards, Tim
