[Bug-wget] [bug #49458] Please search txt files named ".ram" for links

2016-11-23 Thread Tim Ruehsen
Update of bug #49458 (project wget):

  Status:None => Wont Fix   
 Open/Closed:Open => Closed 


___

Reply to this item at:

  

___
  Message sent via/by Savannah
  http://savannah.gnu.org/




[Bug-wget] [bug #49458] Please search txt files named ".ram" for links

2016-10-28 Thread Dale Worley
Follow-up Comment #4, bug #49458 (project wget):

Really, this opens up the can of worms of making which files are scanned for
URLs configurable.

Ideally, the user could supply a list of extensions/MIME types and provide
corresponding programs that would extract the URLs from the files.

But you would also want to implement --convert-links for these files, and that
would be considerably more complicated.


___

Reply to this item at:

  

___
  Message sent via/by Savannah
  http://savannah.gnu.org/




[Bug-wget] [bug #49458] Please search txt files named ".ram" for links

2016-10-28 Thread Tim Ruehsen
Update of bug #49458 (project wget):

 Release:  1.17.1 => 1.18   
Operating System:   Microsoft Windows => None   

___

Follow-up Comment #3:

This URL(s) within the .ram/.ra file likely points to some audio stream - it
might be a radio station with changing (and endless !) content.
'Downloading' the contents could easily lead to a 'hanging' wget. That is
perhaps not what you want.
--spider could at least 'check' that URL.
And the ICY-Metadata HTTP header makes a difference since the remote side
likely is a shoutcast server.

BTW, there are lot's of other file formats that contain URLs.
E.g. Wget2 reads URLs from sitemaps and sitemap catalogs (compressed or
uncompressed), from Atom and RSS feeds.
There is also examples/getstream.c that 'downloads' URLs (streams) from .ram,
.m3u, .wax, .asx, .pls, .xspf (audio/x-mpegurl, audio/x-pn-realaudio,
application/pls+xml, audio/x-scpls, application/xspf+xml).


___

Reply to this item at:

  

___
  Message sent via/by Savannah
  http://savannah.gnu.org/




[Bug-wget] [bug #49458] Please search txt files named ".ram" for links

2016-10-27 Thread Ander Juaristi
Follow-up Comment #1, bug #49458 (project wget):

I don't exactly know what you're asking, but I don't think it would be a good
fit for wget given that you could easily automate it with a shell script.

The closest enhancement I can come across is to make -i accept multiple files,
which, to the best of my knowledge, does not yet.

E.g. wget -i file1.ram,file2.ram,foo.txt,bar.html

What do you all think?

___

Reply to this item at:

  

___
  Message sent via/by Savannah
  http://savannah.gnu.org/




[Bug-wget] [bug #49458] Please search txt files named ".ram" for links

2016-10-27 Thread Patrick Alan Marshall
Follow-up Comment #2, bug #49458 (project wget):

Sorry, I didn't really word it that well. The idea was for it to scan .ram
text files for resources when mirroring a site using the -m flag, as right now
I have to make two scans when mirroring a site that uses .ram assets to a
WARC.

___

Reply to this item at:

  

___
  Message sent via/by Savannah
  http://savannah.gnu.org/




[Bug-wget] [bug #49458] Please search txt files named ".ram" for links

2016-10-27 Thread anonymous
URL:
  

 Summary: Please search txt files named ".ram" for links
 Project: GNU Wget
Submitted by: None
Submitted on: Thu 27 Oct 2016 04:11:13 PM UTC
Category: Feature Request
Severity: 3 - Normal
Priority: 5 - Normal
  Status: None
 Privacy: Public
 Assigned to: None
 Originator Name: powerKitten
Originator Email: wertc...@gmail.com
 Open/Closed: Open
 Discussion Lock: Any
 Release: 1.17.1
Operating System: Microsoft Windows
 Reproducibility: Every Time
   Fixed Release: None
 Planned Release: None
  Regression: No
   Work Required: None
  Patch Included: No

___

Details:

The .ram "format" is simply an ANSI encoded text file. It consists of links to
RealAudio resources. I think that it would benefit from this feature, as
currently I have to go into each .ram file, copy the links into a list of
urls, and then fetch them using the -i flag. I've attached a sample .ram file
which can also be found at the (archived) address:
http://web.archive.org/web/20161027160920/http://www.theorderoftime.com/art/music/ra-popular/theimportance.ram
or the original address:
http://www.theorderoftime.com/art/music/ra-popular/theimportance.ram



___

File Attachments:


---
Date: Thu 27 Oct 2016 04:11:13 PM UTC  Name: theimportance.ram  Size: 64B  
By: None



___

Reply to this item at:

  

___
  Message sent via/by Savannah
  http://savannah.gnu.org/