Re: Instead of saving text files i need as html

2006-06-09 Thread Sion Arrowsmith
Tim Chase  [EMAIL PROTECTED] wrote:
> [ ... ]
>> urllib.urlretrieve(lines.strip('/n'), 'c:\\temp\\' \
>>     + outfilename.strip('\n')[7:] + '.html')
> [ ... ]  I'm not sure what the odd
> slicing is for, but I'll presume the OP knows what they're doing.

It's taking the "http://" off the front of the URL.

>>> len("http://")
7
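
As a small illustration of that slice, here is a sketch of the file name derivation used in the quoted code. The helper name url_to_filename is made up for this example, and it assumes every line starts with "http://"; an "https://" URL is one character longer, so the same slice would leave a stray leading '-'.

```python
def url_to_filename(line, extension='.html'):
    """Turn one line of the URL list into a file name, as the quoted code does."""
    # Hypothetical helper, not part of the original script.
    name = line.replace('/', '-')   # 'http://google.com\n' -> 'http:--google.com\n'
    name = name.strip('\n')         # drop the trailing newline
    return name[7:] + extension     # skip the 7 characters of 'http:--'


print(url_to_filename('http://google.com\n'))   # google.com.html
```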

-- 
\S -- [EMAIL PROTECTED] -- http://www.chaos.org.uk/~sion/
  ___  |  Frankly I have no feelings towards penguins one way or the other
  \X/  |-- Arthur C. Clarke
   her nu becomeþ se bera eadward ofdun hlæddre heafdes bæce bump bump bump
-- 
http://mail.python.org/mailman/listinfo/python-list

Instead of saving text files i need as html

2006-06-08 Thread Shani
I have the following code which takes a list of urls
"http://google.com", without the quotes of course, and then saves their
source code as a text file. I want to alter the code so that for the
list of URLs an html file is saved.

-begin-
import urllib
urlfile = open(r'c:\temp\url.txt', 'r')
for lines in urlfile:
    try:
        outfilename = lines.replace('/', '-')
        urllib.urlretrieve(lines.strip('/n'), 'c:\\temp\\' \
            + outfilename.strip('\n')[7:] + '.txt')
    except:
        pass
-end-



Re: Instead of saving text files i need as html

2006-06-08 Thread Larry Bates
Then just write HTML around your list.  I would guess
you want them inside a table.  Just write appropriate
HTML tags before/after the urls.  If you want the URLs
to be clickable, make them into <a href="url">url</a> lines.

-Larry Bates

Shani wrote:
> I have the following code which takes a list of urls
> "http://google.com", without the quotes of course, and then saves their
> source code as a text file. I want to alter the code so that for the
> list of URLs an html file is saved.
>
> -begin-
> import urllib
> urlfile = open(r'c:\temp\url.txt', 'r')
> for lines in urlfile:
>     try:
>         outfilename = lines.replace('/', '-')
>         urllib.urlretrieve(lines.strip('/n'), 'c:\\temp\\' \
>             + outfilename.strip('\n')[7:] + '.txt')
>     except:
>         pass
> -end-
 


Re: Instead of saving text files i need as html

2006-06-08 Thread 3c273
Shani [EMAIL PROTECTED] wrote in message
news:[EMAIL PROTECTED]
> I have the following code which takes a list of urls
> "http://google.com", without the quotes of course, and then saves their
> source code as a text file. I want to alter the code so that for the
> list of URLs an html file is saved.
>
> -begin-
> import urllib
> urlfile = open(r'c:\temp\url.txt', 'r')
> for lines in urlfile:
>     try:
>         outfilename = lines.replace('/', '-')
>         urllib.urlretrieve(lines.strip('/n'), 'c:\\temp\\' \
>             + outfilename.strip('\n')[7:] + '.txt')
>     except:
>         pass
> -end-


Is this what you mean?

-begin-
import urllib
urlfile = open(r'c:\temp\url.txt', 'r')
for lines in urlfile:
    try:
        outfilename = lines.replace('/', '-')
        urllib.urlretrieve(lines.strip('/n'), 'c:\\temp\\' \
            + outfilename.strip('\n')[7:] + '.html')
    except:
        pass
-end-
Louis
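
For readers on Python 3, where urlretrieve lives in urllib.request, the same idea might look roughly like the sketch below. The function name save_pages, the out_dir argument and the retrieve parameter are inventions for this example (the last one only so the loop can be tried without network access); this is not the code from the thread.

```python
import os
from urllib.request import urlretrieve


def save_pages(url_lines, out_dir, retrieve=urlretrieve):
    """Download each URL in url_lines into out_dir as an .html file."""
    # Hypothetical helper: names and parameters are assumptions.
    saved = []
    for line in url_lines:
        url = line.strip()          # removes the newline and other whitespace
        if not url:
            continue                # skip blank lines in the list
        # Drop the scheme ('http://') and make the rest safe as a file name.
        name = url.split('://', 1)[-1].replace('/', '-') + '.html'
        target = os.path.join(out_dir, name)
        try:
            retrieve(url, target)
        except (OSError, ValueError) as exc:
            # Report the failure instead of silently passing over it.
            print('skipping %s: %s' % (url, exc))
            continue
        saved.append(target)
    return saved
```

Called as save_pages(open(r'c:\temp\url.txt'), r'c:\temp'), it would write one .html file per URL and return the list of paths it saved.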




Re: Instead of saving text files i need as html

2006-06-08 Thread 3c273
Shani [EMAIL PROTECTED] wrote in message
news:[EMAIL PROTECTED]
> I have the following code which takes a list of urls
> "http://google.com", without the quotes of course, and then saves their
> source code as a text file. I want to alter the code so that for the
> list of URLs an html file is saved.
>
> -begin-
> import urllib
> urlfile = open(r'c:\temp\url.txt', 'r')
> for lines in urlfile:
>     try:
>         outfilename = lines.replace('/', '-')
>         urllib.urlretrieve(lines.strip('/n'), 'c:\\temp\\' \
>             + outfilename.strip('\n')[7:] + '.txt')
>     except:
>         pass
> -end-

Or is this what you mean?
-begin-
import urllib
urlfile = open('c:\\temp\\url.txt', 'r')
newurlfile = open('c:\\temp\\newurls.html', 'w')
newurlfile.write('<html> \n<body>\n')
for lines in urlfile:
    try:
        if lines == '\n':
            pass
        else:
            lines = '<a href="' + lines.strip() + '">'\
                + lines.strip() + '</a>' + '<br>\n'
            newurlfile.write(lines)
    except:
        pass
newurlfile.write('</body> \n</html>')
urlfile.close()
newurlfile.close()
-end-
Louis




Re: Instead of saving text files i need as html

2006-06-08 Thread 3c273
3c273 [EMAIL PROTECTED] wrote in message
news:[EMAIL PROTECTED]
> Or is this what you mean?
> -begin-
> import urllib
> urlfile = open('c:\\temp\\url.txt', 'r')
> newurlfile = open('c:\\temp\\newurls.html', 'w')
> newurlfile.write('<html> \n<body>\n')
> for lines in urlfile:
>     try:
>         if lines == '\n':
>             pass
>         else:
>             lines = '<a href="' + lines.strip() + '">'\
>                 + lines.strip() + '</a>' + '<br>\n'
>             newurlfile.write(lines)
>     except:
>         pass
> newurlfile.write('</body> \n</html>')
> urlfile.close()
> newurlfile.close()
> -end-
> Louis

Oops, I guess we don't need import urllib anymore.
Louis
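
A variant of the link-page idea that needs no urllib at all could be sketched as follows. The function name build_link_page is made up for this example, and escaping the URLs with html.escape (Python 3 standard library) is an addition that the code in the thread does not have.

```python
import html


def build_link_page(urls):
    """Return an HTML page with one clickable link per non-blank URL."""
    # Hypothetical helper, written for Python 3.
    lines = ['<html>', '<body>']
    for raw in urls:
        url = raw.strip()
        if not url:
            continue                        # ignore blank lines
        safe = html.escape(url, quote=True) # protect &, <, > and quotes
        lines.append('<a href="%s">%s</a><br>' % (safe, safe))
    lines.extend(['</body>', '</html>'])
    return '\n'.join(lines) + '\n'


print(build_link_page(['http://google.com\n', '\n']))
```

Building the page as a string first keeps the file handling separate: the result can be written with a single write() call to whatever output file is wanted.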




Re: Instead of saving text files i need as html

2006-06-08 Thread bruno at modulix
Shani wrote:
> I have the following code which takes a list of urls
> "http://google.com", without the quotes of course, and then saves their
> source code as a text file. I want to alter the code so that for the
> list of URLs an html file is saved.

What you write in a text file is up to you  - and AFAICT, HTML is still
a text format.


-- 
bruno desthuilliers
python -c "print '@'.join(['.'.join([w[::-1] for w in p.split('.')]) for
p in '[EMAIL PROTECTED]'.split('@')])"


Re: Instead of saving text files i need as html

2006-06-08 Thread Tim Chase
> Is this what you mean?
>
> -begin-
> import urllib
> urlfile = open(r'c:\temp\url.txt', 'r')
> for lines in urlfile:
>     try:
>         outfilename = lines.replace('/', '-')
>         urllib.urlretrieve(lines.strip('/n'), 'c:\\temp\\' \
>             + outfilename.strip('\n')[7:] + '.html')
>     except:
>         pass
> -end-

[laughs]  I suspect the urlretrieve line should contain 
strip('\n') instead of strip('/n'), but otherwise, the 
original code looked pretty kosher.  I'm not sure what the odd 
slicing is for, but I'll presume the OP knows what they're doing.
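
To make the difference concrete, here is a quick sketch; the example URL is invented. str.strip() treats its argument as a set of characters to remove from both ends, so '/n' means "slashes and the letter n", not a newline.

```python
line = 'http://example.org/main\n'

# '/n' is the two characters '/' and 'n'; the line ends in a newline,
# which is in neither, so nothing is removed at all.
kept_newline = line.strip('/n')

# '\n' is the newline character, which is what the code wants to drop.
clean = line.strip('\n')

# Without a trailing newline, strip('/n') eats the final 'n' of the URL.
damaged = 'http://example.org/main'.strip('/n')

print(repr(kept_newline))   # 'http://example.org/main\n'
print(repr(clean))          # 'http://example.org/main'
print(repr(damaged))        # 'http://example.org/mai'
```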

While not a python solution, the standard *nix tool would be 
either wget or curl:

bash> wget -i listofurls.txt

which is freely available with the Cygwin suite of GNU tools for 
Win32 platforms.

-tkc


