Bug#309264: O: unhtml -- Remove the markup tags from an HTML file

2005-05-20 Thread Philipp Kern
retitle 309264 ITA: unhtml -- Remove the markup tags from an HTML file
owner 309264 !
thanks
I intend to adopt unhtml.
Kind regards,
Philipp Kern
Debian Developer
--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]


Bug#309264: O: unhtml -- Remove the markup tags from an HTML file

2005-05-20 Thread Jeroen van Wolffelaar
On Fri, May 20, 2005 at 12:29:11PM +0200, Philipp Kern wrote:
> retitle 309264 ITA: unhtml -- Remove the markup tags from an HTML file
> owner 309264 !
> thanks
>
> I intend to adopt unhtml.

I'm wondering: is unhtml really such a useful package to have in the
archive? The package description makes it sound like a Perl
one-liner; indeed, the .debs are only 11kB in size.

What is the added value of this package to Debian? Also, why can't this
script/program be included in some other package that does
html-processing? Or, what about lynx -dump -stdin with some extra
options to drop the footnotes on links etc? It'll also reformat for
certain textwidths etc, making it IMHO much more useful.

--Jeroen

-- 
Jeroen van Wolffelaar
[EMAIL PROTECTED]
http://jeroen.A-Eskwadraat.nl





Bug#309264: O: unhtml -- Remove the markup tags from an HTML file

2005-05-20 Thread Philipp Kern
On 20.05.2005, at 12:49, Jeroen van Wolffelaar wrote:
> What is the added value of this package to Debian? Also, why can't this
> script/program be included in some other package that does
> html-processing? Or, what about lynx -dump -stdin with some extra
> options to drop the footnotes on links etc? It'll also reformat for
> certain textwidths etc, making it IMHO much more useful.

At least it strips only HTML tags, not all XML tags it encounters in
a stream. And it does not strip the contents within script / tags,
if you count this as an additional value. This program seems to be
very lightweight, without any interpreter overhead. If you only need
the HTML tags stripped, without pretty formatting like the output of
a Lynx dump, this would be for you.

Kind regards,
Philipp Kern


Bug#309264: O: unhtml -- Remove the markup tags from an HTML file

2005-05-15 Thread Al Stone
Package: wnpp
Severity: normal

I intend to orphan the unhtml package.  It's been over a year since
I used the package, and I have neither the time for it nor any
interest in it.

The package description is:
 This program removes all HTML tags from an HTML file and directs its
 output to stdout.  It can be used as a filter for getting the text
 content of an HTML file without the need of firing up a web browser.
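
As a rough illustration of the filter behaviour described above (this is
not unhtml's actual source, which does not appear in this thread), a
tag-stripping function might be sketched in Python as follows. The names
TagStripper and strip_tags are hypothetical, and the sketch leans on the
standard library's html.parser, which is an assumption made here for
brevity rather than the approach unhtml itself takes.

```python
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Hypothetical helper: keep the text between tags, drop the tags."""

    def __init__(self):
        # convert_charrefs=True turns entities such as &amp; into text.
        super().__init__(convert_charrefs=True)
        self.chunks = []

    def handle_data(self, data):
        # Called for every run of text outside of a tag.
        self.chunks.append(data)


def strip_tags(html):
    """Return the text content of an HTML string with all tags removed."""
    parser = TagStripper()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.chunks)


# Small demo on an inline string.
print(strip_tags("<p>Hello <b>world</b></p>"))
```

To use this as a stdin-to-stdout filter in the way the description
suggests, the demo line could be replaced with
sys.stdout.write(strip_tags(sys.stdin.read())) after importing sys.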

-- System Information:
Debian Release: 3.1
  APT prefers unstable
  APT policy: (500, 'unstable'), (500, 'testing')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.6.8-2-686
Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)

