Bug#309264: O: unhtml -- Remove the markup tags from an HTML file
retitle 309264 ITA: unhtml -- Remove the markup tags from an HTML file owner 309264 ! thanks I intend to adopt unhtml. Kind regards, Philipp Kern Debian Developer -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]
Bug#309264: O: unhtml -- Remove the markup tags from an HTML file
On Fri, May 20, 2005 at 12:29:11PM +0200, Philipp Kern wrote: retitle 309264 ITA: unhtml -- Remove the markup tags from an HTML file owner 309264 ! thanks I intend to adopt unhtml. I'm wondering, is unhtml really such a useful package to have in the archive? The package description really sounds like it's a perl-oneliner, indeed, the .deb's are 11kB in size. What is the added value of this package to Debian? Also, why can't this script/program be included in some other package that does html-processing? Or, what about lynx -dump -stdin with some extra options to drop the footnotes on links etc? It'll also reformat for certain textwidths etc, making it IMHO much more useful. --Jeroen -- Jeroen van Wolffelaar [EMAIL PROTECTED] http://jeroen.A-Eskwadraat.nl -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]
Bug#309264: O: unhtml -- Remove the markup tags from an HTML file
On 20.05.2005, at 12:49, Jeroen van Wolffelaar wrote: What is the added value of this package to Debian? Also, why can't this script/program be included in some other package that does html-processing? Or, what about lynx -dump -stdin with some extra options to drop the footnotes on links etc? It'll also reformat for certain textwidths etc, making it IMHO much more useful. At least it strips only HTML tags, not all XML tags it encounters in a stream. And it does not strip the contents within script / tags. If you count this as an additional value. This program seems to be very lightweight, without a interpreter overhead. If you only need the HTML tags stripped without a pretty formatting like the output of a Lynx dump this would be for you. Kind regards, Philipp Kern -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]
Bug#309264: O: unhtml -- Remove the markup tags from an HTML file
Package: wnpp Severity: normal I intend to orphan the unhtml package. It's been over a year since I used the package, and have no further time for it nor interest in it. The package description is: This program removes all HTML tags from an HTML file and directs its output to stdout. It can be used as a filter for getting the text content of an HTML file without the need of firing up a web browser. -- System Information: Debian Release: 3.1 APT prefers unstable APT policy: (500, 'unstable'), (500, 'testing') Architecture: i386 (i686) Shell: /bin/sh linked to /bin/bash Kernel: Linux 2.6.8-2-686 Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968) -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]