Your message dated Sun, 23 May 2010 15:47:08 +0000
with message-id <[email protected]>
and subject line Bug#582199: fixed in html2text 1.3.2a-15
has caused the Debian Bug report #582199,
regarding attribute names with numerics cause html2text to output remaining attributes
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)


-- 
582199: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=582199
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: html2text
Version: 1.3.2a-14
Severity: normal
Tags: upstream patch

If the <html> element has an attribute name that contains a numeric,
html2text stops processing the element and the remaining attributes show
up in the output (debug shows that it treats the remainder as PCDATA).

Example input misparsed at the x2:
<html xmlns:v="urn:schemas-microsoft-com:vml"
xmlns:o="urn:schemas-microsoft-com:office:office"
xmlns:w="urn:schemas-microsoft-com:office:word"
xmlns:x="urn:schemas-microsoft-com:office:excel"
xmlns:p="urn:schemas-microsoft-com:office:powerpoint"
xmlns:a="urn:schemas-microsoft-com:office:access"
xmlns:mt="http://schemas.microsoft.com/sharepoint/soap/meetings/"
xmlns:x2="http://schemas.microsoft.com/office/excel/2003/xml"
xmlns:ppda="http://www.passport.com/NameSpace.xsd"
><body>test</body></html>

Output:
="http://schemas.microsoft.com/office/excel/2003/xml" xmlns:ppda="http://
www.passport.com/NameSpace.xsd" >

test


Patch:
--- HTMLControl.C.orig  2003-11-23 04:05:29.000000000 -0700
+++ HTMLControl.C       2010-05-18 19:33:54.000000000 -0600
@@ -372,7 +372,7 @@
             attribute.first = c;
             for (;;) {
               c = get_char();
-              if (!isalpha(c) && c != '-' && c != '_' && c != ':') break;
+              if (!isalnum(c) && c != '-' && c != '_' && c != ':') break;
              // Same as in line 352 - Arno
               attribute.first += c;
             }
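
The effect of the one-character change can be illustrated outside html2text
with a small standalone sketch. It is not the html2text source: the helper
name scan_attribute_name and its allow_digits flag are hypothetical, and the
loop reads from a string instead of calling get_char(). It only mirrors the
break condition from the patch above.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper (not part of html2text): collects an attribute name
// starting at `pos`, stopping at the first character that is not allowed
// in a name. With allow_digits == false it mimics the old isalpha() test;
// with allow_digits == true it mimics the patched isalnum() test.
static std::string scan_attribute_name(const std::string &input,
                                       std::size_t pos,
                                       bool allow_digits)
{
  std::string name;
  for (; pos < input.size(); ++pos) {
    // Cast to unsigned char before calling the <cctype> functions,
    // which have undefined behaviour for negative char values.
    const unsigned char c = static_cast<unsigned char>(input[pos]);
    const bool name_char = allow_digits ? (std::isalnum(c) != 0)
                                        : (std::isalpha(c) != 0);
    if (!name_char && c != '-' && c != '_' && c != ':') break;
    name += static_cast<char>(c);
  }
  return name;
}
```

Called on the text xmlns:x2="..." with allow_digits set to false, the scan
stops at the 2 and yields xmlns:x, so everything from the digit onward is left
for the caller to handle. With allow_digits set to true it yields xmlns:x2 and
stops at the = sign as expected.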


-- System Information:
Debian Release: squeeze/sid
  APT prefers oldstable
  APT policy: (500, 'oldstable'), (500, 'testing'), (500, 'stable')
Architecture: i386 (i686)

Kernel: Linux 2.6.30-2-686 (SMP w/4 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages html2text depends on:
ii  libc6                         2.10.2-6   Embedded GNU C Library: Shared lib
ii  libgcc1                       1:4.4.2-9  GCC support library
ii  libstdc++6                    4.4.2-9    The GNU Standard C++ Library v3

html2text recommends no packages.

Versions of packages html2text suggests:
ii  curl                          7.20.0-2   Get a file from an HTTP, HTTPS or 
ii  wget                          1.12-1.1   retrieves files from the web

-- no debconf information

--- End Message ---
--- Begin Message ---
Source: html2text
Source-Version: 1.3.2a-15

We believe that the bug you reported is fixed in the latest version of
html2text, which is due to be installed in the Debian FTP archive:

html2text_1.3.2a-15.diff.gz
  to main/h/html2text/html2text_1.3.2a-15.diff.gz
html2text_1.3.2a-15.dsc
  to main/h/html2text/html2text_1.3.2a-15.dsc
html2text_1.3.2a-15_i386.deb
  to main/h/html2text/html2text_1.3.2a-15_i386.deb



A summary of the changes between this version and the previous one is
attached.

Thank you for reporting the bug, which will now be closed.  If you
have further comments please address them to [email protected],
and the maintainer will reopen the bug report if appropriate.

Debian distribution maintenance software
pp.
Eugene V. Lyubimkin <[email protected]> (supplier of updated html2text package)

(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing [email protected])


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Format: 1.8
Date: Sun, 23 May 2010 18:27:27 +0300
Source: html2text
Binary: html2text
Architecture: source i386
Version: 1.3.2a-15
Distribution: unstable
Urgency: low
Maintainer: Eugene V. Lyubimkin <[email protected]>
Changed-By: Eugene V. Lyubimkin <[email protected]>
Description: 
 html2text  - advanced HTML to text converter
Closes: 576033 582199
Changes: 
 html2text (1.3.2a-15) unstable; urgency=low
 .
   * debian/copyright:
     - Applied formatting improvements patch from Jari Aalto. (Closes: #576033)
   * debian/control:
     - Bumped 'Standards-Version' to 3.8.4, no changes needed.
     - Updated my mail address.
     - Added '{misc:Depends}' to dependencies.
   * debian/patches:
     - 510-disable-backspaces.patch: refreshed.
     - 550-skip-numbers-in-html-tag-attributes.patch: new patch by Eric Krohn.
       (Closes: #582199)
   * debian/README.source:
     - Added with quilt usage as content.
   * debian/tests:
     - Added a test for new patch.
     - Echo test case names while running.
Checksums-Sha1: 
 e1c0e90fe1eb8078b1ce2a169d67f90605d74c98 1191 html2text_1.3.2a-15.dsc
 15ba3a1859371b94ac5657238c876b553ca2eb5b 67783 html2text_1.3.2a-15.diff.gz
 4d6bfcfb2a1f95d219f748e9f1594b07fff9842c 100934 html2text_1.3.2a-15_i386.deb
Checksums-Sha256: 
 ff6968d5a55bd91f300d4c43cddab5f8bf9d5dbb926b82a4fe883eac54c252ac 1191 html2text_1.3.2a-15.dsc
 05ec73b19714b05ec87ca298ad7cf900e3e04563facaa09b899d9faf9127d5f5 67783 html2text_1.3.2a-15.diff.gz
 02b3313a8ae2efea7dc12e2469e3cf170eb2a30d18aaf3e9d7524ae09f164323 100934 html2text_1.3.2a-15_i386.deb
Files: 
 b45ca91ef81863121afe33f836e13ea1 1191 web optional html2text_1.3.2a-15.dsc
 72b603e4cc264aaac53267d037dab7ef 67783 web optional html2text_1.3.2a-15.diff.gz
 c2c26fe70d8e18a94207d1017f6f0720 100934 web optional html2text_1.3.2a-15_i386.deb

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)

iEYEARECAAYFAkv5SfkACgkQchorMMFUmYzZegCeIGx448nC1FpLe5KiY3gyV6MI
cLEAn3KrvBcSLVlMzhc13KepCXyYNzjs
=W++j
-----END PGP SIGNATURE-----



--- End Message ---
