https://bugzilla.redhat.com/show_bug.cgi?id=2244406

            Bug ID: 2244406
           Summary: Review Request: python-RTFDE - A library for
                    extracting HTML content from RTF encapsulated HTML
           Product: Fedora
           Version: rawhide
          Hardware: All
                OS: Linux
            Status: NEW
         Component: Package Review
          Severity: medium
          Priority: medium
          Assignee: [email protected]
          Reporter: [email protected]
        QA Contact: [email protected]
                CC: [email protected]
  Target Milestone: ---
    Classification: Fedora



Spec URL:
https://download.copr.fedorainfracloud.org/results/gui1ty/extract-msg/fedora-rawhide-x86_64/06529620-python-RTFDE/python-RTFDE.spec
SRPM URL:
https://download.copr.fedorainfracloud.org/results/gui1ty/extract-msg/fedora-rawhide-x86_64/06529620-python-RTFDE/python-RTFDE-0.1.0-1.20231015git66780b8.fc40.src.rpm

Description:
RTFDE: RTF De-Encapsulator

A python3 library for extracting encapsulated HTML & plain text content
from the RTF bodies of .msg files.

De-encapsulation enables previously encapsulated HTML and plain text
content to be extracted and rendered as HTML and plain text instead of
the encapsulating RTF content. After de-encapsulation, the HTML and
plain text should differ only minimally from the original HTML or plain
text content.

Features

 - De-encapsulate HTML from RTF encapsulated HTML
 - De-encapsulate plain text from RTF encapsulated text

Known Issues

 - This library _fully_ unquotes text it de-encapsulates because it does
 not know which text was quoted in the RTF conversion process and which
 text was quoted in the original html/text. So, for instance escaped
 Quoted-Printable text will be returned un-escaped.
 - This library currently can't combine attachments from a .MSG Message
 object with the de-encapsulated HTML. This is mostly because I could
 not get a good set of examples of encapsulated HTML which had
 attachment objects that needed to be integrated back into the body of
 the HTML.

Anti-Features (I don't intend to have this library do this.)

 - Extract plain text from RTF encapsulated HTML. If you want this,
 then you will have to parse the HTML using another library.

Fedora Account System Username: gui1ty

Copr Build:
https://copr.fedorainfracloud.org/coprs/gui1ty/extract-msg/build/6529620/


-- 
You are receiving this mail because:
You are on the CC list for the bug.
You are always notified about changes to this product and component
https://bugzilla.redhat.com/show_bug.cgi?id=2244406

Report this comment as SPAM: 
https://bugzilla.redhat.com/enter_bug.cgi?product=Bugzilla&format=report-spam&short_desc=Report%20of%20Bug%202244406%23c0
_______________________________________________
package-review mailing list -- [email protected]
To unsubscribe send an email to [email protected]
Fedora Code of Conduct: 
https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/[email protected]
Do not reply to spam, report it: 
https://pagure.io/fedora-infrastructure/new_issue

Reply via email to