#317: bibdocfile: replace ligatures when textifying
------------------------+------------------------
Reporter: simko | Owner: jani
Type: enhancement | Status: new
Priority: major | Component: WebSubmit
Version: | Resolution:
Keywords: |
------------------------+------------------------
Comment (by skaplun):
In refextract this code exist to clean and normalize text extracted from
PDF. In particular see:
http://invenio-
software.org/repo/invenio/tree/modules/bibedit/lib/refextract.py#n512
which is used in:
http://invenio-
software.org/repo/invenio/tree/modules/bibedit/lib/refextract.py#n1685
several others filtering are available which could be centralized in the
textification procedure for the benefit of the whole stack.
It might be that some other [http://invenio-software.org/repo/personal
/invenio-chayward/refs/heads Christopher's branch] contains refactored
code that can be directly used for this purpose.
--
Ticket URL: <http://invenio-software.org/ticket/317#comment:6>
Invenio <http://invenio-software.org>