CVS Commit Overview for 2007-03-29
==================================

2007-03-29  Samuele Kaplun <[email protected]>

        * modules/webstyle/lib/webinterface_tests.py: Added many more
        attribute to FakeApache in order to satisfy mod_python 3.3.

2007-03-29  Jerome Caffaro <[email protected]>

        * modules/bibformat/lib/bibformat_config.py,
        modules/bibformat/lib/bibformat_engine.py: Improved error messages.

2007-03-29  Samuele Kaplun <[email protected]>

        * modules/websubmit/lib/file.py: Cleaned indentation and added a
        function for adjusting uploaded filenames in order to have them
        unique.

2007-03-29  Samuele Kaplun <[email protected]>

        * modules/websubmit/lib/websubmit_engine.py: Added two checks in
        order to make websumbit happy with latest mod_python.

2007-03-29  Nicholas Robinson <[email protected]>

        * modules/bibedit/lib/refextract.py: + Fixed a bug when looking for
        the end of the references section. Previously, there was a regexp
        searching for a certain pattern of digits (something that occurs
        sometimes when the figures/tables are converted to text).
        Unfotunately, in certain cases, the regexp search was horribly long
        - infinite, maybe! It was fixed by removing the regexp pattern and
        using some methods of the string object, such as replace, is_digit,
        etc. Additionally, this also seems to have improved the recognition
        of the end of a reference section.  + To identify different pages
        of a PDF document, refextract was looking for  a page-break
        character (\\f) on its own line because pdftotext always put this
        character in a line of its own. However, in a new version of this
        tool (3.01 onwards?), this character is not necessarily in its own
        line. This caused some problems when searching for headers/footers,
        etc. Therefore, when text from pdftotext is read-in by refextract,
        it now adds this char into its own line, should it come at the
        start of a line; + Added a new numeration-recognition pattern (and
        subsequent handling code) that is used when transforming a tagged
        citation line into MARC XML. The new pattern looks for tagged
        numeration, and is applied immediately after a title + numeration
        pattern has been applied. This handles IBIDs that do not actually
        use the word "IBID". E.g.: <cds.TITLE>J. Phys. A</cds.TITLE> :
        <cds.VOL>31</cds.VOL> <cds.YR>(1998) </cds.YR>
        <cds.PG>2391</cds.PG>; : <cds.VOL>32</cds.VOL> <cds.YR>(1999)
        </cds.YR> <cds.PG>6119</cds.PG>.  The 2nd group of numeration
        clearly belongs with the title - the author has simply missed out
        the title. Previously reference this would have been missed.  Now
        however, it will be recognised;

-- 
CDS Invenio Developers <[email protected]>


Reply via email to