[task #6474] Centralizing word extraction

noreply [Samuele Kaplun] Tue, 26 Feb 2008 17:41:48 +0100

This is an automated notification sent by LCG Savannah.
It relates to:
                task #6474, project CDS Invenio


==============================================================================
 LATEST MODIFICATIONS of task #6474:
==============================================================================

Update of task #6474 (project cdsware):

                Priority:              5 - Normal => 3 - Low                
             Assigned to:                    None => skaplun                


==============================================================================
 OVERVIEW of task #6474:
==============================================================================

URL:
  <http://savannah.cern.ch/task/?6474>

                 Summary: Centralizing word extraction
                 Project: CDS Invenio
            Submitted by: skaplun
            Submitted on: 2008-02-25 08:31
         Should Start On: 2008-02-25 00:00
   Should be Finished on: 2008-02-25 00:00
                Category: None
                Priority: 3 - Low
                  Status: None
                 Privacy: Public
        Percent Complete: 0%
             Assigned to: skaplun
             Open/Closed: Open
         Discussion Lock: Any
                  Effort: 0.00

    _______________________________________________________


BibIndex, BibClassify, RefExtract, BibRank with word ranking, all need to
convert a document into a stream of word via pdf tools. It worth do this only
once and cache the extracted document in a zipped way just next to the
different revision of the document.
some_document.pdf;1
some_document.ps.gz;1
.text_in_some_document;1

A centralized api for this would be needed.



    _______________________________________________________

Carbon-Copy List:

CC Address                          | Comment
------------------------------------+-----------------------------
2195                                | -SUB-




==============================================================================

This item URL is:
  <http://savannah.cern.ch/task/?6474>

_______________________________________________
  Message sent via/by LCG Savannah
  http://savannah.cern.ch/

[task #6474] Centralizing word extraction

Reply via email to