This is an automated notification sent by LCG Savannah.
It relates to:
task #6474, project CDS Invenio
==============================================================================
LATEST MODIFICATIONS of task #6474:
==============================================================================
Update of task #6474 (project cdsware):
Priority: 5 - Normal => 3 - Low
Assigned to: None => skaplun
==============================================================================
OVERVIEW of task #6474:
==============================================================================
URL:
<http://savannah.cern.ch/task/?6474>
Summary: Centralizing word extraction
Project: CDS Invenio
Submitted by: skaplun
Submitted on: 2008-02-25 08:31
Should Start On: 2008-02-25 00:00
Should be Finished on: 2008-02-25 00:00
Category: None
Priority: 3 - Low
Status: None
Privacy: Public
Percent Complete: 0%
Assigned to: skaplun
Open/Closed: Open
Discussion Lock: Any
Effort: 0.00
_______________________________________________________
BibIndex, BibClassify, RefExtract, BibRank with word ranking, all need to
convert a document into a stream of word via pdf tools. It worth do this only
once and cache the extracted document in a zipped way just next to the
different revision of the document.
some_document.pdf;1
some_document.ps.gz;1
.text_in_some_document;1
A centralized api for this would be needed.
_______________________________________________________
Carbon-Copy List:
CC Address | Comment
------------------------------------+-----------------------------
2195 | -SUB-
==============================================================================
This item URL is:
<http://savannah.cern.ch/task/?6474>
_______________________________________________
Message sent via/by LCG Savannah
http://savannah.cern.ch/