#363: Number of results show in XML-ish formats
------------------------+---------------------------------------------------
Reporter: tbrooks | Owner:
Type: defect | Status: new
Priority: major | Milestone:
Component: BibFormat | Version:
Resolution: | Keywords:
------------------------+---------------------------------------------------
Comment (by arwagner):
More than 25 records are no issue in Endnote Tagged (aka RIS) or BibTeX
format.
However, to my experience, they produce considerable load upon import from
an XML format. Main reason is that the XML parsers may perform very badly
on larger sets. Especially, if those sets are huge. In a local use case it
proved a bad idea to write out more than 100 records in XML. The required
memory footprint by the parser may also be quite an issue in this case.
E.g. in my usages, performance is much better if I do 1000 records on per
record basis than in one chunk. (Using Perl XML::XPath.) For common papers
in experimental HEP (large numbers of authors) a lot fewer records seem
advisable, sometimes already one record is quite a job for the importer.
Therefore, it seems advisable to not increase the number of records in one
chunk but offer the user several separate files to download and import. A
similar approach is implemented in OAI-PMH.
--
Ticket URL: <http://invenio-software.org/ticket/363#comment:1>
Invenio <http://invenio-software.org>