#363: Number of results show in XML-ish formats
------------------------+---------------------------------------------------
  Reporter:  tbrooks    |       Owner:     
      Type:  defect     |      Status:  new
  Priority:  major      |   Milestone:     
 Component:  BibFormat  |     Version:     
Resolution:             |    Keywords:     
------------------------+---------------------------------------------------

Comment (by arwagner):

 More than 25 records are no issue in Endnote Tagged (aka RIS) or BibTeX
 format.

 However, to my experience, they produce considerable load upon import from
 an XML format. Main reason is that the XML parsers may perform very badly
 on larger sets. Especially, if those sets are huge. In a local use case it
 proved a bad idea to write out more than 100 records in XML. The required
 memory footprint by the parser may also be quite an issue in this case.
 E.g. in my usages, performance is much better if I do 1000 records on per
 record basis than in one chunk. (Using Perl XML::XPath.) For common papers
 in experimental HEP (large numbers of authors) a lot fewer records seem
 advisable, sometimes already one record is quite a job for the importer.

 Therefore, it seems advisable to not increase the number of records in one
 chunk but offer the user several separate files to download and import. A
 similar approach is implemented in OAI-PMH.

-- 
Ticket URL: <http://invenio-software.org/ticket/363#comment:1>
Invenio <http://invenio-software.org>

Reply via email to