On 27 Jan 2010, at 23:32, Mark Diggory wrote:

> On Wed, Jan 27, 2010 at 10:16 AM, Simon Brown <st...@cam.ac.uk> wrote:
>
>> I confess that the reason why we are having this discussion at all
>> eludes me. It seems like a fairly obvious bug for the importer to
>> prune the indexes so many times (the comment for pruneIndexes() even
>> says "called from the public interfaces or at the end of a batch
>> indexing process"), it has a demonstrably detrimental effect on the
>> performance of the software, and the fix for it is not particularly
>> complicated. Is there something I've missed?
>>
>> --
>> Simon Brown <st...@cam.ac.uk> - Cambridge University Computing  
>> Service
>> +44 1223 3 34714 - New Museums Site, Pembroke Street, Cambridge CB2  
>> 3QH
>
> We discuss it because we seek to maintain an appropriate separation of
> concerns in our architecture. And because Graham usually challenges us
> to look at aspects of that architecture that are important.  What is
> under discussion is not that performance can't be improved by your
> patch, you've identified a very important issue in batch processing.
> We are discussing architecturally if we want to alter the
> Context/EventManager framework and expose calls to pruneIndex.  We
> want to be careful to avoid exposing too much of the internals of the
> Browse system outside in the application architecture.

That's fine; as I said in my initial submission, I know that my patch  
isn't the best way of writing that code. I should have made  
pruneIndexes() package private rather than public, for one thing. It  
was done to get the patch out and under discussion quickly and in a  
way which would hopefully minimise the effect on the rest of the code  
base.

Having dug through the code a little more in the meantime, it seems  
that the effect of pruneIndexes() is to remove from the browse indexes  
information about items which are expunged and/or withdrawn; in that  
light it might not be necessary to call it when items are added or  
changed at all, thus reducing the patch to a single-line change. If  
that's the case I'll happily withdraw mine in favour of the new one. :)

> Excellent work on finding a means to improve DSpace performance.

Thank you.

--
Simon Brown <st...@cam.ac.uk> - Cambridge University Computing Service
+44 1223 3 34714 - New Museums Site, Pembroke Street, Cambridge CB2 3QH



------------------------------------------------------------------------------
The Planet: dedicated and managed hosting, cloud storage, colocation
Stay online with enterprise data centers and the best network in the business
Choose flexible plans and management services without long-term contracts
Personal 24x7 support from experience hosting pros just a phone call away.
http://p.sf.net/sfu/theplanet-com
_______________________________________________
Dspace-devel mailing list
Dspace-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-devel
  • [Dspac... Simon Brown (JIRA)
    • [... Richard Rodgers (JIRA)
      • ... Thornton, Susan M. (LARC-B702)[RAYTHEON TECHNICAL SERVICES COMPANY]
        • ... Richard Rodgers
          • ... Tom De Mulder
            • ... Graham Triggs
              • ... Simon Brown
                • ... Mark Diggory
                • ... Simon Brown
                • ... Graham Triggs
                • ... Graham Triggs
                • ... Richard, Joel M
                • ... Graham Triggs
    • [... Simon Brown (JIRA)
    • [... Simon Brown (JIRA)
    • [... Graham Triggs (JIRA)
    • [... Mark Diggory (JIRA)
    • [... Simon Brown (JIRA)
    • [... Graham Triggs (JIRA)

Reply via email to