I prefer also option C

Cheers
Tsuy

On Oct 26, 2011, at 5:14 PM, Tommaso Teofili wrote:

> same here; I'd go with C option :)
> Tommaso
> 
> 2011/10/26 Daniel Spicar <dspi...@apache.org>
> 
>> the JIRA issue can be found here:
>> https://issues.apache.org/jira/browse/CLEREZZA-643
>> 
>> On Wed, Oct 26, 2011 at 3:36 PM, Daniel Spicar <dspi...@apache.org> wrote:
>> 
>>> Rupert provided a patch to improve serialization performance (thanks for
>>> the effort!). I reviewed his Patch and have written my comments on the
>> JIRA
>>> page. But I think we need to discuss the issues I raise there. In
>> summary:
>>> 
>>> - neither the patch nor the current implementations work reliably with
>> very
>>> large graphs (larger than memeory)
>>> - the patch is significantly faster than the current implementation
>>> - the current implementation is easier to quick-fix for very large graphs
>>> (but also very slow)
>>> 
>>> There is a sketch of a better solution that should allow us to be faster
>>> and not limited by memory size. It is based on sorted iterators. However
>>> these iterators need to be supplied by the underlying TripleCollections
>> and
>>> that will require more changes to the core of Clerezza.
>>> 
>>> Because both, the current implementation and the patch doe not really
>> work
>>> on "big" TripleCollection (when big means really really big) the question
>> we
>>> should discuss its:
>>> a) keep everything as it is and solve the problem properly (possibly as
>>> described in the issue)
>>> b) quick fix the current implementation (slow performance)  + schedule a
>>> proper solution
>>> c) apply the patch (fast but graphs limited to available memory size) +
>>> schedule a proper solution
>>> 
>>> My favorite is c.
>>> 
>>> What do you think?
>>> 
>> 

--trialox ag-------------------------------------
  tsuyoshi ito
  hardturmstrasse 101 
  8005 zuerich

Reply via email to