Hi all,

I am currently developing a collections model that uses Fedora, and I am 
facing some design issues. I would like to ask for advice on how two designs 
I have in mind compare in terms of their impact on searching and indexing. 
Neither approach is complex in terms of collection structure or object types. 
I am using both the Mulgara RI and the Solr engine, combined with Fedora 
Generic Search for automatic index updates.

The first approach includes the following layout:

- Collection-level object, which is the container of the collection and holds 
only metadata datastreams and one content datastream.
- Document objects, which are related to the collection container and could 
potentially hold a large number of internal datastreams (inline XML 
datastreams). They use RELS-EXT to link to the collection container object 
and a RELS-INT datastream to specify the relationships among the inline 
datastreams they contain.

From the point of view of indexing the datastreams in the search engine, this 
approach has a very simple structure, since everything is contained in one 
object. On the other hand, it raises performance concerns, because the number 
of inline datastreams per object could be very high (each one is very small, 
but there could be a great many of them).
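
To make the indexing side of this concrete, here is a minimal Python sketch 
of what I mean by "simple structure". It is only an illustration: the class, 
the field names ("collection", "ds_text") and the PIDs are hypothetical, and 
it does not use any real Fedora or GSearch API. One document object maps to 
exactly one Solr-style document.

```python
from dataclasses import dataclass, field


@dataclass
class DocumentObject:
    """Hypothetical in-memory stand-in for one Fedora document object."""
    pid: str
    collection_pid: str  # target of the RELS-EXT link to the collection
    inline_xml: dict = field(default_factory=dict)  # datastream ID -> text


def to_index_doc(obj):
    """Flatten one object into a single Solr-style document (a plain dict)."""
    return {
        "id": obj.pid,
        "collection": obj.collection_pid,
        # One multivalued field holds the text of every inline datastream,
        # ordered by datastream ID so the result is deterministic.
        "ds_text": [obj.inline_xml[dsid] for dsid in sorted(obj.inline_xml)],
    }


doc = DocumentObject(
    pid="demo:doc1",
    collection_pid="demo:collection",
    inline_xml={"DS2": "second fragment", "DS1": "first fragment"},
)
index_doc = to_index_doc(doc)
print(index_doc)
```

The mapping itself is trivial; the concern is only that the "ds_text" list 
(and the object behind it) grows with the number of inline datastreams.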

The second approach involves having multiple types of objects:

- Collection-level object (the same as in the first approach).
- Document object, which now contains only metadata, RELS-EXT and one managed 
datastream.
- Data objects, which contain only a metadata datastream, an inline XML 
datastream, and RELS-EXT to relate them to the document object and to each 
other. These objects are the equivalent of the inline XML datastreams that 
were included within the document objects in the first approach.

This solution adds no complexity to searches when using the RI and the 
Mulgara triplestore as the search interface, but it seems to be more complex 
when using the Solr search engine.
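
The extra complexity I have in mind on the Solr side is roughly the following 
join step, sketched here in Python. This is an assumption-laden illustration, 
not real GSearch code: I assume the data objects point to their document 
object with the isPartOf relationship, the triples are hard-coded instead of 
being queried from the RI, and the field names and PIDs are hypothetical.

```python
from collections import defaultdict

# Assumed predicate; which relationship to use is still an open design choice.
IS_PART_OF = "info:fedora/fedora-system:def/relations-external#isPartOf"

# (subject, predicate, object) triples, hard-coded here for illustration.
triples = [
    ("info:fedora/demo:data1", IS_PART_OF, "info:fedora/demo:doc1"),
    ("info:fedora/demo:data2", IS_PART_OF, "info:fedora/demo:doc1"),
    ("info:fedora/demo:data3", IS_PART_OF, "info:fedora/demo:doc2"),
]

# Hypothetical inline XML text held by each data object.
data_text = {
    "info:fedora/demo:data1": "first fragment",
    "info:fedora/demo:data2": "second fragment",
    "info:fedora/demo:data3": "third fragment",
}


def build_index_docs(triples, data_text):
    """Group data objects under their document and emit one dict per document."""
    parts = defaultdict(list)
    for subject, predicate, obj in triples:
        if predicate == IS_PART_OF:
            parts[obj].append(subject)
    return [
        {
            "id": doc_uri,
            "part_ids": sorted(children),
            "ds_text": [data_text[child] for child in sorted(children)],
        }
        for doc_uri, children in sorted(parts.items())
    ]


index_docs = build_index_docs(triples, data_text)
for index_doc in index_docs:
    print(index_doc)
```

In other words, to search a document by the content of its data objects, the 
index update for the document has to gather text from several other objects, 
and it has to be re-run whenever any of those data objects changes.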

Has anyone dealt with similar design issues? I would really appreciate any 
advice on this.

Thanks in advance,
Agustina

_______________________________________________
Fedora-commons-users mailing list
Fedora-commons-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users