Hi all, I am currently developing some Collections Model which uses Fedora and I am facing some design issues. I would like to ask for advice regarding the advantages from the point of view of impacts in searching and indexing of two different designs I have in mind. Either approaches I will describe are not complex in terms of collection structure and object types. On the other hand, I am using both Mulgara RI and Solr engine combined with Fedora generic search for automatic index updates.
The first approach includes the following layout: - Collection level object, which is the container of the collection and only holds metadata datastreams and one content datastream - Document objects, which are related with the collection container and could have potentially a large number of internal datastreams (inline xml datastream types) and make use of RELS-EXT to link them with the collection container object and RELS-INT datastream to specify the different relationships within the contained inline datastreams. This approach, from the point of view of indexing the datastreams in the search engine, presents a very simple structure since everything is included within one object but on the other hand, presents performance issues since the number of inline datastreams could potentially be very high (each one is very small but the number of them could be very high). The second approach involves having multiple types of objects: - Collection level object (the same as in the first approach) - Document object, which now only contains metadata, RELS-EXT and one managed datastream - Data objects, which only contain metadata datastream, inline xml datastream and RELS-EXT to relate them with the document objects and between them. These objects are the equivalent of the inline-xml datastreams that were included within the document objects in the first approach. This solution does not present complexity or impact in searchs when using the RI and Mulgara triplestore as a search interface but seems to be more complex if using Solr Search engine. Has anyone dealt with similar design issues? I would really appreciate any advice on this. Thanks in advance, Agustina ------------------------------------------------------------------------------ WhatsUp Gold - Download Free Network Management Software The most intuitive, comprehensive, and cost-effective network management toolset available today. Delivers lowest initial acquisition cost and overall TCO of any competing solution. http://p.sf.net/sfu/whatsupgold-sd _______________________________________________ Fedora-commons-users mailing list Fedora-commons-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/fedora-commons-users