Re: [fcrepo-user] [fcrepo-dev] GSearch planning

Conal Tuohy Sun, 23 Oct 2011 17:08:23 -0700

On 17/10/11 11:48, aj...@virginia.edu wrote:
> Heartily seconded!
>
> In the architecture we're exploring at UVa, we use RELS-INT to define 
> relationships between datastreams and indexing transforms. The relevance to 
> this issue lies in RELS-EXT. By indexing RELS-EXT as a datastream (and 
> assuming that the molecular "para-object" that is responsible for a given 
> index record is constructed via RELS-EXT relationships) we can obtain 
> information about the other objects that may be involved in any index record 
> to which a given object is associated. I'm in agreement that keeping the 
> analysis of object relationships for indexing purposes in indexing XSLT is 
> _not_ the best way, and instead we look to combine this technique with the 
> use of Enhanced Content Model Views to create the kind of multiobject records 
> to which Jonathan is pointing by hiding the explicit structure of the 
> "para-object" from the indexing XSLT. This may or may not be the best 
> possible solution for the problem, so I'm just offering it as a place to 
> begin discussion.
>
>
> ---
> A. Soroka
> Online Library Environment
> the University of Virginia Library
>
>
>
>
> On Oct 16, 2011, at 8:15 PM, Jonathan Green wrote:
>
>> Something that I think needs to be considered when moving forward with 
>> gsearch is that the index may not always share a 1 to 1 relationship with 
>> objects in fedora. In a very atomistic content model perhaps the solr 
>> document is actually composed of parts from many related objects. These 
>> types of decisions are currently very hard to make in XSLTs.
In what way hard? Can you expand a little on the difficulties you see?


>> While I think XSLTs have a place in transforming metadata, there needs to be 
>> something more.
One issue to keep to in mind here is the 80/20 rule. If Fedora's 
indexing system is complex enough to allow for all manner of complex 
cases, then it may be needlessly complex for many simple cases. A more 
complex system would make complex indexing easier, but if it also makes 
simpler cases harder (even just harder to understand a configuration 
system), then the OVERALL ease-of-use might actually decrease. I don't 
think it's possible to strike a perfect balance, but a technology like 
XSLT might be a useful catch-all: it can handle simple cases very 
simply, but can also be extended arbitrarily (including, for instance, 
transcluding metadata from related Fedora objects or other XML datasources).

In very many cases, the mapping of Fedora objects to Solr documents is 
very simple and won't, for instance, involve any aggregation. But the 
mapping from Fedora objects to Solr documents is in principle arbitrary; 
you might choose to do pretty much anything, quite legitimately. You 
might have metadata schemas of any type; you might use the RDF store, 
you might have external authority files, etc. This is where, I think, a 
system which is sufficiently configurable to be fully general could well 
end up as complex as an XSLT-based system would be, but without many of 
the advantages of XSLT (code libraries, books and mail-lists, programmer 
experience, etc).

It might be enough to ship Fedora with a basic set of XSLT transforms, 
and a few sample transforms showing how to use the resource index, etc.
-- 

Conal Tuohy
eResearch Business Analyst
Victorian eResearch Strategic Initiative
+61-466324297



------------------------------------------------------------------------------
The demand for IT networking professionals continues to grow, and the
demand for specialized networking skills is growing even more rapidly.
Take a complimentary Learning@Cisco Self-Assessment and learn 
about Cisco certifications, training, and career opportunities. 
http://p.sf.net/sfu/cisco-dev2dev
_______________________________________________
Fedora-commons-users mailing list
Fedora-commons-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users

Re: [fcrepo-user] [fcrepo-dev] GSearch planning

Reply via email to