On Mon, 2008-08-18 at 19:23 +0100, Graham Triggs wrote:
> Richard Rodgers wrote:
> > I do worry about opening door #1 [content rejection],
> > since taking assets as found seems pretty close to the bedrock
> > use-case for digital repositories - at least preservation-minded ones.
> >    
> Well, that is an interesting argument! Now, if we look at assets 'as 
> found' then they will [probably] be located in a users file system. That 
> file system will already be enforcing a unique constraint on the names 
> of files within a directory.
> 
True - ingestion into a repository is a destructive
decontextualization/recontextualization. My point about
'as found' means be as minimally destructive
as possible, viz not changing 'graham.txt' to 'richard.txt' (or
'graham[2].txt'). If fact (the web submission UI notwithstanding),
nothing in the data model prevents us from capturing all we want as
contextual metadata (hasParent 'q4' etc). 

> Now, in your example you had a user with two files that had the same 
> name but located in different directories. Presumably there is implicit 
> knowledge in the particular organisation of the file structure. And we 
> are not taking it 'as found' because DSpace is forcing the user to throw 
> away that organisation (and therefore any knowledge/information it 
> implies) when attaching all those files to a single item.
> 
> In terms of the sequence number, we assign a 'genuine unique id' to 
> every bitstream that is ingested, and there is no reason why that id 
> can't be used in place of the sequence number in the url in the case 
> where disambiguation is necessary.
> 
> There is nothing wrong with presenting a disambiguation page if a url is 
> provided without that unique id, and where the filename can't uniquely 
> resolve.
> 
> There just isn't any need to use a sequence number in this way, and 
> include it as part of the URL. What there is a need for is a way to 
> define the order in which the bitstreams are presented for an item - 
> which should be the job of a sequence number, but it isn't used for that.
> 
> (Note that the above is true for the majority using the 80/20 rule. 
> There may be some exceptional cases that doesn't fit into the above 
> statements, but then they may not be serviced sufficiently by the 
> existing use of sequence numbers either).
> 
> G
> This email has been scanned by Postini.
> For more information please visit http://www.postini.com
> 


-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
DSpace-tech mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-tech

Reply via email to