Re: [CODE4LIB] Announcing OLAC's prototype FRBR-inspired moving image discovery interface

Karen Coyle Mon, 13 Dec 2010 05:34:26 -0800

Kelley, this is great! Thanks. And since you already have so muchwritten up, would you consider going a bit further and offering it tothe code4lib journal? My reasons are selfish -- i'd like to be able tofind and cite this in the future.


Later I may have a few comments.


kc

Quoting Kelley McGrath <[email protected]>:

We called it "FRBR-inspired" since it probably wouldn't pass musteras an orthodox FRBR interpretation. We were looking to experimentwith a practical approach that we thought would make it much easierfor patrons to discover moving images in libraries and archives. Ifyou haven't read it, the "about" page gives a general overview ofour approach at http://blazing-sunset-24.heroku.com/page/about
Our top level is a combination of FRBR work information andinformation about what we are calling the "primary expression." Wehaven't made any internal distinction between these two types ofinformation. This enables us to record together the data that wethink people expect to see about the generic moving image andreflects the sort of information that is given in IMDb, the AllMovie Guide, and film and TV reference sources. This is also thedata that we would want to re-use in every MARC record for amanifestation of a given movie.
This also allowed us to get around some of the areas of moreorthodox FRBR modeling that we found unhelpful. For example, FRBRdoesn't allow language at the Work level, but we think it isimportant to record the original language of a moving image at thetop level. In addition, RDA has mapped a number of functions, suchas art director, costume designer and performer, to the expressionlevel. We would prefer to present these at the top level. It is hardto imagine a version of Gone With the Wind with a different costumedesigner or cast that would still be the same work. So all the SevenSamurai data you listed above belongs either to the work or theprimary expression.
We mingle expression, manifestation and item information in theversion facets on the right. We don't show any explicit expressionrecords. In this demonstration we are not actually identifying anyunique expressions, although in the future we will probably want todo this for what I think of as "named expressions." Since this is ademo, we are working with a limited number of attributes and theonly expression-level facets we provide are soundtrack and subtitlelanguages.
In this sense, our approach is similar to the near manifestationidea that Simon mentioned. We are not trying to assert that we haveidentified particular expressions. Rather, we are trying to providea mechanism for the user to identify the set of items that meettheir needs. It is not clear to me that libraries are always in aposition to accurately identify expressions.
Rather than providing a hierarchical view where the user selects awork, then an expression, and so on, as is common in FRBRpresentations, we permit the user to begin at any FRBR level. Theuser is invited to limit by as many characteristics as they desireto delineate the set of things that they are interested in. Theyonly need to select as many attributes as are important to them andno more. This may not meet the needs of all scholars, but we hopethat it will meet the vast majority of general purpose user needs.
It's a bit of a different approach than I have seen elsewhere, but Ithink it works particularly well for moving images. One of the mainreasons I think this is because of the types of expressions thatpredominate in commercial moving images. I will try to explain someof my thoughts on types of expressions below.
1. Expressions that can be reduced to controlled vocabulary options
These are the most common types of commercial moving imageexpressions, especially in the DVD era. They are distinguished bycharacteristics that such as
  Soundtrack language(s)
  Subtitle language(s)
  Accessibility options (captioning, SDH, and audio description)
Aspect ratio (although in this era of widescreen TVs, full screenmodifications are less common)
  Colorization
  Soundtracks for silent films
These can be full described based on standardized data (although forthe silent film soundtracks, this would involve multiple pieces ofinformation, i.e., musical work, composer, conductor, performer(s),etc.)
DVD often contain what essentially are multiple expressions in thatthey offer multiple soundtrack and subtitle options and may offermultiple aspect ratios. A silent film on DVD may come with alternatesoundtracks. All of these can be combined in various ways by theviewer, which can make for a large number of expressions containedin a single manifestation.
2. Named expressions
These are versions that are different in moving image content due tohave been edited differently. Examples include
  Theatrical release
  Director's cut
  Unrated version
Although Martha Yee found a strong correlation between differencesin duration and the likelihood that two things represented twodifferent expressions, this doesn't always work. The archetypicalexample of Blade Runner was released on DVD with five differentversions (http://en.wikipedia.org/wiki/Versions_of_Blade_Runner),all of which had run times within a few minutes of each other. Thesetypes of expressions would benefit from their own identifier andsome sort of separate display. In public and academic libraries,this type of moving image expression is far less common than thefirst type. There are no examples of this type of expression in oursample data.
Many more subtle expressions of this type cannot practically beidentified by the individual library cataloger because thepublishers do not provide the necessary information. Many filmsreleased on DVD have been remastered or restored or modified in someway, but it is not clear how to usefully or consistently record thisinformation even when it is provided in some form. For example, itsometimes seems like every release of the Star Wars films must beslightly different, but the videos don't come labeled in any waythat's useful for identifying them. There is a page at Wikipediatracking some changes(http://en.wikipedia.org/wiki/List_of_changes_in_Star_Wars_re-releases) andan enormous thread on the release of the original theatricalversions(http://sideshowcollectors.com/forums/showthread.php?t=12157).
3. Manifestations with additional content
Many manifestation could be considered to be new expressions becauseof the presence of additional content. These types of expressiondon't affect the content of the moving image work itself. Theseadditions could be potentially treated in a couple ways and thedecisions of individual cataloging agencies are likely to vary.
a) Additional content recognized as a work in its own right
Any additional content is theoretically a work in its own right, butthere is a cost-benefit analysis involved in deciding to treat itthat way. In some cases, DVDs come with bonus features that containcontent that the library might potentially have bought (or hasalready bought) independently. These would benefit from beingdescribed as separate works. There are a couple examples of this inour data set. If you do a search for Citizen Kane, you'll get themovie plus a TV documentary called The Battle Over Citizen Kane.Both of these have been issued separately, but the manifestationlisted as " DVD (2001)" under both titles represents the samemanifestation, which includes the TV documentary as supplementarycontent. Whether it is necessary to inform users in some way thatthese are on the same disc at this point or not, I am not sure.
b) Undifferentiated additional content listed with the manifestation
DVDs often come with an abundance of special features, most of whichare probably not worth the time it would take to describe them asseparate works. We have not included any of this type of informationin the demo, but one possibility would just be to list the contentwith each manifestation.
Merging the expression and manifestation facets gave us a simplerinterface and we don't think it harms most viewer's ability to findwhat they want. The four levels of FRBR make a lot of sense from atheoretical perspective (although it is easy to see that there oftenare multiple layers of expressions and that works have manyrecursive relationships). For moving images, in many cases, userscare more about the manifestation format (DVD vs. VHS vs. Blu-rayvs. streaming) than about expression characteristics.
There is also not always a hard and fast line between what goes in arecord as expression and manifestation information. For example,Criterion Collection is generally recorded as a publisher. However,for many users, it likely serves as a proxy for expression sinceCriterion is known for the quality of its videos. According to theirwebsite, "Every time we start work on a film, we track down the bestavailable film elements in the world, use state-of-the-art telecineequipment and a select few colorists capable of meeting our rigorousstandards, then take time during the film-to-video digital transferto create the most pristine possible image and sound. Wheneverpossible, we work with directors and cinematographers to ensure thatthe look of our releases does justice to their intentions."(http://www.criterion.com/about_us)
Well, that was a bit of a long-winded reply and didn't really answeryour question, but I hope it was helpful in framing what we'retrying to do. This is still very much an experiment and there are anumber of data modeling problems that I glossed over in order tomake the demo work, but which would have to be resolved for alarger-scale application.
Kelley


Karen Coyle wrote:
Kelley,

do you have somewhere documentation on which properties/attributes are
associated with each FRBR entity? I ask this in part out of my
ignorance of moving image cataloging, and therefore I am having
trouble translating from the FRBR documentation to what appears in
your prototype. I did my usual search on "seven samurai" and the
display (which I assume represents the Work) reads (in part):

Alternate Title:
   Seven Samurai
Director:
   Kurosawa, Akira, 1910-1998
Genres:
   Feature; Fiction; Drama;
Language:
   Japanese
Country:
   Japan
Original Aspect:
   Full screen ( 1.37:1 )
Run Time:
   206
Color:
   B&W
Sound:
   Sound

I'm curious as to which are Work attributes and which are Expression
attributes. Also, is there an example that shows one work and multiple
expressions?

kc




--
Karen Coyle
[email protected] http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet

Re: [CODE4LIB] Announcing OLAC's prototype FRBR-inspired moving image discovery interface

Reply via email to