title truncation
----------------
i can raise the max title length to 240 bytes. the mediafile’s extension will 
count against that total, so the title is truncated to 240 bytes minus the 
extension length; e.g., .tiff takes 5 bytes and .jpg takes 4. i’ll add this to 
the list of items i hope to cover during the hackathon.
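
a minimal sketch of that truncation rule, in python rather than the tool's own 
code. the function name truncate_title and the constant MAX_TITLE_BYTES are 
hypothetical, and the sketch assumes titles are measured as utf-8 bytes:

```python
MAX_TITLE_BYTES = 240  # limit quoted in this thread; assumed to be UTF-8 bytes


def truncate_title(title: str, extension: str,
                   max_bytes: int = MAX_TITLE_BYTES) -> str:
    """Return title + extension, trimmed so the whole name fits max_bytes.

    Hypothetical helper for illustration only, not GWToolset's implementation.
    """
    # Reserve room for the extension first, e.g. 5 bytes for ".tiff".
    budget = max_bytes - len(extension.encode("utf-8"))
    if budget <= 0:
        raise ValueError("extension leaves no room for a title")

    # Cut on a byte boundary, then drop any multi-byte character that was
    # split in half by the cut (errors="ignore" discards the partial bytes).
    raw = title.encode("utf-8")[:budget]
    trimmed = raw.decode("utf-8", errors="ignore").rstrip()

    return trimmed + extension


if __name__ == "__main__":
    name = truncate_title("a" * 300, ".tiff")
    print(len(name.encode("utf-8")))
```

note that this still trims from the end of the title, so an identifier placed 
last can be cut off, which is the concern raised in the quoted message below.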


title building
--------------
we also have an issue with how the title is built. currently, the values mapped 
to the template’s title parameter and title identifier are used to create the 
title. james healed ran into this when he was uploading some images from the 
british library: he didn’t understand the building process and was at first 
confused. he ended up creating one “special” field in the metadata that 
contained the _unique_ title he wanted to use. he then mapped that field to 
the title identifier parameter without mapping anything to the template title 
parameter. i suggest that we rename the gwtoolset title identifier parameter 
to gwtoolset-title and that we recommend that uploaders mimic what james did.
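
to illustrate the two mappings, here is a rough python sketch. build_title is a 
hypothetical name, and the join order and the single-space separator are 
assumptions made for the example, not a description of what the tool does:

```python
def build_title(title: str, title_identifier: str) -> str:
    """Combine the two mapped values into one title.

    Hypothetical illustration: the order and the single-space separator
    are assumptions, not GWToolset's documented behaviour.
    """
    # Skip values that are empty or whitespace-only, so mapping only one
    # of the two parameters yields just that value.
    parts = [p.strip() for p in (title, title_identifier) if p and p.strip()]
    return " ".join(parts)


if __name__ == "__main__":
    # Both parameters mapped: the title is assembled from two fields.
    print(build_title("Index Map", "NYPL1633883"))
    # The approach james used: only the identifier is mapped, and it
    # already holds the complete unique title.
    print(build_title("", "Index Map NYPL1633883"))
```

with only the identifier mapped, the uploader controls the whole title from a 
single metadata field, which is why the approach is easier to reason about.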


with kind regards,
dan


On Apr 30, 2014, at 21:12 , Fæ <[email protected]> wrote:

> Sorry, got my examples confused. The general point about filename
> truncation is still correct.
> 
> Fae
> 
> On 30/04/2014, Fæ <[email protected]> wrote:
>> Hi,
>> 
>> I had an odd problem with files not being created, which I think I can
>> put down to how long filenames are handled by GWT.
>> 
>> As an example, my xml specified (A) but GWT created (B):
>> A. File:Index Map No.2 of a part of Suffolk County. South Side - Ocean
>> Shore, Long Island. Part of Islip and Part of Brookhaven. Published by
>> E. Belcher Hyde. 97 Liberty Street, Brooklyn. 5 Beekman Street,
>> NYPL1633883.tiff (209 chars) (see link)
>> B. File:Index Map No. 2 of a part of Suffolk County. South Side -
>> Ocean Shore, Long Island. Easthampton. Published by E. Belcher Hyde.
>> 97 Liberty Street, Brooklyn. 5 Beekman Street, Manhattan. 1916. Volume
>> NYPL1633.tiff (206 chars)
>> 
>> This seems an easy thing to warn the user about when reading the xml.
>> In terms of behaviour I would expect the tool to reject the xml as
>> malformed and warn about maximum allowed filename length, rather than
>> truncate the name, in this case truncation meant corrupting the unique
>> NYPL identifier.
>> 
>> It would be better if GWT allowed the maximum title length that
>> Commons allows (240 bytes, the number of visible characters varying by
>> charset).
>> 
>> I vaguely recall the Steering Committee discussing this last year, so
>> I'm unsure if this is worth raising in bugzilla. Suggestions?
>> 
>> Links
>> 1.
>> https://commons.wikimedia.org/wiki/File:Index_Map_No.2_of_a_part_of_Suffolk_County._South_Side_-_Ocean_Shore%2C_Long_Island._Part_of_Islip_and_Part_of_Brookhaven._Published_by_E._Belcher_Hyde._97_Liberty_Street%2C_Brooklyn._5_Beekman_Street%2C_NYPL1633883.tiff
>> 2. https://bugzilla.wikimedia.org/show_bug.cgi?id=30202
>> 3. https://commons.wikimedia.org/wiki/Commons:Filenames
>> 
>> Fae
>> --
>> [email protected] https://commons.wikimedia.org/wiki/User:Fae
>> 
> 
> 
> -- 
> [email protected] https://commons.wikimedia.org/wiki/User:Fae
> 
> _______________________________________________
> Glamtools mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/glamtools

