On Oct 18, 2021, at 2:50 PM, McDonald, Stephen <[email protected]> wrote:
>> Using Dublin Core, is there a way to express size measured in words?
>
> Only if you create your own guidelines for your own data. As I said, it's
> just like MARC 300$a: "300 p.", "300 pages", "300 pages", "three hundred
> pages", and "1 volume (300 pages)" are all equivalent in MARC. Extent was
> not designed for computer parsing.
>
> The DCMI documentation for extent says, "Recommended practice is to specify
> the file size in megabytes and duration in ISO 8601 format", but doesn't give
> any guidance for string encoding scheme for those dimensions, nor any
> guidance at all for other dimensions. It's possible that someone has written
> suggested guidelines for certain purposes.
Thank you for the prompt reply, and yep. Consequently, I might ultimately go
with something like below where I add a namespace and I add an attribute to
extent. The following validates against an RDF validator:
<?xml version="1.0"?>
<rdf:RDF
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:dcterms="http://purl.org/dc/terms/"
xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
xmlns:reader="https://distantreader.org/">
<rdf:Description
rdf:about="http://library.distantreader.org/carrels/homer/">
<dc:identifier>homer</dc:identifier>
<dc:format>
<dcterms:extent reader:units="megabytes">
<rdf:value>100</rdf:value>
<rdfs:label>100 MB (compressed)</rdfs:label>
</dcterms:extent>
</dc:format>
<dc:format>
<dcterms:extent reader:units="words">
<rdf:value>20000000</rdf:value>
<rdfs:label>20,000,000 words</rdfs:label>
</dcterms:extent>
</dc:format>
<dc:format>
<dcterms:extent reader:units="items">
<rdf:value>50</rdf:value>
<rdfs:label>50 textual items</rdfs:label>
</dcterms:extent>
</dc:format>
</rdf:Description>
</rdf:RDF>
This way arithmetic can be applied against extent values.
RDF Validator: https://www.w3.org/RDF/Validator/
--
Eric Morgan