I agree with John. Just to add: most file formats have already been defined and tagged outside of the cataloging community (see, for starters: https://en.wikipedia.org/wiki/Internet_media_type). We should make use of those definitions rather than making up our own, in my opinion.
--Ben Benjamin Abrahamse Cataloging Coordinator Acquisitions, Metadata and Enterprise Systems MIT Libraries 617-253-7137 From: Resource Description and Access / Resource Description and Access [mailto:RDA-L@listserv.lac-bac.gc.ca] On Behalf Of John Hostage Sent: Thursday, May 23, 2013 9:27 AM To: RDA-L@listserv.lac-bac.gc.ca Subject: Re: [RDA-L] Size of PDF files I'm saying that the example in the MARC format is in error. RDA seems to be using "text file" to mean any computer file that contains text, including binary files, but that conflicts with the normal meaning of text file. A typical PDF file is comprehended by Content type: text and Media type: computer. Of course, a PDF may have other kinds of content, such as still image, cartographic image, or notated music. Here are some definitions of "text file": Web definitions (computer science) a computer file that contains text (and possibly formatting instructions) using seven-bit ASCII characters wordnetweb.princeton.edu/perl/webwn A text file (sometimes spelled "textfile": an old alternate name is "flatfile") is a kind of computer file that is structured as a sequence of lines. A text file exists within a computer file system. ... en.wikipedia.org/wiki/Text_file A simple data file containing only plain, human-readable text, distinct from documents with embedded formatting; this sense?) (computing) A simple data file in a character encoding that allows it to be read in a simple editor: usually, seven-bit, as opposed to containing raw binary data en.wiktionary.org/wiki/text_file (Text files) Simple unformatted files that are widely recognized and created by many different programs. wps.prenhall.com/wps/media/objects/4848/4964879/go_office_i... or Textfile: A data file consisting entirely of printable ASCII characters, i.e. plain unformatted text. Text files often have a .txt Extension after the filename (e.g. readme.txt) and their contents can be viewed using programs such as Windows Notepad. ... www.ict4lt.org/en/en_glossary.htm<http://www.ict4lt.org/en/en_glossary.htm> A file containing ASCII text created by any standard editor. Such text files can contain, for example, COBOL source code or Compiler directives. COBOL programs can read or write such files by specifying ORGANIZATION LINE SEQUENTIAL. www.microfocus.co.jp/manuals/SE/books/mxglos.htm<http://www.microfocus.co.jp/manuals/SE/books/mxglos.htm> A file with text that has no formatting. None of the text is bold, underlined, italicized, or tabbed. www.michigan.gov/msp/0,1607,7-123-1589_1711_4579-1421... A file in which the bytes represent printable characters organized into lines separated by newline characters. www.difranco.net/progstuff/voc_list.htm<http://www.difranco.net/progstuff/voc_list.htm> A file that contains text to be compiled during the build. Source: NTK www.splorp.com/newton/glossary/<http://www.splorp.com/newton/glossary/> with tab delimited values (aka csv file): go to point 2. wasaty.pl/blog/2010/05/30/importing-glossary-entries-to-multite... This file contains all text strings used by the system to convert various codes into alpha descriptions for display and report purposes. The file also contains much of the parametric data (not stored in the Parameter File) that is necessary to control many optional functions. ... www.leadtec.com/en/home/glossary.html<http://www.leadtec.com/en/home/glossary.html> contains plain text and may be opened in a text editor wps.aw.com/aw_gaddis_vb2008_4/82/21177/5421435.cw/conte... A file containing ASCII characters. www.sitemasterinternet.co.uk/i_pps/glossary.htm<http://www.sitemasterinternet.co.uk/i_pps/glossary.htm> the term "text file" is usually used to indicate a computer file that has no special formatting or additional structure that most word processor and spreadsheet programs use. Text files can be displayed on the screen with no "garbage" characters showing up. The AUTOEXEC.BAT and CONFIG. ... www.morgancc.edu/abm/curric/reference/terms/ObsoleteCompT... ------------------------------------------ John Hostage Authorities and Database Integrity Librarian // Harvard Library--Information and Technical Services // Langdell Hall 194 // Cambridge, MA 02138 host...@law.harvard.edu<mailto:host...@law.harvard.edu> +(1)(617) 495-3974 (voice) +(1)(617) 496-4409 (fax) From: Resource Description and Access / Resource Description and Access [mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA] On Behalf Of Mitchell, Michael Sent: Thursday, May 23, 2013 08:34 To: RDA-L@LISTSERV.LAC-BAC.GC.CA<mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA> Subject: Re: [RDA-L] Size of PDF files I'm looking at http://www.loc.gov/marc/bibliographic/bd347.html and the first example under $b is 347 ##$atext file$bPDF$2rda so I'm a little confused as to what you mean when you say "a PDF file is not a text file." What am I missing? Michael Mitchell Technical Services Librarian Brazosport College Lake Jackson, TX Michael.mitchell at brazosport.edu From: Resource Description and Access / Resource Description and Access [mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA] On Behalf Of John Hostage Sent: Wednesday, May 22, 2013 10:23 PM To: RDA-L@LISTSERV.LAC-BAC.GC.CA<mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA> Subject: Re: [RDA-L] Size of PDF files Kyley, I'm saying that it's not necessary to give a file type as defined in RDA at all. For one thing, Digital file characteristic is not a core element. For another, the file types given in 3.19.2.3 consist of information that's already found in the content type and media type, and the terms are misleading. I'm no expert, but I doubt the types are a useful categorization. A PDF file is not a text file. If you had to characterize it, you might call it a document file. John ------------------------------------------ John Hostage Authorities and Database Integrity Librarian Harvard Library--Information and Technical Services Langdell Hall 194 Harvard Law School Library Cambridge, MA 02138 host...@law.harvard.edu<mailto:host...@law.harvard.edu> +(1)(617) 495-3974 (voice) +(1)(617) 496-4409 (fax) ________________________________ From: Resource Description and Access / Resource Description and Access [RDA-L@LISTSERV.LAC-BAC.GC.CA] on behalf of Felix, Kyley [kfe...@parliament.wa.gov.au] Sent: Wednesday, May 22, 2013 21:29 To: RDA-L@LISTSERV.LAC-BAC.GC.CA<mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA> Subject: Re: [RDA-L] Size of PDF files John, Are you saying it's not necessary to put the 347 field in at all, or that the 347 (a) field should not be text file? What should it be? Many thanks, Kyley Kyley Felix Librarian Parliamentary Library Parliament House Harvest Tce Perth WA 6000 Phone: (08) 9222 7393 From: Resource Description and Access / Resource Description and Access [mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA] On Behalf Of John Hostage Sent: Wednesday, 22 May 2013 9:39 PM To: RDA-L@LISTSERV.LAC-BAC.GC.CA<mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA> Subject: Re: [RDA-L] Size of PDF files The file types in RDA 3.19.2 do not convey anything that is not already conveyed by content type and media type, so there is no need for this element to be in RDA. In any event, a PDF file is not a text file as the term is commonly understood (http://en.wikipedia.org/wiki/Text_file) ------------------------------------------ John Hostage Authorities and Database Integrity Librarian // Harvard Library--Information and Technical Services // Langdell Hall 194 // Cambridge, MA 02138 host...@law.harvard.edu<mailto:host...@law.harvard.edu> +(1)(617) 495-3974 (voice) +(1)(617) 496-4409 (fax) From: Resource Description and Access / Resource Description and Access [mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA] On Behalf Of Felix, Kyley Sent: Tuesday, May 21, 2013 23:13 To: RDA-L@LISTSERV.LAC-BAC.GC.CA<mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA> Subject: [RDA-L] Size of PDF files I'm cataloguing a lot of PDF files in my library. I want to make it easy for users to see the size of the documents. This is what I am thinking of doing in the 300 and 347 fields. The 347 field is hidden from the user so I want the file size also showing in the 300 field. I wasn't sure if this is the best way to do it. Also not sure whether the file size should be within the brackets with the extent? I'm unable to find examples where both the number of pages and the file size are used. 300 (10 a) 1 online resource (v, 23 pages), 840 KB : (20 b) text file, PDF. 347 (10 a) text file (20 b) PDF (30 c) 840 KB Your thoughts would be appreciated. Kyley Felix Librarian Parliamentary Library Parliament House Harvest Tce Perth WA 6000 Phone: (08) 9222 7393 --------------------------------------------------------------------- PARLIAMENT OF WESTERN AUSTRALIA CONDITIONS OF USE, PUBLICATION, OR DISCLOSURE OF THIS EMAIL APPLICABLE TO RECIPIENT The content of this email (including any attachments) - is provided for the use of the intended recipient only; and - mere receipt in no way authorises any recipient to disclose or publish all or part of it to another person or in any form. If this email relates to matters that were, or are being, considered by one or both Houses of Parliament or a committee of either or both Houses, any unauthorised use, publication or disclosure may amount to a breach of the privileges of the House(s). A person who is not an intended recipient is requested to advise the sender and delete this email immediately. Although this email has been scanned for viruses, this email is not guaranteed to be free of viruses and should be vetted by your own security mechanisms. The Parliament of Western Australia accepts no liability for any loss or damage arising from the use of this email or its attachments. --------------------------------------------------------------------- PARLIAMENT OF WESTERN AUSTRALIA CONDITIONS OF USE, PUBLICATION, OR DISCLOSURE OF THIS EMAIL APPLICABLE TO RECIPIENT The content of this email (including any attachments) - is provided for the use of the intended recipient only; and - mere receipt in no way authorises any recipient to disclose or publish all or part of it to another person or in any form. If this email relates to matters that were, or are being, considered by one or both Houses of Parliament or a committee of either or both Houses, any unauthorised use, publication or disclosure may amount to a breach of the privileges of the House(s). A person who is not an intended recipient is requested to advise the sender and delete this email immediately. Although this email has been scanned for viruses, this email is not guaranteed to be free of viruses and should be vetted by your own security mechanisms. The Parliament of Western Australia accepts no liability for any loss or damage arising from the use of this email or its attachments.