Re: linked open data and PDF

2015-02-19 Thread Larry Masinter
I haven’t dropped the thread, but I’d like to up-level a little: - I think we might have different models for how documents typically get produced, and this difference in models results in different perspectives on what constitutes “easy” and “hard”. We should compare typical document

Re: linked open data and PDF

2015-02-03 Thread Sarven Capadisli
On 2015-01-30 16:48, Larry Masinter wrote: There are a number of issues and shortcomings with the PDF approach which in the end will not play well with the Web is intended to be, nor how it functions. I think I have familiarity with what the Web is intended to be, and how it functions, and I

RE: linked open data and PDF

2015-01-30 Thread Larry Masinter
((sorry, this sat in my drafts box, incomplete, and out of sequence now)) In reply to Sarven Capadisli's email of 2015-01-19 12:20: First off, I totally acknowledge your interest to improve the state of things for PDF. thanks I'm welcome to be proven wrong, but for the big picture, I don't

Re: linked open data and PDF

2015-01-29 Thread Alexander Garcia Castro
Dear all, we are organising Sepublica 2015; this workshop is precisely about standards, metadata and semantic web improving scholarly communication. please take a look at http://sepublica.info/, also, take a look at the call for submissions. SePublica 2015: Do show, do not tell!

Re: linked open data and PDF

2015-01-24 Thread Michael Brunnbauer
Hello Paul, On Fri, Jan 23, 2015 at 11:21:32AM -0500, Paul Houle wrote: I don't think that is true; I see this example in the XMP spec rdf:RDF xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#; xmlns:xmp=http://ns.adobe.com/xap/1.0/; rdf:Description rdf:about= xmp:BaseURL

Re: linked open data and PDF

2015-01-24 Thread Michael Brunnbauer
Hello Larry, On Fri, Jan 23, 2015 at 11:03:40PM +, Larry Masinter wrote: And Michael Brunnbauer replied: I am right that such a pointer to external data would have to be a literal? The way I read the XMP standard is that only literals, blank nodes, rdf:Bag and rdf:Seq are allowed

RE: linked open data and PDF

2015-01-23 Thread Larry Masinter
I wrote: Image formats like JPEG and PNG (for which there is support for XMP) don't have a standard, uniform way of attaching other files, though, so allowing data (or a pointer to external data) in the XMP would broaden the applicability. And Michael Brunnbauer replied: I am right

Re: linked open data and PDF

2015-01-23 Thread Roderic Page
There’s a survey by Ross Mounce here: http://rossmounce.co.uk/2013/01/06/pdf-metadata-using-exiftool/ Regards Rod - Roderic Page Professor of Taxonomy Institute of Biodiversity, Animal Health and Comparative Medicine College of Medical,

Re: linked open data and PDF

2015-01-23 Thread Michael Brunnbauer
Hello Larry, On Tue, Jan 20, 2015 at 05:28:38PM +, Larry Masinter wrote: Image formats like JPEG and PNG (for which there is support for XMP) don't have a standard, uniform way of attaching other files, though, so allowing data (or a pointer to external data) in the XMP would broaden

Re: linked open data and PDF

2015-01-23 Thread Paul Houle
I don't think that is true; I see this example in the XMP spec rdf:RDF xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#; xmlns:xmp=http://ns.adobe.com/xap/1.0/; rdf:Description rdf:about= xmp:BaseURL rdf:resource=http://www.adobe.com// /rdf:Description /rdf:RDF Isn't this the same as

Re: linked open data and PDF

2015-01-22 Thread Alexander Garcia Castro
ledger. Not for a minute do I think this was a good thing, and in any case no fault of GRDDL. On Wed, 1/21/15, Paul Tyson phty...@sbcglobal.net wrote: Subject: Re: linked open data and PDF To: Norman Gray nor...@astro.gla.ac.uk Cc: Paul Houle

Re: linked open data and PDF

2015-01-22 Thread Gannon Dick
. On Wed, 1/21/15, Paul Tyson phty...@sbcglobal.net wrote: Subject: Re: linked open data and PDF To: Norman Gray nor...@astro.gla.ac.uk Cc: Paul Houle ontolo...@gmail.com, Herbert Van de Sompel hvds...@gmail.com, jschnei...@pobox.com jschnei...@pobox.com, public-lod@w3.org public

RE: linked open data and PDF

2015-01-21 Thread Larry Masinter
Paul Houle ontolo...@gmail.com: You should be able to pipe the InputStream that comes out of a PDF filew/ PDFBox into Jena or some other RDF toolset. Hopefully. If not, you might be better off using one of the XMP toolkits; I'm not sure. A much more challenging issue is developing a

Re: linked open data and PDF

2015-01-21 Thread Norman Gray
Paul and Rod, hello. On 2015 Jan 21, at 16:32, Paul Houle ontolo...@gmail.com wrote: I think the world needs a survey of XMP metadata in the field. Only by inspection of a large set of diverse files can we say how good or bad the situation actually is. Rod's link at

Re: linked open data and PDF

2015-01-21 Thread Norman Gray
Greetings. On 2015 Jan 20, at 14:42, Herbert Van de Sompel hvds...@gmail.com wrote: Larry How about HTTP Link headers (RFC 5988) to convey links and metadata expressed as links when serving PDFs? I can imagine an authoring tool embedding the info in XMP. But I have a harder time

RE: linked open data and PDF

2015-01-21 Thread Larry Masinter
I want to back up a bit from the question people normally ask: “I have a PDF. How do I get the data out?” to an earlier stage “I have a PDF I got somehow, and I also have some data I think the PDF shows. What can I do to make sure the data is easy to get out of (a new version of) the PDF?”

Re: linked open data and PDF

2015-01-21 Thread Paul Houle
I think the world needs a survey of XMP metadata in the field. Only by inspection of a large set of diverse files can we say how good or bad the situation actually is. There ought to be a tool that gives XMP-annotated documents a point score for metadata quality; you ought to get a

Re: linked open data and PDF

2015-01-21 Thread Paul Tyson
On Wed, 2015-01-21 at 17:16 +, Norman Gray wrote: (also it's not even really about XMP; there are all sorts of ways of scraping metadata out of objects and turning it into something which an RDF parser can read, and from that point you can start being imaginative. This is of course

Re: linked open data and PDF

2015-01-21 Thread Paul Houle
the data in RDF/XML; I can't imagine it is hard to get this data out with third party tools in most cases. On Mon, Jan 19, 2015 at 2:36 PM, Larry Masinter masin...@adobe.com wrote: I just joined this list. I’m looking to help improve the story for Linked Open Data in PDF, to lift PDF

Re: linked open data and PDF

2015-01-21 Thread Sarven Capadisli
On 2015-01-20 18:28, Larry Masinter wrote: There's some background that you might find helpful in the discussion. PDF is now defined by ISO 32000. PDF has profiles, including PDF/A-3 http://www.digitalpreservation.gov/formats/fdd/fdd000360.shtml ISO 19005-3. PDF/A-3 defines how to add arbitrary

Re: linked open data and PDF

2015-01-20 Thread Michael Brunnbauer
Hello Martynas, On Tue, Jan 20, 2015 at 02:34:23AM +0100, Martynas Jusevi??ius wrote: IMO mixing RDF/XML with JSON doesn't make sense. This is not RDF/XML. This is XMP. The last time I looked, XMP was a ridiculously crippled version of RDF/XML:

RE: linked open data and PDF

2015-01-20 Thread Jodi Schneider
From: Herbert Van de Sompel hvds...@gmail.com Date: Mon, 19 Jan 2015 18:10:45 -0700 Message-Id: 464e492a-1394-4a59-9c1d-520f9cb63...@gmail.com Cc: public-lod@w3.org public-lod@w3.org To: Larry Masinter masin...@adobe.com Larry How about HTTP Link headers (RFC 5988) to convey links and

Re: linked open data and PDF

2015-01-20 Thread Martynas Jusevičius
Thanks for clarifying that Michael. Well, then I give up :) XMP looks ridiculous indeed. If normal RDF/XML was allowed, it could be actually useful. On Jan 20, 2015 9:28 AM, Michael Brunnbauer bru...@netestate.de wrote: Hello Martynas, On Tue, Jan 20, 2015 at 02:34:23AM +0100, Martynas

Re: linked open data and PDF

2015-01-20 Thread Herbert Van de Sompel
On Jan 20, 2015, at 3:05, Jodi Schneider jschnei...@pobox.com wrote: From: Herbert Van de Sompel hvds...@gmail.com Date: Mon, 19 Jan 2015 18:10:45 -0700 Message-Id: 464e492a-1394-4a59-9c1d-520f9cb63...@gmail.com Cc: public-lod@w3.org public-lod@w3.org To: Larry Masinter masin...@adobe.com

RE: linked open data and PDF

2015-01-20 Thread Larry Masinter
There's some background that you might find helpful in the discussion. PDF is now defined by ISO 32000. PDF has profiles, including PDF/A-3 http://www.digitalpreservation.gov/formats/fdd/fdd000360.shtml ISO 19005-3. PDF/A-3 defines how to add arbitrary file attachments to PDF. XMP

Re: linked open data and PDF

2015-01-20 Thread Paul Houle
it will be for the ecosystem. On Mon, Jan 19, 2015 at 5:20 PM, Kingsley Idehen kide...@openlinksw.com wrote: On 1/19/15 2:36 PM, Larry Masinter wrote: I just joined this list. I’m looking to help improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps

Re: linked open data and PDF

2015-01-20 Thread Kingsley Idehen
: I just joined this list. I’m looking to help improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps using XMP. I’ve found a few hints in the mailing list archive here. http://lists.w3.org/Archives/Public/public

Re: linked open data and PDF

2015-01-19 Thread Alfredo Serafini
imagine it is hard to get this data out with third party tools in most cases. On Mon, Jan 19, 2015 at 2:36 PM, Larry Masinter masin...@adobe.com wrote: I just joined this list. I’m looking to help improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one

RE: linked open data and PDF

2015-01-19 Thread Larry Masinter
I made a little example PDF based on the example at http://5stardata.info/ where document metadata is in the XMP itself, but document data is just a string value (this example uses JSON). The same data from http://5stardata.info/gtd-5.html should be available in the attached, starting from

Re: linked open data and PDF

2015-01-19 Thread Martynas Jusevičius
Masinter masin...@adobe.com wrote: I just joined this list. I’m looking to help improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps using XMP. I’ve found a few hints in the mailing list archive here. http://lists.w3.org/Archives/Public

Re: linked open data and PDF

2015-01-19 Thread Alfredo Serafini
cases. On Mon, Jan 19, 2015 at 2:36 PM, Larry Masinter masin...@adobe.com wrote: I just joined this list. I’m looking to help improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps using XMP. I’ve found a few hints in the mailing

Re: linked open data and PDF

2015-01-19 Thread Herbert Van de Sompel
improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps using XMP. I’ve found a few hints in the mailing list archive here. http://lists.w3.org/Archives/Public/public-lod/2014Oct/0169.html but I’m still looking. Any clues, problem statements

Re: linked open data and PDF

2015-01-19 Thread Martynas Jusevičius
improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps using XMP. I’ve found a few hints in the mailing list archive here. http://lists.w3.org/Archives/Public/public-lod/2014Oct/0169.html but I’m still looking. Any clues, problem statements

Re: linked open data and PDF

2015-01-19 Thread Martynas Jusevičius
Larry, IMO mixing RDF/XML with JSON doesn't make sense. Why not keep it RDF/XML? Like this (not tested): x:xmpmeta xmlns:x=adobe:ns:meta/ rdf:RDF xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#; rdf:Description rdf:about= xmlns:pdf=http://ns.adobe.com/pdf/1.3/;

Re: linked open data and PDF

2015-01-19 Thread Paul Houle
Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps using XMP. I’ve found a few hints in the mailing list archive here. http://lists.w3.org/Archives/Public/public-lod/2014Oct/0169.html but I’m still looking. Any clues, problem statements, sample sites? Larry

linked open data and PDF

2015-01-19 Thread Larry Masinter
I just joined this list. I’m looking to help improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps using XMP. I’ve found a few hints in the mailing list archive here. http://lists.w3.org/Archives/Public/public-lod/2014Oct/0169.html but I’m

Re: linked open data and PDF

2015-01-19 Thread Sarven Capadisli
On 2015-01-19 21:20, Sarven Capadisli wrote: Here is another paper: http://linked-reseach.270a.info/ Typo: http://linked-research.270a.info/ :)

Re: linked open data and PDF

2015-01-19 Thread Sarven Capadisli
On 2015-01-19 20:36, Larry Masinter wrote: I just joined this list. I’m looking to help improve the story for Linked Open Data in PDF, to lift PDF (and other formats) from one-star to five, perhaps using XMP. I’ve found a few hints in the mailing list archive here. http://lists.w3.org/Archives