Linked data in packaged content (ePub)
I'm interested in putting linked data into eBooks published in the (open standard) ePub format (http://www.openebook.org/ ). The format is essentially a relocatable zip file of XHTML, associated media files and a few metadata files. The target platforms of this content impose some restrictions on what is practical: e-ink devices (which are the only current eBook readers with the battery life to last an entire novel) typically don't have an internet connection (thus no resolving of links) and have very little in the way of processing power (thus no full reasoning). We already have some data-interlinking between our collection (http://www.nzetc.org/ ) and librarything (http://www.librarything.com/ ) at the FRBR work level (http://vocab.org/frbr/core.html#Work ) and also some links to wikipedia / dbpedia for named entities (principally authors and places). We believe we have quite good authority control over author names, even those who published under multiple names (see, for example http://www.nzetc.org/tm/scholarly/name-208662.html or http://www.nzetc.org/tm/scholarly/name-208310.html ). We have ~1300 ePubs, the largest of which exceed the size limits of most ePub tools. Does anyone know of any other attempts to put linked data into packages like this? There are two main issues I can see: (a) how to self-identify the package (naive hashing doesn't work, as some eBook readers open the package and add custom metadata) and (b) how to package the linked data to get maximal use when a paucity of CPU precludes a full reasoner. The traditional identifier used in this field, the ISBN, is essentially a print-run identifier, and not of a whole lot of obvious use to us since: (a) most of our books' original publishing predates ISBNs and (b) our digital republishing of them doesn't qualify for an ISBN according to our local ISBN issuer (the National Library of New Zealand). cheers stuart
Re: Linked data in packaged content (ePub)
2010-04-27 22:40 Stuart A. Yeates syea...@gmail.com: I'm interested in putting linked data into eBooks published in the (open standard) ePub format (http://www.openebook.org/ ). The format is essentially a relocatable zip file of XHTML, associated media files and a few metadata files. ... Does anyone know of any other attempts to put linked data into packages like this? The mere embedding is a current topic of interest of the RDFa WG (see http://lists.w3.org/Archives/Public/public-rdfa-wg/2010Mar/0200.html), and I suppose they will be quite interested in the further implications you mentioned. Cheers, Christoph -- Christoph Lange, Jacobs Univ. Bremen, http://kwarc.info/clange, Skype duke4701
Re: Linked data in packaged content (ePub)
On Tue, Apr 27, 2010 at 4:40 PM, Stuart A. Yeates syea...@gmail.com wrote: Does anyone know of any other attempts to put linked data into packages like this? While arguably not Linked Data per-se, you might be interested in work being done on the Open Publication Distribution System (OPDS) [1], which aims to use Atom for making metadata about ebooks available. A key part of an OPDS feed are opds:acquisition links between an atom:entry and a epub document identified with a URI and a media type, for example: link type=application/epub+zip href=http://www.feedbooks.com/book/4440.epub; rel=http://opds-spec.org/acquisition/ Implementors include people at O'Reilly, Internet Archive, Ibis Reader, FeedBooks, to name a few. Much of the work is actually going on in open bi-weekly conference calls, and on a discussion list [3] if you are interested. On a recent call Hadrien Gardeur of Feedbooks was talking about embedding opds atom documents in the ebook serializations, so it might be worthwhile pinging him and/or the discussion list to see where things are at. //Ed [1] http://code.google.com/p/openpub/wiki/CatalogSpecDraft [2] http://ibisreader.com/ [3] http://groups.google.com/group/openpub
Re: [dady] Dataset Dynamics meet-up at WWW2010
Sandro, (funny, I posted this last Friday and it showed up only yesterday ...) I think for the use cases I have in mind (large scale, ad hoc, real-time mirroring of RDF), a key requirement is constant time (per triple) to apply deltas, including maintaining a secure hash. I was happy to see I could (I think) meet this with some tweaking of blank nodes. I sketched both a delta format (gruf) and a subscription protocol (websub), which are separate. Rough specs are here: http://websub.org/wiki/GRUF http://websub.org/wiki/Spec This looks great, thanks! It would be interesting to learn more about the status of this project (are implementations available, etc.) and your plans with it. Indeed, as I recently wrote [1], there are plenty of approaches and proposals out there and I think the time is ripe to sit together, get more practical experience in implementing and deploying stuff. I hope you can make it to the Dataset Dynamics meeting [2] and introduce your proposal to the community. I'll sync with Juergen to catch up. Cheers, Michael [1] http://blog.semantic-web.at/2010/04/26/a-dynamic-web-of-data/ [2] http://esw.w3.org/DatasetDynamics/Meetings -- Dr. Michael Hausenblas LiDRC - Linked Data Research Centre DERI - Digital Enterprise Research Institute NUIG - National University of Ireland, Galway Ireland, Europe Tel. +353 91 495730 http://linkeddata.deri.ie/ http://sw-app.org/about.html From: Sandro Hawke san...@w3.org Reply-To: dady dataset-dynam...@googlegroups.com Date: Tue, 27 Apr 2010 14:53:39 -0400 To: dady dataset-dynam...@googlegroups.com Cc: Linked Data community public-lod@w3.org, Jürgen Umbrich juergen.umbr...@deri.org Subject: Re: [dady] Dataset Dynamics meet-up at WWW2010 Changes in Linked Data sources (dataset dynamics) and how to deal with it are an important and emerging issue [1][2]. As already mentioned, there will be a dataset dynamics meet-up at WWW2010. I first planned to register a BoF, but it is still unclear if this is possible [3]. I'd now propose to have a break-out session at the W3C LOD Track [4] - as I won't be able to make it to WWW, Juergen Umbrich (in CC) would take care of the local organisation. Sounds like a good idea, although I don't know what else might be on the menu. So, I just learned of the dataset dynamics term and community last week (from Michael), but I've been thinking about this for many years. I got inspired and sketched out a design a few months ago, which I think is pretty good. I've been hoping to return to it some day soon, but if we're talking about this Thursday, I might was well share the drafts now. I think for the use cases I have in mind (large scale, ad hoc, real-time mirroring of RDF), a key requirement is constant time (per triple) to apply deltas, including maintaining a secure hash. I was happy to see I could (I think) meet this with some tweaking of blank nodes. I sketched both a delta format (gruf) and a subscription protocol (websub), which are separate. Rough specs are here: http://websub.org/wiki/GRUF http://websub.org/wiki/Spec -- Sandro Cheers, Michael [1] http://esw.w3.org/DatasetDynamics [2] http://data-gov.tw.rpi.edu/wiki/TWC_Data-gov_Vocabulary_Proposal#Change_of_d ataset [3] http://twitter.com/WWW2010/status/12452736301 [4] http://esw.w3.org/Camps:LODCampW3CTrack#breakout -- Dr. Michael Hausenblas LiDRC - Linked Data Research Centre DERI - Digital Enterprise Research Institute NUIG - National University of Ireland, Galway Ireland, Europe Tel. +353 91 495730 http://linkeddata.deri.ie/ http://sw-app.org/about.html -- Subscription settings: http://groups.google.com/group/dataset-dynamics/subscr ibe?hl=en
www.shopforia.com exposes GoodRelations in RDFa for 104,000 items / 5 mio. triples
Dear all: I am happy to announce one of many new major sources of GoodRelations data: http://www.shopforia.com/ recently added GoodRelations in RDFa to their 104,000 items pages in 26 categories: Example: http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=B003DRBA3S Sitemap: There seems to be no sitemap for the site, but you can reconstruct it by 1. crawling http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?SearchIndex=CategoryItemPage=n with n = 1...400 for Category = Apparel, Beauty etc. and 2. extracting all item links, which follow the pattern http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=ASIN, with ASIN = e.g. B0009RL86E Many items have EAN/UPC codes, which allows for powerful linking with product model data (features etc.). I guess that the total amount of triples will be in the order of magnitude of 50 triples per page x 104,000 pages = 5,200,000 triples. Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009
RE: www.shopforia.com exposes GoodRelations in RDFa for 104,000 items / 5 mio. triples
Looking at the example source, I'm curious what's going on here: div class=description about=http://www.shopforia.com/; typeof=owl:Ontology div rel=owl:imports resource=http://www.w3.org/2006/vcard/ns;/div div rel=owl:imports resource=http://purl.org/goodrelations/v1;/div Is this considered good practise? I can't see the benefit this might give to any agent making use of the triples. -Original Message- From: public-lod-requ...@w3.org [mailto:public-lod-requ...@w3.org] On Behalf Of Martin Hepp (UniBW) Sent: 28 April 2010 11:07 To: public-lod@w3.org Subject: www.shopforia.com exposes GoodRelations in RDFa for 104,000 items / 5 mio. triples Dear all: I am happy to announce one of many new major sources of GoodRelations data: http://www.shopforia.com/ recently added GoodRelations in RDFa to their 104,000 items pages in 26 categories: Example: http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=B003DRBA3S Sitemap: There seems to be no sitemap for the site, but you can reconstruct it by 1. crawling http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?SearchIndex=CategoryItemPage=n with n = 1...400 for Category = Apparel, Beauty etc. and 2. extracting all item links, which follow the pattern http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=ASIN, with ASIN = e.g. B0009RL86E Many items have EAN/UPC codes, which allows for powerful linking with product model data (features etc.). I guess that the total amount of triples will be in the order of magnitude of 50 triples per page x 104,000 pages = 5,200,000 triples. Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009 This email and any attachments may contain confidential or privileged information and is intended for the addressee only. If you are not the intended recipient, please immediately notify us by email or telephone and delete the original email and attachments without using, disseminating or reproducing its contents to anyone other than the intended recipient. Wolters Kluwer shall not be liable for the incorrect or incomplete transmission of of this email or any attachments, nor for unauthorized use by its employees. Wolters Kluwer nv has its registered address in Alphen aan den Rijn, The Netherlands, and is registered with the Trade Registry of the Dutch Chamber of Commerce under number 33202517.
Re: Linked data in packaged content (ePub)
Stuart, t's not clear to me what you're trying to accomplish...For whom are you trying to add value? Since ePub is XHTML, it makes sense to embed metadata as RDFa. But why? Is the purpose to enhance the reading experience? Or perhaps the local collection management experience? Publishers should be, and most likely are, obtaining DOIs http://doi.org for their materials through a registration agency such as CrossRef http://crossref.org. As we have discussed elsewhere (and for more than a decade...) the DOI enables multiple stakeholders to manage and publish metadata about the item; linked data best practices are a promising approach , but RDFa on an RA's landing page for the item is also a possibility. The DOI is not about embedding metadata in individual instances, however, which is what you seem to be asking about. Are you imagining creating some kind of meshup within the reading experience, perhaps meshing metadata and links bound to entities within the ePub'd document with external linked data? John On Tue, Apr 27, 2010 at 4:40 PM, Stuart A. Yeates syea...@gmail.com wrote: I'm interested in putting linked data into eBooks published in the (open standard) ePub format (http://www.openebook.org/ ). The format is essentially a relocatable zip file of XHTML, associated media files and a few metadata files. The target platforms of this content impose some restrictions on what is practical: e-ink devices (which are the only current eBook readers with the battery life to last an entire novel) typically don't have an internet connection (thus no resolving of links) and have very little in the way of processing power (thus no full reasoning). We already have some data-interlinking between our collection (http://www.nzetc.org/ ) and librarything (http://www.librarything.com/ ) at the FRBR work level (http://vocab.org/frbr/core.html#Work ) and also some links to wikipedia / dbpedia for named entities (principally authors and places). We believe we have quite good authority control over author names, even those who published under multiple names (see, for example http://www.nzetc.org/tm/scholarly/name-208662.html or http://www.nzetc.org/tm/scholarly/name-208310.html ). We have ~1300 ePubs, the largest of which exceed the size limits of most ePub tools. Does anyone know of any other attempts to put linked data into packages like this? There are two main issues I can see: (a) how to self-identify the package (naive hashing doesn't work, as some eBook readers open the package and add custom metadata) and (b) how to package the linked data to get maximal use when a paucity of CPU precludes a full reasoner. The traditional identifier used in this field, the ISBN, is essentially a print-run identifier, and not of a whole lot of obvious use to us since: (a) most of our books' original publishing predates ISBNs and (b) our digital republishing of them doesn't qualify for an ISBN according to our local ISBN issuer (the National Library of New Zealand). cheers stuart -- John S. Erickson, Ph.D. http://bitwacker.wordpress.com olyerick...@gmail.com Twitter: @olyerickson
Re: www.shopforia.com exposes GoodRelations in RDFa for 104,000 items / 5 mio. triples
Hi Constatine, Yes, I consider that good practice. I can't see the benefit this might give to any agent making use of the triples. owl:imports means that the model shall include all axioms from the imported ontologies. Often (but not always), the same effect can be achieved by dereferencing the URIs of all conceptual elements mentioned in a document. I know that some people in the LOD community don't think owl:imports is necessary, but I do. There have been lengthy discussions about that, but basically RDFa is just one of many syntactical variants for RDF, so there is no strong argument for arbitrarily omitting anything. BTW, I just found that there are some other quirks in the markup. We already contacted the site owner and hope he will fix them shortly. Best Martin On 28.04.10 12:57, Hondros, Constantine wrote: Looking at the example source, I'm curious what's going on here: div class=description about=http://www.shopforia.com/; typeof=owl:Ontology div rel=owl:imports resource=http://www.w3.org/2006/vcard/ns;/div div rel=owl:imports resource=http://purl.org/goodrelations/v1;/div Is this considered good practise? I can't see the benefit this might give to any agent making use of the triples. -Original Message- From: public-lod-requ...@w3.org [mailto:public-lod-requ...@w3.org] On Behalf Of Martin Hepp (UniBW) Sent: 28 April 2010 11:07 To: public-lod@w3.org Subject: www.shopforia.com exposes GoodRelations in RDFa for 104,000 items / 5 mio. triples Dear all: I am happy to announce one of many new major sources of GoodRelations data: http://www.shopforia.com/ recently added GoodRelations in RDFa to their 104,000 items pages in 26 categories: Example: http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=B003DRBA3S Sitemap: There seems to be no sitemap for the site, but you can reconstruct it by 1. crawling http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?SearchIndex=CategoryItemPage=n with n = 1...400 for Category = Apparel, Beauty etc. and 2. extracting all item links, which follow the pattern http://www.shopforia.com/cgi-bin/apf4/apf4.cgi?Operation=ItemLookupItemId=ASIN, with ASIN = e.g. B0009RL86E Many items have EAN/UPC codes, which allows for powerful linking with product model data (features etc.). I guess that the total amount of triples will be in the order of magnitude of 50 triples per page x 104,000 pages = 5,200,000 triples. Best wishes Martin Hepp -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/ (personal) skype: mfhepp twitter: mfhepp Check out GoodRelations for E-Commerce on the Web of Linked Data! = Project page: http://purl.org/goodrelations/ Resources for developers: http://www.ebusiness-unibw.org/wiki/GoodRelations Webcasts: Overview - http://www.heppnetz.de/projects/goodrelations/webcast/ How-to - http://vimeo.com/7583816 Recipe for Yahoo SearchMonkey: http://www.ebusiness-unibw.org/wiki/GoodRelations_and_Yahoo_SearchMonkey Talk at the Semantic Technology Conference 2009: Semantic Web-based E-Commerce: The GoodRelations Ontology http://www.slideshare.net/mhepp/semantic-webbased-ecommerce-the-goodrelations-ontology-1535287 Overview article on Semantic Universe: http://www.semanticuniverse.com/articles-semantic-web-based-e-commerce-webmasters-get-ready.html Tutorial materials: ISWC 2009 Tutorial: The Web of Data for E-Commerce in Brief: A Hands-on Introduction to the GoodRelations Ontology, RDFa, and Yahoo! SearchMonkey http://www.ebusiness-unibw.org/wiki/Web_of_Data_for_E-Commerce_Tutorial_ISWC2009 This email and any attachments may contain confidential or privileged information and is intended for the addressee only. If you are not the intended recipient, please immediately notify us by email or telephone and delete the original email and attachments without using, disseminating or reproducing its contents to anyone other than the intended recipient. Wolters Kluwer shall not be liable for the incorrect or incomplete transmission of of this email or any attachments, nor for unauthorized use by its employees. Wolters Kluwer nv has its registered address in Alphen aan den Rijn, The Netherlands, and is registered with the Trade Registry of the Dutch Chamber of Commerce under number 33202517. -- -- martin hepp e-business web science research group universitaet der bundeswehr muenchen e-mail: h...@ebusiness-unibw.org phone: +49-(0)89-6004-4217 fax: +49-(0)89-6004-4620 www: http://www.unibw.de/ebusiness/ (group) http://www.heppnetz.de/
linking facebook data
I just built a demo that provides dereferenable HTTP URIs (with RDF/XML data) for Facebook data using data retrieved from the recently announced Graph API by Facebook. see http://sam.tw.rpi.edu/ws/face_lod.html In the demo, I observed inconsistent term usage between the Facebook data API (JSON) and open graph protocol vocabulary. There is also some good potential to get the Facebook terms mapped to FOAF and DCterms terms. Please see my blog at http://tw.rpi.edu/weblog/2010/04/28/putting-open-facebook-data-into-linked-data-cloud/ . Comments are welcome. best, -- Li Ding http://www.cs.rpi.edu/~dingl/
Re: Linked data in packaged content (ePub)
On Wed, Apr 28, 2010 at 11:36 PM, John Erickson olyerick...@gmail.com wrote: Stuart, t's not clear to me what you're trying to accomplish...For whom are you trying to add value? We are funded to digitise teaching, learning and research materials for our staff and students. Value to anyone else is incidental, but indicative. Are you imagining creating some kind of meshup within the reading experience, perhaps meshing metadata and links bound to entities within the ePub'd document with external linked data? Ideally, I'd like a protocol such as Open URL ( http://en.wikipedia.org/wiki/Open_URL ), linking books on the device up to the bibliographies of other books that also happen to be on the device. For low CPU devices the links might have to be pre-calculated when connected to a desktop PC. I understand that Open URL can't actaully do this because it assumes the web. cheers stuart
Re: Linked data in packaged content (ePub)
On Wed, Apr 28, 2010 at 2:39 PM, Stuart A. Yeates syea...@gmail.com wrote: Ideally, I'd like a protocol such as Open URL (http://en.wikipedia.org/wiki/Open_URL ), linking books on the device up to the bibliographies of other books that also happen to be on the device. For low CPU devices the links might have to be pre-calculated when connected to a desktop PC. I understand that Open URL can't actually do this because it assumes the web. So the zeroth answer is to only provide such behavior under only certain conditions, for example when explicit DOIs for the items in the bibliographic record are provided and the other items on the device have DOIs...this would allow for an easy registration when books were put on the device. The browser of course would have to know to intervene, however. To actually do what OpenURL enables --- embed citation data ala COinS http://ocoins.info/, use that as the basis for constructing OpenURL references, and resolve those on the device --- would be a stretch on a disconnected device, but not impossible. The citation metadata for each installed ePub would have to be indexed, and then a proxy on the device would have to check this before attempting to resolve off-device. This sort of resolution doesn't require any advanced reasoning, although it still might be intense. Is this what you were thinking? -- John S. Erickson, Ph.D. http://bitwacker.wordpress.com olyerick...@gmail.com Twitter: @olyerickson
Re: linking facebook data
Hello, Am cc'ing the foaf-dev mailing (sorry for cross posting)... I just had a look at your fb graph API - foaf rdf service[1], firstly cool stuff, but I have a few points I will address below. I recall Matthew Rowe[2] making a similar service a few years ago which spat out foaf data for a user's fb account, and I recall fb getting annoyed. Am guessing they mentally might have shifted since danbri's good work in getting them involve with SW tech (great work once again by danbri ... *tonnes of applause), I guess we will find out soon ... My point of reference is my facebook account, and the URI you mint of your service looks like so : http://sam.tw.rpi.edu/data/face/resource/mischat#me Points: 1. I would be nice if the person URI you mint, was explicitly a foaf:Person, e.g. : http://sam.tw.rpi.edu/data/face/resource/mischat#me a foaf:Person . 2. The document URI could have a foaf:primaryTopic triple in there such as: http://sam.tw.rpi.edu/data/face/resource/mischat foaf:primaryTopic http://sam.tw.rpi.edu/data/face/resource/mischat#me . Which in turn identifies which foaf:Person in the RDF document is a main (wo)man being defined. This would be useful in the case where the given user's friends are also listed in the document. 3. Ideally the document URI would be a foaf:PersonalProfileDocument such as : http://sam.tw.rpi.edu/data/face/resource/mischat a foaf:PersonalProfileDocument . 4. Why didn't you make use of foaf:givenname and foaf:familyname which are spec'ed out in the foaf ontology[3], not a biggy but reuse is said to be the right thing™ todo :) 5. You could add an xsd:dateTime to the dcterms:created triple, as apposed to exposing the data as a plain text literal. http://sam.tw.rpi.edu/data/face/resource/mischat dcterms:created 2010-04-27T12:42:11Z^^xsd:dateTime . 6. If i am not mistaken you could also get your hands on my fb avatar URI via the calls you make, that would be nice to see in your rdf. But yeah, good work :) Mischa *is happy that hardly any of my fb data is accessible via the fb graph API :) [1] http://sam.tw.rpi.edu/ws/face_lod.html [2] mailto:m.r...@dcs.shef.ac.uk [3] http://xmlns.com/foaf/spec/ On 28 Apr 2010, at 15:02, Li Ding wrote: I just built a demo that provides dereferenable HTTP URIs (with RDF/XML data) for Facebook data using data retrieved from the recently announced Graph API by Facebook. see http://sam.tw.rpi.edu/ws/face_lod.html In the demo, I observed inconsistent term usage between the Facebook data API (JSON) and open graph protocol vocabulary. There is also some good potential to get the Facebook terms mapped to FOAF and DCterms terms. Please see my blog at http://tw.rpi.edu/weblog/2010/04/28/putting-open-facebook-data-into-linked-data-cloud/ . Comments are welcome. best, -- Li Ding http://www.cs.rpi.edu/~dingl/
Re: linking facebook data
Hi Li, I have just had another look at another user which isn't me (school boy error on my behalf ;) http://sam.tw.rpi.edu/data/face/resource/borisjohnson And I think I found a bug you in your implementation : the above rdf document has a dob of which is set to : -1/16/-001 Which doesn't seem correct to me, perhaps a bug in your code? I looked at boris's public facebook page and, didn't see his dob, http://www.facebook.com/borisjohnson#!/borisjohnson?v=info but after consulting wikipedia it should be 19th June 1964 Anyways, just another pointer. Regards, Mischa On 28 Apr 2010, at 21:13, Mischa Tuffield wrote: Hello, Am cc'ing the foaf-dev mailing (sorry for cross posting)... I just had a look at your fb graph API - foaf rdf service[1], firstly cool stuff, but I have a few points I will address below. I recall Matthew Rowe[2] making a similar service a few years ago which spat out foaf data for a user's fb account, and I recall fb getting annoyed. Am guessing they mentally might have shifted since danbri's good work in getting them involve with SW tech (great work once again by danbri ... *tonnes of applause), I guess we will find out soon ... My point of reference is my facebook account, and the URI you mint of your service looks like so : http://sam.tw.rpi.edu/data/face/resource/mischat#me Points: 1. I would be nice if the person URI you mint, was explicitly a foaf:Person, e.g. : http://sam.tw.rpi.edu/data/face/resource/mischat#me a foaf:Person . 2. The document URI could have a foaf:primaryTopic triple in there such as: http://sam.tw.rpi.edu/data/face/resource/mischat foaf:primaryTopic http://sam.tw.rpi.edu/data/face/resource/mischat#me . Which in turn identifies which foaf:Person in the RDF document is a main (wo)man being defined. This would be useful in the case where the given user's friends are also listed in the document. 3. Ideally the document URI would be a foaf:PersonalProfileDocument such as : http://sam.tw.rpi.edu/data/face/resource/mischat a foaf:PersonalProfileDocument . 4. Why didn't you make use of foaf:givenname and foaf:familyname which are spec'ed out in the foaf ontology[3], not a biggy but reuse is said to be the right thing™ todo :) 5. You could add an xsd:dateTime to the dcterms:created triple, as apposed to exposing the data as a plain text literal. http://sam.tw.rpi.edu/data/face/resource/mischat dcterms:created 2010-04-27T12:42:11Z^^xsd:dateTime . 6. If i am not mistaken you could also get your hands on my fb avatar URI via the calls you make, that would be nice to see in your rdf. But yeah, good work :) Mischa *is happy that hardly any of my fb data is accessible via the fb graph API :) [1] http://sam.tw.rpi.edu/ws/face_lod.html [2] mailto:m.r...@dcs.shef.ac.uk [3] http://xmlns.com/foaf/spec/ On 28 Apr 2010, at 15:02, Li Ding wrote: I just built a demo that provides dereferenable HTTP URIs (with RDF/XML data) for Facebook data using data retrieved from the recently announced Graph API by Facebook. see http://sam.tw.rpi.edu/ws/face_lod.html In the demo, I observed inconsistent term usage between the Facebook data API (JSON) and open graph protocol vocabulary. There is also some good potential to get the Facebook terms mapped to FOAF and DCterms terms. Please see my blog at http://tw.rpi.edu/weblog/2010/04/28/putting-open-facebook-data-into-linked-data-cloud/ . Comments are welcome. best, -- Li Ding http://www.cs.rpi.edu/~dingl/
Re: linking facebook data
Related: http://dev.intellidimension.com/oger/ open graph - rdf - sparql and explored via a silverlight front app; worth a look. Best, Nathan Mischa Tuffield wrote: Hi Li, I have just had another look at another user which isn't me (school boy error on my behalf ;) http://sam.tw.rpi.edu/data/face/resource/borisjohnson And I think I found a bug you in your implementation : the above rdf document has a dob of which is set to : -1/16/-001 Which doesn't seem correct to me, perhaps a bug in your code? I looked at boris's public facebook page and, didn't see his dob, http://www.facebook.com/borisjohnson#!/borisjohnson?v=info but after consulting wikipedia it should be 19th June 1964 Anyways, just another pointer. Regards, Mischa On 28 Apr 2010, at 21:13, Mischa Tuffield wrote: Hello, Am cc'ing the foaf-dev mailing (sorry for cross posting)... I just had a look at your fb graph API - foaf rdf service[1], firstly cool stuff, but I have a few points I will address below. I recall Matthew Rowe[2] making a similar service a few years ago which spat out foaf data for a user's fb account, and I recall fb getting annoyed. Am guessing they mentally might have shifted since danbri's good work in getting them involve with SW tech (great work once again by danbri ... *tonnes of applause), I guess we will find out soon ... My point of reference is my facebook account, and the URI you mint of your service looks like so : http://sam.tw.rpi.edu/data/face/resource/mischat#me Points: 1. I would be nice if the person URI you mint, was explicitly a foaf:Person, e.g. : http://sam.tw.rpi.edu/data/face/resource/mischat#me a foaf:Person . 2. The document URI could have a foaf:primaryTopic triple in there such as: http://sam.tw.rpi.edu/data/face/resource/mischat foaf:primaryTopic http://sam.tw.rpi.edu/data/face/resource/mischat#me . Which in turn identifies which foaf:Person in the RDF document is a main (wo)man being defined. This would be useful in the case where the given user's friends are also listed in the document. 3. Ideally the document URI would be a foaf:PersonalProfileDocument such as : http://sam.tw.rpi.edu/data/face/resource/mischat a foaf:PersonalProfileDocument . 4. Why didn't you make use of foaf:givenname and foaf:familyname which are spec'ed out in the foaf ontology[3], not a biggy but reuse is said to be the right thing™ todo :) 5. You could add an xsd:dateTime to the dcterms:created triple, as apposed to exposing the data as a plain text literal. http://sam.tw.rpi.edu/data/face/resource/mischat dcterms:created 2010-04-27T12:42:11Z^^xsd:dateTime . 6. If i am not mistaken you could also get your hands on my fb avatar URI via the calls you make, that would be nice to see in your rdf. But yeah, good work :) Mischa *is happy that hardly any of my fb data is accessible via the fb graph API :) [1] http://sam.tw.rpi.edu/ws/face_lod.html [2] mailto:m.r...@dcs.shef.ac.uk [3] http://xmlns.com/foaf/spec/ On 28 Apr 2010, at 15:02, Li Ding wrote: I just built a demo that provides dereferenable HTTP URIs (with RDF/XML data) for Facebook data using data retrieved from the recently announced Graph API by Facebook. see http://sam.tw.rpi.edu/ws/face_lod.html In the demo, I observed inconsistent term usage between the Facebook data API (JSON) and open graph protocol vocabulary. There is also some good potential to get the Facebook terms mapped to FOAF and DCterms terms. Please see my blog at http://tw.rpi.edu/weblog/2010/04/28/putting-open-facebook-data-into-linked-data-cloud/ . Comments are welcome. best, -- Li Ding http://www.cs.rpi.edu/~dingl/