Re: See Other

Jesse Weaver Wed, 28 Mar 2012 13:30:40 -0700

Hi Hugh.

I have avoided participating in these httpRange-14 debates, but sinceyou have brought the Facebook Linked Data into the discussion, I feelcompelled to respond. The goal (or my goal) regarding Facebook'sLinked Data provided through its Graph API was to allow for sensibleLinked Data RDF to be published in a way that did not interfere withmaintenance of existing code and in a way that would require verylittle maintenance in the future. Please see my inline commentsbelow, and also some comments at the end.


On Mar 28, 2012, at 6:44 AM, Hugh Glaser wrote:

Executive summary:
TAG, please don't come back with something that does not allow, oreven encourage, sites like Facebook to offer RDF back in return for:
curl -L -H Accept:application/rdf+xml https://www.facebook.com/hugh.glaser
Challenge: Try telling me what to put in sameAs.org for the LD URIfor you on Facebook.
Detail:
I support Jeni et al.'s Proposal, because it is an improvement, andseems to have some chance of success.
Actually, I am pretty sure I align with Giovanni and his ilk.
My preference is to lose the whole thing (and these discussions!) -but there is no point, I think, in proposing that because it has nochance of success.
When people talk about "users", they seem to mean developers.

With regard to Facebook's Graph API, it is indeed targeted towarddevelopers (Linked Data or otherwise).

The users I think of are the eyeballs that look at and manipulatethe stuff on their screens, usually in a browser.
Also, when a posting on this list has:
"Well, if I wanted to do this, " or "Imagine…"
my own eyeballs sort of glaze over.
Well, there have been 6 years to do it or for someone else toactually feel the need to do it - if it hasn't blazed a trail in thehuge range of Linked Data-enabled applications (irony intended)being used by users out there, then it probably isn't a veryimportant use case.
My slightly shorter story (thanks Dan, that was great, and I readthe whole thing!) involves Facebook as a LD site.In fact, I think this story is complementary to Dan's, as it givessome view of the experience that Bob's users will get after Alice'sconsultation and the subsequent implementation.
This actually happened to me last night.
Recalling that I now have a LD ID on Facebook, I go to Facebook andget my ID (well, I think of it as my ID, and it's what I give anyoneif they ask for a link to "me").
https://www.facebook.com/hugh.glaser
(I could stop there, as we all know I already have a problem, but …)
Being a brave little chap, before putting it in my signature as oneof my LD IDs, I decide to check that this is OK, by pasting it intosomething that wants a LD ID, such as the W3C validator (in thiscase I use curl -H Accept:application/rdf+xml).
It actually gave a 200, so it must be OK, right?
Of course, this doesn't validate because the URI actually does 302 -> 200 and returns text/html in response to my curl.
506 would have been possibly less helpful, by the way.
So I am done - nothing I can do now.
However, being not only brave, but also intrepid, I start googlingfor support.I eventually (it wasn't easy), find that I should be using graphinstead of www.
With excitement, I try
curl -i -L -H Accept:application/rdf+xml https://graph.facebook.com/hugh.glaser
Close, but no cigar.
I get text/javascript back.
More digging (I'll spare you the details)...
curl -i -L -H Accept:text/turtle https://graph.facebook.com/hugh.glaser
I cannot contain my excitement; I have some RDF at last!
So I can use https://graph.facebook.com/hugh.glaser as my FacebookLD ID.
Er, not quite.
The turtle this returns is
</720591128#>
        user:id "720591128" ;
Ah yes, I knew I had a numeric ID, 720591128 - so it being late Iguess my LD ID is https://graph.facebook.com/720591128
Of course, er no, not quite again.
I suddenly notice a little # lurking in the turtle.
So I finally decide that the URI I should put in my signature is
https://graph.facebook.com/720591128#
Of course, this is sufficiently ugly, compared with 
https://www.facebook.com/hugh.glaser
that I don't bother, and go to bed.

I'm surprised that perceived ugliness of a URI (although it is not sougly to me; beauty is in the eye of the beholder) would deter someonefrom taking advantage of the Linked Data. The only differences --- asyou have pointed out --- is that graph should be used instead of www,the FBID 720591128 is used instead of hugh.glaser, and the Linked DataURI has (what I call) an empty fragment. Here are the reasons forthese differences:1. I think (without certainty) that it is Facebook's intention thateverything at www.facebook.com be for human eyeballs. Admittedly,there could be some RDFa, and for some pages, there is RDFa containingOpen Graph Protocol markup (do not conflate the Open Graph Protocoland the Graph API). "Raw" data is made available --- targetingdevelopers --- via the Graph API at http://graph.facebook.com (if youclick that link without adding a path, it will redirect todocumentation).2. The FBID is used instead of the relative "vanity URL" (e.g., /hugh.glaser) because not every user has a vanity URL, and even if eachuser did, not every *thing* has a vanity URL. The Graph API providesmore than just data about users, and to quote Facebook's documention ( https://developers.facebook.com/docs/reference/api/): "Every object in the social graph has a unique ID."3. The use of the empty fragment is the easiest way to take advantageof how the Graph API works. Prior to serving up text/turtle, theGraph API served up only JSON at, e.g., http://graph.facebook.com/720591128. That is the place to find data about you. With littleinterference to existing code, when text/turtle is requested, the JSONis merely translated into text/turtle, making use of the internalsystem to provide meaningful semantics. One of the problems is that aURI needs to be minted for instances (e.g., a user), and givenhttpRange-14, I have the choice of using a hash URI and returning 200OK or using a slash URI and 303'ing to somewhere else. Using theempty fragment seemed like the most acceptable option. (See dialogueat the end of this email.)

Now I'm not saying that the TAG is going to solve all these issues.
And there are lots of issues about 303 and # and RDFa …
But I think this is a real Use Case for a user, which should meanthat the developer who provides this system (Facebook) is a Use Casefor the TAG.

The developer of the Linked Data would be me. I worked on this whileinterning at Facebook during the summer of 2011. I have sincereturned to RPI to continue working toward my Ph.D.

I could have gone through a very similar process with almost anyLinked Data site, such as ePrints, myexperiment and dbpdedia(including my own, such as RKBExplorer) - it just happened I wantedFacebook last night.And Linked Data people go around saying hows exciting it is thatFacebook is offering Linked Data - I can't possibly use this as anexample to a customer, such as Dan's Bob.
This whole experience is just crap.


Perhaps that experience was unpleasant.  Here's a marginally better one:

1. When you log into Facebook and go to your timeline (your own page),the path of the URL in the browser either looks like, e.g., /hugh.glasier or /profile.php?id=720591128 . In the latter case, youhave already found your FBID.2. If you have a vanity URL, like /hugh.glasier , simply do a HTTP GETfor http://graph.facebook.com/hugh.glasier , and that contains yourFBID.3. The URI representing you is http://graph.facebook.com/FBID# , whereFBID should be the FBID number.

Yes, there is the HTTPS discrepancy, and yes, this probably isn'tideal in terms of discovering the URI that identifies a user.

If I had trouble with this, exactly what does Facebook expect anormal user to do?I'm sure we can point out ways in which Facebook might have donethings better, but that is not the point.

Although I no longer work at Facebook, I would be interesting in such"ways in which Facebook might have done things better." Thatdiscussion would be more appropriate in another thread.

Can they actually make it easy for users using the current orproposed standards?
TAG, please don't come back with something that does not allow, oreven encourage, sites like Facebook to offer RDF back in return for:curl -H Accept:application/rdf+xml https://www.facebook.com/hugh.glaser
Best
Hugh
PS
I left the https in, because that is actually what cut and pastegave me.
I'm guessing that would have been a whole new thread.

http works, too, unless you're trying to access permissions-protecteddata, in which case you need to use https and provide a securitytoken. I'm not sure what the implications are regarding http/httpsURIs in Linked Data. Indeed, that would be a whole new thread.

PPS
If you read through to here, or even if you just skipped to here,then if you really do send me your Facebook LD URI (along with oneof more other ones to pair it with), I will drop everything and putthem in sameAs.org :-)
--
Hugh Glaser,
            Web and Internet Science
            Electronics and Computer Science,
            University of Southampton,
            Southampton SO17 1BJ
Work: +44 23 8059 3670, Fax: +44 23 8059 3045
Mobile: +44 75 9533 4155 , Home: +44 23 8061 5652
http://www.ecs.soton.ac.uk/~hg/

Finally, I would like to respond to an earlier comment made by TomHeath (sorry for the incomplete-looking cut-and-paste): "a rigorousassessment of how difficult people *really* find it to understanddistinctions such as 'things vs documents about things'. I've heardmany people claim that they've failed to explain this (or similar)successfully to developers/adopters; my personal experience is thateveryone gets it, it's no big deal (and IRs/NIRs would probably neverenter into the discussion)." My experience at Facebook agrees withTom Heath's experience. Understanding the distinction between"things" versus "documents about things" was easily understood. Themain source of contention was around its pragmatism and necessity.One developer said to me (paraphrase): "I would conflate documents andthings if I could." It is a strange statement to me, butnevertheless, the distinction was understood.

In the fashion of Dan Brickley, I would like to present another_hypothetical_ dialogue, one between a proponent of Linked Data and atypical web developer (although perhaps not quite as clever andthorough as Dan's).


BEGIN DIALOGUE

Proponent: "I found a way to meaningfully publish our already-published data as Linked Data, and I've implemented a prototype."


Developer: "Since you've already done it, let's take a look."

Proponent: "Okay, go to [link]."

Developer: "Hmmmm... [skip discussion about Turtle vs. RDF/XML].Everything looks okay, except I notice these URIs have #me at theend. Why? Can't we just lose the fragment?"

Proponent: "Well, URIs are used to identify things both on and off theweb. For example, no HTTP GET will ever squeeze you over a cable andpop you up in my browser."


Developer: "Sure.  So what?"

Proponent: "... so we need a way to mint URIs for both things on andoff the web that makes sense with how the web already works."


Developer: "Okay, but why the fragment?"

Proponent: "I'm getting to that. The current standard (which shallnot be named) is based on the notion that any URI for which a HTTP GETreturns with 200 OK (these are URIs without fragments) represents thedocument that is retrieved, that is, something *on* the web."


Developer: "Okay... seems logical."

Proponent: "So some conventions have been made for how to identifythings *off* the web. One is to simply add a fragment (understatementmeant to avoid confusion at this point), and that can identifysomething *off* the web."

Developer: "So I have to have a fragment? It seems unnecessary andugly."

Proponent: "There is an alternative. You can use a URI without afragment, but then doing an HTTP GET on the URI must return a 303which redirects to a document about the thing the URI represents."


Developer: "303?  What is that?"

Proponent: "See Other."

Developer: "Never heard of that. I don't want to have to createanother service just to 303 redirect to already-available data. Seemssuperfluous. Is there any other way?"

Proponent: "Well, we could actually let the URIs 404. It's not ideal,but it's legal."

Developer: "No, I don't want anything to 404. Never mind then. Whatabout this #me? Why 'me'?"

Proponent: "Well, that's just a common convention for saying that[URL] returns information about [URL]#me. #this is another common one."


Developer: "Hmmm... I don't know about that."

Proponent: "Well, if we don't want to 404, and we don't want tosupport 303, we'll need some kind of fragment to conform with thecurrent standard. We could just have an empty fragment so that thechanges are minimal, both in terms of effort and appearance."


Developer: "Okay... I guess... let's go with that, then."

END DIALOGUE

Glean from the dialogue what you will. How would I describehttpRange-14? Minimally sufficient.


Jesse Weaver
Ph.D. Student, Patroon Fellow
Tetherless World Constellation
Rensselaer Polytechnic Institute
http://www.cs.rpi.edu/~weavej3/index.xhtml

Re: See Other

Reply via email to