Hi Rupert et al,

Thanks a lot for your valuable insights regarding use cases of digital
identity with Stanbol and in semantic web in general. I think verified
digital identity and semantic-web goes hand in hand, merging the concepts
is beneficial for efficient and accurate disambiguation of entities and
accurate semantic enhancements.

As you have suggested WebID can be used as a federated social profile
across multiple SNSs, connecting to the same resources (contact
info,photos, friends, interests) identified uniquely via WebID. The
linked-data described by the WebID can be used to extract verified
knowledge about the person. As Antonio has described WebID is a unique URI
representing an entity and security is implemented using PKI. It can be
used as an authentication mechanism across the web with WebID supported
platforms. Essentially WebID is written in FOAF + some SSL which can be
used to create a trusted graph of entities. For more use cases please refer
[1], [2].

WebID is an emerging protocol and it's yet to be widely adopted. I too
don't have prior experience with WebID integrations, but I'm very much
interested to take a deep dive into the subject domain based on my research
interests. There are several WebID providers such as OpenLink Data Spaces
[4] and My-Profile [5]. If we go forward with integrating WebID in Stanbol
as part of the federated social network integration or disambiguation tasks
we can implement a WebID connector to such systems or implement a WebID
provider on Stanbol side. WDYT?

@Rupert: With regard to your opinion on using EntityHub as an index rather
than the primary storage for the knowledge base, can you please elaborate
more on why do you think so? Maybe Wiki-Link or Freebase [3] is a good
option for such a knowledge base?

Thanks,
Dileepa

[1]http://www.w3.org/wiki/WebID
[2] http://www.w3.org/wiki/Foaf%2Bssl/RelyingParties
[3] http://wiki.freebase.com/wiki/Main_Page
[4] http://ods.openlinksw.com/wiki/ODS/VirtODSSecurityWebID
[5] https://my-profile.eu



On Wed, Apr 24, 2013 at 3:42 PM, Antonio Perez <[email protected]> wrote:

> Hi Rupert, all
>
> First of all, let me introduce myself. I'm Antonio and recently I have
> started to work with Stanbol at Zaizi. I would like to prepare a proposal
> for GSoC related to the disambiguation problem.
>
> Regarding your last question Rupert, the objective of WebID is precisely
> that, to represent an entity by an URI and also give security (trust) using
> client certificates.
> These certificates (representing entities for example) contains the URI of
> the resource and you can access it to obtain more information.
> Take into account that Webid is basically a fancy name for SSL client side
> certificates with a profile url in it. So you can trust that the resource
> is a valid resource (using the certificate public key against the public
> key contained in the URI of the resource) and you can extract more
> information about it using the data given in the content of that URI.
>
> Regards
>
>
>
> On Wed, Apr 24, 2013 at 10:43 AM, Rupert Westenthaler <
> [email protected]> wrote:
>
> > Hi Dileepa, all
> >
> > Interesting discussion with a lot of very relevant topics. In this
> > mail I select (1..4) points of the ongoing discussion and add my
> > thoughts to it. In addition I will add (5) Knowledge base population
> > as an additional usage case scenario.
> >
> > (1) Digital Identity in social Networks: I am not a export for the
> > ontonet, rules and reasoning component, so I will not comment on how
> > those could be used to process structured information of profiles.
> > However the Stanbol Enhancer would be well suited to deal with
> > unstructured information provided by such profiles.
> >
> > (2) Social Profile of a Person/Organization: Building up a knowledge
> > base by using information collected from Social Network activities. I
> > would not recommend to use the Stanbol Entityhub as primary storage
> > for such a knowledge base. However the Entityhub could be used to
> > create/store specialized indexes derived from such a knowledge base.
> > Those indexes would then be used (instead of the knowledge base) by
> > Enhancement Engines to make lookups for words of the text, Prefix
> > based suggestion as used by user interfaces ...
> >
> > (3) Disambiguation #1: With (1) and (2) you will have multiple Social
> > Profiles using the same name from different information sources (e.g.
> > Twitter and Facebook). Deciding if those profiles are in fact the same
> > Entity will need some kind of disambiguation. The result of those
> > could mean to merge/split profiles in the knowledge base.
> >
> > (4) Disambiguation #2: Profiles in the Knowledge might be referenced
> > in texts by the same word/phrase (e.g. similar person names, nick
> > names, acronyms or organizations ...). Profile information can be used
> > to ensure that the correct Person/Organization is linked.
> >
> > (5) Knowledge base population: This would allow (a) Users or (b) an
> > Enhancement Engine to create new Persons/Organizations in the
> > knowledge base. An Example for (b) could be detected Named Entities
> > (NER) that where not linked to an known Entity. In addition an (c)
> > Entity Disambiguation Engine that comes to the conclusion that none of
> > the existing Entities is a fit for the mention in the text might also
> > create an new Entity. (1) and (2) could then be used to populate the
> > knowledge base with further information originating from social
> > networks.
> >
> > Finally I would like to ask how widely used is WebId? Or in other
> > words: Would it usage limit the list of possible information sources?
> > Could it be used as unique identifier for data collected in the
> > knowledge base holding the Social Profile Information of a
> > Person/Organizations?
> >
> > best
> > Rupert
> >
> > On Tue, Apr 23, 2013 at 10:32 PM, Dileepa Jayakody
> > <[email protected]> wrote:
> > > Hi Rafa,
> > >
> > >
> > > Thanks for your ideas.
> > >
> > > WebID [1] is a URI based protocol to identify an entity (person,
> > > organization) uniquely over the web. Since WebID is based on
> linked-data
> > > (FOAF [3] syntax) we can extract information such as person's
> interests,
> > > employment history, cv, publications, and friends by analyzing the
> linked
> > > resources described in the WebID. When a WebID is associated with a
> > person
> > > it can be used as a single-sign-on protocol like OpenID [2] as well.
> > >
> > > WebID can also be instrumental in social-web to have a single-profile
> > over
> > > multiple social networks with linked resources like photos,
> friend-lists
> > > etc. Also security layers can be implemented on top WebID to apply
> > > fine-grained access control to such linked-resources described by the
> > > WebID. l
> > >
> > > As you have pointed out WebID can be used to implement a disambiguation
> > > algorithm for people entities in Stanbol. I also think it can be highly
> > > useful in semantic lifting by extracting information like, employment
> > data,
> > > friend's profiles, publications etc described by the WebID. IMO
> sentiment
> > > analysis can also benefit from WebID.
> > >
> > >
> > > Thanks,
> > >
> > > Dileepa
> > >
> > > [1] http://www.w3.org/wiki/WebID
> > >
> > > [2] http://en.wikipedia.org/wiki/OpenID
> > > [3] http://xmlns.com/foaf/spec/
> > >
> > >
> > > On Tue, Apr 23, 2013 at 8:37 PM, Rafa Haro <[email protected]> wrote:
> > >
> > >> Hi Dileepa,
> > >>
> > >> El 23/04/13 13:45, Dileepa Jayakody escribió:
> > >>
> > >>  Hi Fabian et al,
> > >>>
> > >>> Thanks a lot for your valuable ideas.
> > >>> Yes it's really interesting to implement a 'person | organization'
> > >>> disambiguation module using WebID protocol as part of Stanbol
> > Enhancement
> > >>> Engine. I went through the documentation of Stanbol and I have gained
> > an
> > >>> overall idea about the architecture of Stanbol.
> > >>>
> > >> +1. That's a great idea. I don't know very much about WebID protocol,
> > but
> > >> as far as you could use some profile data as disambiguation contexts,
> it
> > >> should be feasible to implement a disambiguation algorithm. Could you
> > >> please give us a concrete example where WebID is used? I suppose that
> > the
> > >> general use case is to link name mentions in web pages with digital
> > >> identities. What kind of information is it possible to gather from
> WebID
> > >> identities?
> > >>
> > >>
> > >>> It would be great to get more ideas, suggestions about how to use
> > Stanbol
> > >>> for people, organization disambiguation and to discuss the objectives
> > and
> > >>> milestones in the GSOC project idea at [1].
> > >>> I also think one of the main factor for disambiguation is the
> > >>> data-set/knowledge base used for the process. What is the data-set
> > Stanbol
> > >>> uses to verify data? Is Google Wiki-links released recently [1] a
> > >>> candidate
> > >>> for Stanbol data-set?
> > >>>
> > >> Initially, you can use any knowledge base in Stanbol. I always
> identify
> > >> EntityHub component as a "Knowledge Base" management system, although
> > maybe
> > >> formally the EntityHub is not exactly that. Anyway, Google Wiki-links
> > could
> > >> be a good resource for disambiguation when the knowledge base is
> > Wikipedia
> > >> or DBpedia. In fact, Wiki-links contains 40 millions of mentions and
> its
> > >> contexts retrieved from web pages. This information can be eventually
> > added
> > >> to a Wikipedia or DBpedia knowledge base as disambiguation contexts
> for
> > the
> > >> entities covered in the dataset. Another interesting resource, as the
> > new
> > >> in techcrunch points, is the dictionary of Wikipedia concepts released
> > last
> > >> year [1]. This resource can be used to include more labels for each
> > entity
> > >> (possible names), improving then the candidate selection step. As
> > always,
> > >> we face a recall/precision problem with such dictionary.
> > >>
> > >> [1]  - http://googleresearch.**blogspot.com.es/2012/05/from-**
> > >> words-to-concepts-and-back.**html<
> >
> http://googleresearch.blogspot.com.es/2012/05/from-words-to-concepts-and-back.html
> > >
> > >>
> > >> Regards!
> > >>
> > >>
> > >>> Thanks,
> > >>> Dileepa
> > >>>
> > >>> [1]
> > >>> http://techcrunch.com/2013/03/**08/google-research-releases-**
> > >>> wikilinks-corpus-with-40m-**mentions-and-3m-entities/<
> >
> http://techcrunch.com/2013/03/08/google-research-releases-wikilinks-corpus-with-40m-mentions-and-3m-entities/
> > >
> > >>> On Mon, Apr 22, 2013 at 7:32 PM, Fabian Christ <
> > >>> [email protected]
> > >>>
> > >>>> wrote:
> > >>>> Hi,
> > >>>>
> > >>>> 2013/4/22 Dileepa Jayakody <[email protected]>:
> > >>>>
> > >>>>> Could it be a valid use-case to integrate WebID protocol in Stanbol
> > to
> > >>>>> create social graphs and related ontologies?
> > >>>>>
> > >>>> the already mentioned entity disambiguation for persons might be
> such
> > >>>> a use case.
> > >>>>
> > >>>> Another idea could be that the enhancement process uses some
> > >>>> information from the personal profile of the user who sends the
> > >>>> request. I do not have any concrete example at the moment but
> engines
> > >>>> might be interested in knowing who is sending an enhancement
> request.
> > >>>> This may also be a relevant information for the disambiguation task.
> > >>>>
> > >>>> Best,
> > >>>>    - Fabian
> > >>>>
> > >>>>
> > >>>> --
> > >>>> Fabian
> > >>>> http://twitter.com/fctwitt
> > >>>>
> > >>>>
> > >>
> > >> --
> > >>
> > >> ------------------------------
> > >> This message should be regarded as confidential. If you have received
> > this
> > >> email in error please notify the sender and destroy it immediately.
> > >> Statements of intent shall only become binding when confirmed in hard
> > copy
> > >> by an authorised signatory.
> > >>
> > >> Zaizi Ltd is registered in England and Wales with the registration
> > number
> > >> 6440931. The Registered Office is 222 Westbourne Studios, 242 Acklam
> > Road,
> > >> London W10 5JJ, UK.
> > >>
> >
> >
> >
> > --
> > | Rupert Westenthaler             [email protected]
> > | Bodenlehenstraße 11                             ++43-699-11108907
> > | A-5500 Bischofshofen
> >
>
> --
>
> ------------------------------
> This message should be regarded as confidential. If you have received this
> email in error please notify the sender and destroy it immediately.
> Statements of intent shall only become binding when confirmed in hard copy
> by an authorised signatory.
>
> Zaizi Ltd is registered in England and Wales with the registration number
> 6440931. The Registered Office is 222 Westbourne Studios, 242 Acklam Road,
> London W10 5JJ, UK.
>

Reply via email to