Hi Dileepa, all

Interesting discussion with a lot of very relevant topics. In this
mail I select (1..4) points of the ongoing discussion and add my
thoughts to it. In addition I will add (5) Knowledge base population
as an additional usage case scenario.

(1) Digital Identity in social Networks: I am not a export for the
ontonet, rules and reasoning component, so I will not comment on how
those could be used to process structured information of profiles.
However the Stanbol Enhancer would be well suited to deal with
unstructured information provided by such profiles.

(2) Social Profile of a Person/Organization: Building up a knowledge
base by using information collected from Social Network activities. I
would not recommend to use the Stanbol Entityhub as primary storage
for such a knowledge base. However the Entityhub could be used to
create/store specialized indexes derived from such a knowledge base.
Those indexes would then be used (instead of the knowledge base) by
Enhancement Engines to make lookups for words of the text, Prefix
based suggestion as used by user interfaces ...

(3) Disambiguation #1: With (1) and (2) you will have multiple Social
Profiles using the same name from different information sources (e.g.
Twitter and Facebook). Deciding if those profiles are in fact the same
Entity will need some kind of disambiguation. The result of those
could mean to merge/split profiles in the knowledge base.

(4) Disambiguation #2: Profiles in the Knowledge might be referenced
in texts by the same word/phrase (e.g. similar person names, nick
names, acronyms or organizations ...). Profile information can be used
to ensure that the correct Person/Organization is linked.

(5) Knowledge base population: This would allow (a) Users or (b) an
Enhancement Engine to create new Persons/Organizations in the
knowledge base. An Example for (b) could be detected Named Entities
(NER) that where not linked to an known Entity. In addition an (c)
Entity Disambiguation Engine that comes to the conclusion that none of
the existing Entities is a fit for the mention in the text might also
create an new Entity. (1) and (2) could then be used to populate the
knowledge base with further information originating from social
networks.

Finally I would like to ask how widely used is WebId? Or in other
words: Would it usage limit the list of possible information sources?
Could it be used as unique identifier for data collected in the
knowledge base holding the Social Profile Information of a
Person/Organizations?

best
Rupert

On Tue, Apr 23, 2013 at 10:32 PM, Dileepa Jayakody
<[email protected]> wrote:
> Hi Rafa,
>
>
> Thanks for your ideas.
>
> WebID [1] is a URI based protocol to identify an entity (person,
> organization) uniquely over the web. Since WebID is based on linked-data
> (FOAF [3] syntax) we can extract information such as person's interests,
> employment history, cv, publications, and friends by analyzing the linked
> resources described in the WebID. When a WebID is associated with a person
> it can be used as a single-sign-on protocol like OpenID [2] as well.
>
> WebID can also be instrumental in social-web to have a single-profile over
> multiple social networks with linked resources like photos, friend-lists
> etc. Also security layers can be implemented on top WebID to apply
> fine-grained access control to such linked-resources described by the
> WebID. l
>
> As you have pointed out WebID can be used to implement a disambiguation
> algorithm for people entities in Stanbol. I also think it can be highly
> useful in semantic lifting by extracting information like, employment data,
> friend's profiles, publications etc described by the WebID. IMO sentiment
> analysis can also benefit from WebID.
>
>
> Thanks,
>
> Dileepa
>
> [1] http://www.w3.org/wiki/WebID
>
> [2] http://en.wikipedia.org/wiki/OpenID
> [3] http://xmlns.com/foaf/spec/
>
>
> On Tue, Apr 23, 2013 at 8:37 PM, Rafa Haro <[email protected]> wrote:
>
>> Hi Dileepa,
>>
>> El 23/04/13 13:45, Dileepa Jayakody escribió:
>>
>>  Hi Fabian et al,
>>>
>>> Thanks a lot for your valuable ideas.
>>> Yes it's really interesting to implement a 'person | organization'
>>> disambiguation module using WebID protocol as part of Stanbol Enhancement
>>> Engine. I went through the documentation of Stanbol and I have gained an
>>> overall idea about the architecture of Stanbol.
>>>
>> +1. That's a great idea. I don't know very much about WebID protocol, but
>> as far as you could use some profile data as disambiguation contexts, it
>> should be feasible to implement a disambiguation algorithm. Could you
>> please give us a concrete example where WebID is used? I suppose that the
>> general use case is to link name mentions in web pages with digital
>> identities. What kind of information is it possible to gather from WebID
>> identities?
>>
>>
>>> It would be great to get more ideas, suggestions about how to use Stanbol
>>> for people, organization disambiguation and to discuss the objectives and
>>> milestones in the GSOC project idea at [1].
>>> I also think one of the main factor for disambiguation is the
>>> data-set/knowledge base used for the process. What is the data-set Stanbol
>>> uses to verify data? Is Google Wiki-links released recently [1] a
>>> candidate
>>> for Stanbol data-set?
>>>
>> Initially, you can use any knowledge base in Stanbol. I always identify
>> EntityHub component as a "Knowledge Base" management system, although maybe
>> formally the EntityHub is not exactly that. Anyway, Google Wiki-links could
>> be a good resource for disambiguation when the knowledge base is Wikipedia
>> or DBpedia. In fact, Wiki-links contains 40 millions of mentions and its
>> contexts retrieved from web pages. This information can be eventually added
>> to a Wikipedia or DBpedia knowledge base as disambiguation contexts for the
>> entities covered in the dataset. Another interesting resource, as the new
>> in techcrunch points, is the dictionary of Wikipedia concepts released last
>> year [1]. This resource can be used to include more labels for each entity
>> (possible names), improving then the candidate selection step. As always,
>> we face a recall/precision problem with such dictionary.
>>
>> [1]  - http://googleresearch.**blogspot.com.es/2012/05/from-**
>> words-to-concepts-and-back.**html<http://googleresearch.blogspot.com.es/2012/05/from-words-to-concepts-and-back.html>
>>
>> Regards!
>>
>>
>>> Thanks,
>>> Dileepa
>>>
>>> [1]
>>> http://techcrunch.com/2013/03/**08/google-research-releases-**
>>> wikilinks-corpus-with-40m-**mentions-and-3m-entities/<http://techcrunch.com/2013/03/08/google-research-releases-wikilinks-corpus-with-40m-mentions-and-3m-entities/>
>>> On Mon, Apr 22, 2013 at 7:32 PM, Fabian Christ <
>>> [email protected]
>>>
>>>> wrote:
>>>> Hi,
>>>>
>>>> 2013/4/22 Dileepa Jayakody <[email protected]>:
>>>>
>>>>> Could it be a valid use-case to integrate WebID protocol in Stanbol to
>>>>> create social graphs and related ontologies?
>>>>>
>>>> the already mentioned entity disambiguation for persons might be such
>>>> a use case.
>>>>
>>>> Another idea could be that the enhancement process uses some
>>>> information from the personal profile of the user who sends the
>>>> request. I do not have any concrete example at the moment but engines
>>>> might be interested in knowing who is sending an enhancement request.
>>>> This may also be a relevant information for the disambiguation task.
>>>>
>>>> Best,
>>>>    - Fabian
>>>>
>>>>
>>>> --
>>>> Fabian
>>>> http://twitter.com/fctwitt
>>>>
>>>>
>>
>> --
>>
>> ------------------------------
>> This message should be regarded as confidential. If you have received this
>> email in error please notify the sender and destroy it immediately.
>> Statements of intent shall only become binding when confirmed in hard copy
>> by an authorised signatory.
>>
>> Zaizi Ltd is registered in England and Wales with the registration number
>> 6440931. The Registered Office is 222 Westbourne Studios, 242 Acklam Road,
>> London W10 5JJ, UK.
>>



-- 
| Rupert Westenthaler             [email protected]
| Bodenlehenstraße 11                             ++43-699-11108907
| A-5500 Bischofshofen

Reply via email to