[ 
https://issues.apache.org/jira/browse/STANBOL-1140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Antonio David Pérez Morales updated STANBOL-1140:
-------------------------------------------------

    Description: 
Freebase is a large collaborative knowledge base consisting of metadata 
composed mainly by its community members. It is an online collection of 
structured data harvested from many sources, including individual 'wiki' 
contributions. Freebase aims to create a global resource which allows people 
(and machines) to access common information more effectively.

Freebase data is available for free/libre for commercial and non-commercial use 
under a Creative Commons Attribution License, and an open API, RDF endpoint, 
and database dump are provided for programmers. Freebase contains 1.2 billion 
of triples so having this information in a graph will be very useful in order 
to be able to create new graph-based algorithms for disambiguation.

The goal of this task is to develop a tool that will be able to parse a 
Freebase dump and import it in a graph database. For a first version, the 
selected database will be Neo4j, managed by Tinkerpop Blueprints API. In order 
to build the graph, we are going to store as vertexes all the Freebase 
entities. Between two vertex, it will be an edge if there is a direct 
relationship between both entities in Freebase or a mediated relationship. 
Mediated relationship relates two entities that are Topics (concepts) in 
Freebase through an entity that is not a Topic (categories). This categorized 
edges can be very useful for the later disambiguation.

  was:
Freebase is a large collaborative knowledge base consisting of metadata 
composed mainly by its community members. It is an online collection of 
structured data harvested from many sources, including individual 'wiki' 
contributions. Freebase aims to create a global resource which allows people 
(and machines) to access common information more effectively. 

Freebase data is available for free/libre for commercial and non-commercial use 
under a Creative Commons Attribution License, and an open API, RDF endpoint, 
and database dump are provided for programmers.

Freebase contains 1.2 billion of triples so having this information in a graph 
will be very useful in order to be able to create new graph-based algorithms 
for disambiguation.

    
> Freebase To Graph Importer
> --------------------------
>
>                 Key: STANBOL-1140
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1140
>             Project: Stanbol
>          Issue Type: Sub-task
>          Components: Entityhub
>            Reporter: Antonio David Pérez Morales
>              Labels: Freebase, disambiguation, graph, neo4j
>   Original Estimate: 336h
>  Remaining Estimate: 336h
>
> Freebase is a large collaborative knowledge base consisting of metadata 
> composed mainly by its community members. It is an online collection of 
> structured data harvested from many sources, including individual 'wiki' 
> contributions. Freebase aims to create a global resource which allows people 
> (and machines) to access common information more effectively.
> Freebase data is available for free/libre for commercial and non-commercial 
> use under a Creative Commons Attribution License, and an open API, RDF 
> endpoint, and database dump are provided for programmers. Freebase contains 
> 1.2 billion of triples so having this information in a graph will be very 
> useful in order to be able to create new graph-based algorithms for 
> disambiguation.
> The goal of this task is to develop a tool that will be able to parse a 
> Freebase dump and import it in a graph database. For a first version, the 
> selected database will be Neo4j, managed by Tinkerpop Blueprints API. In 
> order to build the graph, we are going to store as vertexes all the Freebase 
> entities. Between two vertex, it will be an edge if there is a direct 
> relationship between both entities in Freebase or a mediated relationship. 
> Mediated relationship relates two entities that are Topics (concepts) in 
> Freebase through an entity that is not a Topic (categories). This categorized 
> edges can be very useful for the later disambiguation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to