On Fri, Sep 27, 2013 at 7:46 PM, Nicolas Torzec <torz...@yahoo-inc.com>wrote:

>  Thanks for the info and link Dimitris !
>
>  I am curious about the reason for making the extractor separate, and not
> making the resulting data visible as a first-class dataset on the download
> page?
>

We needed an extractor to get all(*) templates from a page and not only the
ones with parameters, see [1] for example (from Dutch DBpedia). After that
there was no need to duplicate data and we decided to separate the template
generation from the raw infobox data generation.
We forgot to include it in the download page, it got lost in the many new
datasets we created. Will fit it soon.


>  Any plan for recursively analyzing templates within infobox templates,
> and thus improving the coverage of the infobox extraction?
> (I haven't checked 3.9 yet…)
>

Not yet, the first step is to give the data to the community.

Cheers,
Dimitris

(*) we still apply the same heuristics to remove noisy templates
[1] http://nl.dbpedia.org/page/Harry_Mulisch

>
>  Cheers,
> Nicolas.
>
>
>
>
>
>   From: Dimitris Kontokostas <kontokos...@informatik.uni-leipzig.de>
> Date: Thursday, September 26, 2013 11:50 PM
> To: Nicolas Torzec <torz...@yahoo-inc.com>
> Cc: Martin Ovciarik <xovci...@stud.fit.vutbr.cz>, DBpedia Discussions <
> dbpedia-discussion@lists.sourceforge.net>
> Subject: Re: [Dbpedia-discussion] DBpedia v39 extraction problem.
>
>    Hi,
>
>  We created a separate extractor for this case (ArticleTemplatesExtractor)
> that extracts more templates from a wikipage.
>  This extractor produces a separate dump file [1] but I don't this it is
> loaded online. If more people need this we can ask OpenLink to upload it.
>
>  Best,
>  Dimitris
>
> [1] http://downloads.dbpedia.org/3.9/en/article_templates_en.ttl.bz2
>
>
> On Fri, Sep 27, 2013 at 1:07 AM, Nicolas Torzec <torz...@yahoo-inc.com>wrote:
>
>> Confirming that I can no longer find the template information in the
>> raw_infobox_properties_en.nt file.
>>
>> Is this information now available somewhere else?
>> Or is it simply no longer extracted by the DBpedia extraction framework?
>>
>> Template information proved to be useful when developing more advanced
>> information extraction systems on top of the DBpedia extraction frameworkŠ
>>
>>
>>
>> Find below the collection of triples having Brad_Pitt as a subject:
>>
>> <http://dbpedia.org/resource/Brad_Pitt> <http://dbpedia.org/property/name
>> >
>> "Brad Pitt"@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/caption> "Pitt at the BAFTA Film Awards 2012
>> in Covent Garden, London"@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/birthName> "William Bradley Pitt"@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/birthDate>
>> "1963-12-18"^^<http://www.w3.org/2001/XMLSchema#date> .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/birthPlace> "Shawnee, Oklahoma, U.S."@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/almaMater>
>> <http://dbpedia.org/resource/University_of_Missouri> .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/occupation> "Actor, film producer"@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/partner>
>> <http://dbpedia.org/resource/Angelina_Jolie> .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/children> "[[#Children"@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/colwidth>
>> "30"^^<http://www.w3.org/2001/XMLSchema#integer> .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/title> "Awards for Brad Pitt"@en .
>> <http://dbpedia.org/resource/Brad_Pitt> <http://dbpedia.org/property/name
>> >
>> "Pitt, William Bradley"@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/alternativeNames> "Pitt, Brad"@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/shortDescription> "American actor"@en .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/dateOfBirth>
>> "1963-12-18"^^<http://www.w3.org/2001/XMLSchema#date> .
>> <http://dbpedia.org/resource/Brad_Pitt>
>> <http://dbpedia.org/property/placeOfBirth> "Shawnee, Oklahoma, U.S."@en .
>>
>>
>>
>> -Nicolas
>>
>>
>>
>>
>>
>> On 9/26/13 4:51 AM, "Martin Ovciarik" <xovci...@stud.fit.vutbr.cz> wrote:
>>
>> >> Hi,
>> >>
>> >> i am working on a project in which i am extracting data from DBpedia.
>> >>
>> >> In older versions (v38, live dump-s) of DBpedia, there was an
>> >> attribute " <http://dbpedia.org/property/wikiPageUsesTemplate> ".
>> >>
>> >> According to this attribute i determinate the type of entity (ex.
>> >> Person, Location, Artist etc.). In v39 i can't find this atribute (i
>> >> am extracting data from Raw Infobox Properties).
>> >>
>> >> Where can i find this atribute ?
>> >>
>> >> Martin Ovciarik
>> >
>> >
>>
>> >--------------------------------------------------------------------------
>> >----
>> >October Webinars: Code for Performance
>> >Free Intel webinars can help you accelerate application performance.
>> >Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most
>> >from
>> >the latest Intel processors and coprocessors. See abstracts and register
>> >
>> >
>> http://pubads.g.doubleclick.net/gampad/clk?id=60133471&iu=/4140/ostg.clktr
>> >k
>> >_______________________________________________
>> >Dbpedia-discussion mailing list
>> >Dbpedia-discussion@lists.sourceforge.net
>> >https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>>
>>
>>
>> ------------------------------------------------------------------------------
>> October Webinars: Code for Performance
>> Free Intel webinars can help you accelerate application performance.
>> Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most
>> from
>> the latest Intel processors and coprocessors. See abstracts and register >
>>
>> http://pubads.g.doubleclick.net/gampad/clk?id=60133471&iu=/4140/ostg.clktrk
>> _______________________________________________
>> Dbpedia-discussion mailing list
>> Dbpedia-discussion@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>>
>
>
>
> --
> Dimitris Kontokostas
> Department of Computer Science, University of Leipzig
> Research Group: http://aksw.org
> Homepage:http://aksw.org/DimitrisKontokostas
>
>
> ------------------------------------------------------------------------------
> October Webinars: Code for Performance
> Free Intel webinars can help you accelerate application performance.
> Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most
> from
> the latest Intel processors and coprocessors. See abstracts and register >
> http://pubads.g.doubleclick.net/gampad/clk?id=60133471&iu=/4140/ostg.clktrk
> _______________________________________________
> Dbpedia-discussion mailing list
> Dbpedia-discussion@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>
>


-- 
Dimitris Kontokostas
Department of Computer Science, University of Leipzig
Research Group: http://aksw.org
Homepage:http://aksw.org/DimitrisKontokostas
------------------------------------------------------------------------------
October Webinars: Code for Performance
Free Intel webinars can help you accelerate application performance.
Explore tips for MPI, OpenMP, advanced profiling, and more. Get the most from 
the latest Intel processors and coprocessors. See abstracts and register >
http://pubads.g.doubleclick.net/gampad/clk?id=60133471&iu=/4140/ostg.clktrk
_______________________________________________
Dbpedia-discussion mailing list
Dbpedia-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to