Sure! Or fix existing table mappings. For example, the table mapping that
generates triples for automobile engines is for some reason also triggered
on http://en.wikipedia.org/wiki/Airbus_A300 . Maybe the algorithm by which
we choose the tables needs to be changed, but hopefully we just have to
change the settings for the automobile engines table mapping.
Cheers,
JC
On Mar 13, 2012 8:09 AM, "Pablo Mendes" <[email protected]> wrote:
>
> Thanks, Jona. So I guess we can go ahead and create some table mappings?
>
> Cheers,
> Pablo
>
> On Tue, Mar 13, 2012 at 1:17 AM, Jona Christopher Sahnwaldt <
> [email protected]> wrote:
>
>> > I would love to see extractor.TableMappings fixed, and more table
>> > mappings created. Jona has already stopped the likely infinite loop in
>> > that class. I quickly scanned through the code and had the impression
>> > that it lacks a termination criterion for its recursion. Shouldn't be
>> > too hard to fix if you have a couple of hours to throw at it.
>>
>> Unless there was a cycle in the class relations (which might even
>> make sense with equivalent classes), the recursion did terminate.
>> The problem was that the triple list was appended to itself
>> (and thus doubled in size) each time the method writeType was
>> called and in the end grew by a factor of 2^(r*c) (or thereabouts),
>> where r is the number of resources extracted from one table
>> and c is the average number of all related classes (transitive base
>> and equivalent classes) of each of these resources. For
>> pages like http://en.wikipedia.org/wiki/Ford_Crown_Victoria (see [1])
>> or http://en.wikipedia.org/wiki/Airbus_A300 (see [2]) the table
>> extractor generates around 20 variants with around 3 related classes for
>> each. 2^60 is a big number. :-) A server with a few
>> exabytes of RAM might have handled that, but remember
>> that we then try to send that page to a poor browser. :-)
>>
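To illustrate the blow-up described above, here is a minimal Python sketch. It is not the framework's Scala code: `write_type_buggy`, `write_type_fixed`, `run` and the triple tuples are hypothetical names chosen for illustration, and the exact shape of the original bug is an assumption. The sketch only shows that appending a list to itself once per call doubles it, so n calls cost a factor of roughly 2^n.

```python
def write_type_buggy(triples, resource, cls):
    """Hypothetical stand-in for the faulty step: adds one type triple,
    then appends the accumulated list to itself, doubling its size."""
    triples.append((resource, "rdf:type", cls))
    triples.extend(list(triples))  # the bug: the list is appended to itself


def write_type_fixed(triples, resource, cls):
    """Adds exactly one type triple per call, so growth stays linear."""
    triples.append((resource, "rdf:type", cls))


def run(write_type, resources, classes):
    """Call write_type once per (resource, class) pair and return the
    final number of triples in the shared list."""
    triples = []
    for r in range(resources):
        for c in range(classes):
            write_type(triples, f"res{r}", f"Class{c}")
    return len(triples)


# Keep the inputs tiny: with 2 resources and 3 classes there are 6 calls.
print(run(write_type_fixed, 2, 3))   # 6
print(run(write_type_buggy, 2, 3))   # 126, i.e. 2**7 - 2
print(2 ** 60)                       # 1152921504606846976
```

In this model the buggy list holds 2^(n+1) - 2 entries after n calls. With around 20 resources and 3 classes each, n is about 60, which is why the sketch should not be run with those values.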
>> The thing is that there already is correct code for adding all
>> types for a resource to a triple set. We also do that in
>> TemplateMapping:
>>
>>
>> http://dbpedia.hg.sourceforge.net/hgweb/dbpedia/extraction_framework/file/3180c9d769fa/core/src/main/scala/org/dbpedia/extraction/mappings/TemplateMapping.scala#l83
>>
>> I'd like to move that code to OntologyClass, but I'd also like
>> to think about improving it. I think the code in TemplateMapping still
>> blows up if there are cycles in class relations. And there may be other
>> places where we re-implemented this process.
>>
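One way to collect all related classes without blowing up on cycles is to track the classes already visited. The following Python sketch makes that concrete. It is an assumption about how such a helper could look, not the code in TemplateMapping or OntologyClass: the function `related_classes`, the two dictionaries and the class names in the example are all hypothetical.

```python
from collections import deque


def related_classes(start, base_classes, equivalent_classes):
    """Return `start` plus all transitive base and equivalent classes.

    Both arguments map a class name to a list of directly related class
    names. The `seen` set guarantees termination even if the relations
    contain a cycle (e.g. two classes declared equivalent to each other).
    """
    seen = {start}
    order = [start]
    queue = deque([start])
    while queue:
        cls = queue.popleft()
        neighbours = base_classes.get(cls, []) + equivalent_classes.get(cls, [])
        for other in neighbours:
            if other not in seen:
                seen.add(other)
                order.append(other)
                queue.append(other)
    return order


# Illustrative data only; "Car" and "Automobile" form an equivalence cycle.
base = {"Automobile": ["MeanOfTransportation"],
        "MeanOfTransportation": ["owl:Thing"]}
equiv = {"Automobile": ["Car"], "Car": ["Automobile"]}

print(related_classes("Automobile", base, equiv))
# ['Automobile', 'MeanOfTransportation', 'Car', 'owl:Thing']
```

Each class is emitted once, so a caller can add one type triple per returned class and the result stays linear in the number of related classes.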
>> Cheers,
>> JC
>>
>> [1]
>> http://mappings.dbpedia.org/server/mappings/en/extractionSamples/Mapping:Infobox_Automobile_generation
>> [2]
>> http://mappings.dbpedia.org/server/mappings/en/extractionSamples/Mapping:Infobox_aircraft_type
>>
>> On Mon, Mar 12, 2012 at 23:24, Pablo Mendes <[email protected]>
>> wrote:
>> >
>> > Hi emijrp,
>> > If by "underdeveloped" you mean where DBpedia needs more data, then
>> > you should take a look at the mappings statistics for the language of
>> > your interest:
>> > http://mappings.dbpedia.org/index.php/Mapping_Statistics
>> >
>> > If by "underdeveloped" you mean where DBpedia needs some coding, then
>> > I would say TableMappings. Some tables on Wikipedia look like they
>> > would easily produce good data:
>> > http://en.wikipedia.org/wiki/List_of_social_networking_websites
>> >
>> > While others would be harder (e.g. multiple links within a cell):
>> > http://en.wikipedia.org/wiki/List_of_sovereign_states
>> >
>> > But it would be great if it caught at least the easy cases. This would
>> > enable the extraction of about 73.6% [1] of the list pages on Wikipedia.
>> >
>> > I would love to see extractor.TableMappings fixed, and more table
>> > mappings created. Jona has already stopped the likely infinite loop in
>> > that class. I quickly scanned through the code and had the impression
>> > that it lacks a termination criterion for its recursion. Shouldn't be
>> > too hard to fix if you have a couple of hours to throw at it.
>> >
>> > Cheers,
>> > Pablo
>> >
>> > [1]
>> > http://articles.businessinsider.com/2010-02-17/strategy/30008803_1_market-forecasts-analysis-data-projections
>> >
>> > On Mon, Mar 12, 2012 at 10:50 PM, emijrp <[email protected]> wrote:
>> >>
>> >> Cool, thanks!
>> >>
>> >> What are the under-developed areas in dbpedia?
>> >>
>> >> 2012/3/12 Jona Christopher Sahnwaldt <[email protected]>
>> >>>
>> >>> Hi emijrp, Anja gave you editor rights on Feb 8.
>> >>> Maybe you already know, but there was no
>> >>> reply to your mail on the list. Regards, JC
>> >>>
>> >>>
>> >>> On Wed, Feb 8, 2012 at 22:58, emijrp <[email protected]> wrote:
>> >>> > Hi. My account "emijrp" on the mappings wiki has not been activated
>> >>> > yet,
>> >>> > after some weeks waiting. Regards.
>> >>
>> >>
>> >>
>> >>
>> >> _______________________________________________
>> >> Dbpedia-discussion mailing list
>> >> [email protected]
>> >> https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion
>> >>
>> >
>>
>
>