Vladimir, I'm thinking of trying to do some stats on the existing ontology and the mappings to see where there is room for improvement. I'm tied up this week with a couple deadlines that I seem to moving towards at greater than light speed, though my progress is not.
As soon as I get the rough cut done, I'll share the results with you and maybe we can discuss paths forward? I'm with you on the 30% error rate...that doesn't help anyone. Aaron On Feb 25, 2015, at 08:02, Vladimir Alexiev <vladimir.alex...@ontotext.com> wrote: >> From: M. Aaron Bossert [mailto:maboss...@gmail.com] >> I am more than happy to work the ML problem with you. > > Hi Aaron! > Would be great to work with someone from Cray but I don't have a good idea > how to use ML here, > nor indeed a lot of trust in using ML to produce or fix mappings. > > E.g. see this exchange: > https://twitter.com/valexiev1/status/565814870973890560 > Generating 30% wrong prop maps for the Ukrainian dbpedia is IMHO doing them a > disservice! > Who's gonna clean up all this? > > I guess I'm more of a MLab (Manual Labor) guy, I just learned they coined > such alias for crowdsourcing: > http://link.springer.com/chapter/10.1007%2F978-3-319-13704-9_14 > >>> DBO: dbo:parent rdfs:range dbo:Person >>> Wikipedia: | mother = [[Queen Victoria]] of [[England]] >> For your example of the dichotomy with the domain and range of "mother" and >> queen Victoria being the "mother", this begs for contextual approach to that >> concept.... > > She IS the mother, not sure what you mean. > > Here a simple post-extraction cleanup can take care of it: > remove all statements that violate range (so dbo:parent [[England]] will be > removed). > But we dare not do it, because many of the ranges are imprecise, or set > wishfully without regard to existing data / mappings. > (As usual, the real data is more complex than any model of it.) > > So we need to check our Ontological Assumptions and precise domains/ranges > before such cleanup. > See example in > http://vladimiralexiev.github.io/pres/20150209-dbpedia/dbpedia-problems-long.html#sec-6-7 > > ------------------------------------------------------------------------------ Dive into the World of Parallel Programming The Go Parallel Website, sponsored by Intel and developed in partnership with Slashdot Media, is your hub for all things parallel software development, from weekly thought leadership blogs to news, videos, case studies, tutorials and more. Take a look and join the conversation now. http://goparallel.sourceforge.net/ _______________________________________________ Dbpedia-discussion mailing list Dbpedia-discussion@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion