Hi Gaurav,

On 04/09/2013 11:46 AM, gaurav pant wrote:
Hi All,

We are getting improper and inconsistent data for many records.Some issue which i have faced till yet are


1- Some person place of birth is not proper. It is having an integer value. 2- Sometimes we are getting birth date data pointed from "placeOfBirth" and sometimes from "birthPlace".
3-  Sometimes birthDate is not proper.

"
<http://dbpedia.org/resource/Vania_King> <http://dbpedia.org/property/*placeOfBirth*%C2%A0%C2%A0> <http://dbpedia.org/resource/Monterey_Park,_California> . <http://dbpedia.org/resource/Azhar_Naazir> <http://dbpedia.org/property/placeOfBirth%C2%A0%C2%A0> <http://dbpedia.org/resource/Rawalpindi> . <http://dbpedia.org/resource/Kaia_Kanepi> <http://dbpedia.org/property/placeOfBirth%C2%A0%C2%A0> <http://dbpedia.org/resource/Haapsalu> . <http://dbpedia.org/resource/Vernon_Cook> <http://dbpedia.org/property/placeOfBirth%C2%A0%C2%A0> <http://dbpedia.org/resource/Kent,_Ohio> .

<http://dbpedia.org/resource/Jeannine_Riley> <http://dbpedia.org/property/placeOfBirth> "1"^^<http://www.w3.org/2001/XMLSchema#int> . <http://dbpedia.org/resource/Henri_Verbrugghen> <http://dbpedia.org/property/placeOfBirth> "1"^^<http://www.w3.org/2001/XMLSchema#int> . <http://dbpedia.org/resource/Jozef_Spruyt> <http://dbpedia.org/property/placeOfBirth> "1"^^<http://www.w3.org/2001/XMLSchema#int> . <http://dbpedia.org/resource/Carl_Feilberg> <http://dbpedia.org/property/placeOfBirth> *"1"^^*<http://www.w3.org/2001/XMLSchema#int>

<http://dbpedia.org/resource/Bill_Vanthoff> <http://dbpedia.org/property/birthPlace> *"1"^^*<http://www.w3.org/2001/XMLSchema#integer> . <http://dbpedia.org/resource/L%C3%A9on_Brunschvicg> <http://dbpedia.org/property/birthPlace> "2"^^<http://www.w3.org/2001/XMLSchema#integer> . <http://dbpedia.org/resource/Victor_Amadeus,_Landgrave_of_Hesse-Rotenburg> <http://dbpedia.org/property/*birthPlace*> *"2"^^*<http://www.w3.org/2001/XMLSchema#integer> . <http://dbpedia.org/resource/D%C3%A9borah_Rodr%C3%ADguez> <http://dbpedia.org/property/birthPlace> "2"^^<http://www.w3.org/2001/XMLSchema#integer> . <http://dbpedia.org/resource/John_Belcher_%28architect%29> <http://dbpedia.org/property/birthPlace> "3"^^<http://www.w3.org/2001/XMLSchema#int> . <http://dbpedia.org/resource/William_I%2C_Elector_of_Hesse> <http://dbpedia.org/property/birthPlace> "3"^^<http://www.w3.org/2001/XMLSchema#int> . <http://dbpedia.org/resource/Bill_Vanthoff> <http://dbpedia.org/property/*birthDate*> *<http://dbpedia.org/resource/Elmore,_Victoria>*
"


All of these triples are extracted by the "InfoboxExtractor".
The "MappingExtractor" extracts much better and more accurate data as discussed in that thread [1].


How to deal with such inconsistencies of the data.


--
Regards
Gaurav Pant
+91-7709196607,+91-9405757794


--
Kind Regards
Mohamed Morsey
Department of Computer Science
University of Leipzig

------------------------------------------------------------------------------
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis & visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter
_______________________________________________
Dbpedia-discussion mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbpedia-discussion

Reply via email to