Hi Bonnie

I will try to draw the complete picture in this mail.

It is perfectly true, that you can run KIM with a completely new ontology. 
Importing the RDF in OWLIM is only the first part of the process. After this 
you will have to make the resources you use for IE aware of the new ontology. 
For example changing the way the Gazetteer fills its dictionary, making 
grammars aware of the new classes (Lookup.class feature) etc.  That way you 
will be able to extract the new entities.

Extraction is one of the modules of KIM. Over that, we provide indexing and 
search mechanisms. Searches are accessible though the web interface and on low 
level though the Java API. But because the WEB UI is tightly related PROTON, 
you will not be able to use it. You will have to use the Java API  to search 
for entities and documents.


So using a completely new ontology is a lot of work and efforts.

KIM makes it easier if you map your ontology to PROTON. That way you can reuse 
the already existing mechanisms and save a lot of work. Reading this resource 
carefully will answer most of the questions how to do that:
* http://ontotext.com/kim/doc/KimDocs-3.0-EN/ExtendInformationExtraction.html

The minimum requirements are:
-  your classes should inherit protons:Entity directly or indirectly
-  make your classes and properties visible by setting  kimso:visibilityLevel1
-  provide at least a MainAlias* to all the instances

* you can use either aliases or labels to describe your entities. If you use 
labels, the property is protons:mainLabel . Which mechanism you use is 
described in KIM/config/install.properties 
(com.ontotext.kim.KIMConstants.ENTITY_DESCR property).

It is generally a very good idea to subclass the main classes in PROTON 
* protont:Person
* protont:Organization
* protont:Location
This will make them visible for some of the resources.

Another thing to have in mind, is the default behavior of the gazetteer. Only 
entities generated by trusted sources will be included in the dictionary. That 
is, the below RDF is valid for them:
?entity protons:generatedBy ?trusted .
?trusted rdf:type protons:Trusted .

You can add your own trusted source or use one of the already existing. You can 
also provide a custom query to fill the dictionary of the gazetteer. This is 
done by putting a text file with the query (sparql or serql) in 
KIM/config/query.txt .


When using some ontology modeling tool (like Protege), it is enough to include 
the proton modules and follow the rules above to allow KIM to make a good use 
of the ontology.

And again, all the information you need and even much more detailed is in the 
documentation:
* http://ontotext.com/kim/doc/KimDocs-3.0-EN/ExtendInformationExtraction.html

Hope this helps,
Philip Alexiev
Software Engineer, KIM Team
_______________________________________________
Kim-discussion mailing list
Kim-discussion@ontotext.com
http://ontotext.com/mailman/listinfo/kim-discussion

Reply via email to