Hi, One of the necessary steps for implementing the Event extraction Engine feature : https://issues.apache.org/jira/browse/STANBOL-1121 is to have coreference resolution in the given text. This is provided now via the stanford-nlp project but as far as I saw this module is performing mostly pronomial (He, She) or nominal (Barack Obama and Mr. Obama) coreference resolution.
In order to get more coreferences from the text I though of creating some logic that would detect this kind of coreference : "Apple reaches new profit heights. The software company just announced its 2013 earnings." Here "The software company" obviously refers to "Apple". So I'd like to detect coreferences of Named Entities which are of the rdf:type of the Named Entity , in this case "company" and also have attributes which can be found in the dbpedia categories of the named entity, in this case "software". The detection of coreferences such as "The software company" in the text would also be done by either using the new Pos Tag Based Phrase extraction Engine (noun phrases) or by using a dependency tree of the sentence and picking up only subjects or objects. At this point I'd like to know if this kind of logic would be useful as a separate Enhancement Engine (in case the precision and recall are good enough) in Stanbol? Thanks, Cristian