On Wed, Jan 30, 2013 at 4:51 PM, Rafa Haro <[email protected]> wrote: > Thanks Rupert for your valuable feedback and contributions! > > El 30/01/13 16:03, Rupert Westenthaler escribió: > >> As those algorithm will be the main source for requirements on the >> disambiguation index model we might need to investigate this while >> designing the disambiguation index model. > > > It would be very great to have a brainstorming session for that. I can point > out a lot of papers related to disambiguation topic, although most of them > are focused on disambiguation against Wikipedia, being difficult to find > papers proposing a most generic approach. >
I would start with some typical scenarios: * SKOS like controlled vocabulary (such as gemet [1]) * Knowledge base based on Company data (we could e.g. start from the data model used by Sugar CRM [2]) * Assuming a Scenario where Users do use Stanbol to annotate content and manually correct suggestions (The annotate.js [3] use case). So the input for disambiguation is the original knowledge base plus the mentions of those in manually corrected content of the user. Based on those scenarios it should be possible to evaluate if and how we could acquire the data required by the algorithms. WDYT Rupert [1] http://www.eionet.europa.eu/gemet/about?langcode=en [2] http://www.sugarcrm.com/company-overview [3] http://szabyg.github.com/annotate.js/ > Regards > > > > -- > > ------------------------------ > This message should be regarded as confidential. If you have received this > email in error please notify the sender and destroy it immediately. > Statements of intent shall only become binding when confirmed in hard copy > by an authorised signatory. > > Zaizi Ltd is registered in England and Wales with the registration number > 6440931. The Registered Office is 222 Westbourne Studios, 242 Acklam Road, > London W10 5JJ, UK. -- | Rupert Westenthaler [email protected] | Bodenlehenstraße 11 ++43-699-11108907 | A-5500 Bischofshofen
