Hi,

I have a question about generating the candidate index.
The CandidateIndexer class has one parameter:

--case-sensitive


But to my surprise, every surface form is lowercased anyway, because
a lowercased variant of the surface form is always generated in
AddSurfaceFormsToIndex.

What are the pros and cons of ignoring the case-sensitive flag when
generating the lowercase variant?
At the beginning of a sentence, words start with a capital letter.

Why do I ask?
In German, the verb (and surface form) "lassen" is matched with the
DBpedia resource "Lassen Peak" or something similar
(http://de.dbpedia.org/page/Lassen),
even if I set the case-sensitive flag.
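
To make the question concrete, here is a minimal sketch of the behaviour
I think I am seeing versus the behaviour I expected. It is not the actual
Spotlight code: the function names and the dictionary-based index are
hypothetical, and I only assume that the lowercase variant is added
unconditionally.

```python
def variants_as_observed(surface_form):
    """Assumed current behaviour: the lowercase variant is always added."""
    return {surface_form, surface_form.lower()}


def variants_respecting_flag(surface_form, case_sensitive):
    """Expected behaviour: skip the lowercase variant when case-sensitive."""
    variants = {surface_form}
    if not case_sensitive:
        variants.add(surface_form.lower())
    return variants


def build_index(surface_forms, make_variants):
    """Map every generated variant back to its original surface forms."""
    index = {}
    for sf in surface_forms:
        for variant in make_variants(sf):
            index.setdefault(variant, set()).add(sf)
    return index


# "Lassen" stands in for the surface form of the Lassen resource.
observed = build_index(["Lassen"], variants_as_observed)
expected = build_index(
    ["Lassen"],
    lambda sf: variants_respecting_flag(sf, case_sensitive=True),
)

print("lassen" in observed)  # True: the verb "lassen" finds the resource
print("lassen" in expected)  # False: only the capitalized form is indexed
```

With the first variant the German verb always becomes a candidate, which
is the match I would like to avoid.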

Best regards,
Reinhard

