At the day job, one of our lead technologies is a device that can decide that 'Barak Obama' and 'Barack Obama' are probably the same thing, or even that 歐巴馬 is another spelling. Is there an extension model for SPARQL queries? In this case, it wouldn't really work to live only in the FILTER, since the fundamental selection would be something like:

    ?s something:hasName "Barak Obama"

and we want to tamper with how the literal string gets compared. We have one API that answers "how similar are these two strings?" and another, more complex model in which we build an index that rapidly returns all the strings within some distance of a query string. We could, of course, build our own index by mining TDB, run our own query against it, and then get busy SPARQL-ing from the set of URIs thus derived, but I wondered whether there is a more integrated approach.
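
For concreteness, the non-integrated fallback described above (mine the store for names, find the near matches, then start the SPARQL query from the resulting URIs) might look roughly like the sketch below. Everything in it is an assumption made for illustration: plain Levenshtein distance stands in for the real similarity API (it would not catch a cross-script spelling such as 歐巴馬), the linear scan stands in for the real index, and the example.org URIs and the function names are hypothetical. The candidates are passed to the query through a SPARQL 1.1 VALUES clause.

```python
from collections import defaultdict


def edit_distance(a: str, b: str) -> int:
    """Plain Levenshtein distance; a stand-in for the real similarity API."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cur.append(min(
                prev[j] + 1,               # delete a character from a
                cur[j - 1] + 1,            # insert a character into a
                prev[j - 1] + (ca != cb),  # substitute (free if equal)
            ))
        prev = cur
    return prev[-1]


def build_name_index(rows):
    """Group subject URIs by name literal; rows are (uri, name) pairs mined from the store."""
    index = defaultdict(set)
    for uri, name in rows:
        index[name].add(uri)
    return index


def candidates(index, query, max_distance):
    """Return URIs whose name is within max_distance of the query.

    This is a linear scan; a real index would avoid visiting every name.
    """
    uris = set()
    for name, name_uris in index.items():
        if edit_distance(query, name) <= max_distance:
            uris |= name_uris
    return sorted(uris)


def seeded_query(uris):
    """Build a SPARQL query that starts from the candidate URIs via VALUES."""
    values = " ".join(f"<{u}>" for u in uris)
    return (
        "SELECT ?s ?p ?o WHERE {\n"
        f"  VALUES ?s {{ {values} }}\n"
        "  ?s ?p ?o .\n"
        "}"
    )


# Hypothetical data standing in for (subject, name) pairs pulled out of TDB.
rows = [
    ("http://example.org/person/1", "Barack Obama"),
    ("http://example.org/person/2", "Michelle Obama"),
]
index = build_name_index(rows)
matches = candidates(index, "Barak Obama", max_distance=1)
print(seeded_query(matches))
```

The obvious cost of this route is that the name index lives outside the store and has to be kept in sync with it, which is part of why a more integrated approach is attractive.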
