custom index

Benson Margulies Fri, 28 Jan 2011 07:50:29 -0800

At the day job, one of our lead technologies is a device that can
decide that 'Barak Obama' and 'Barack Obama' are probably the same
thing, or even that 歐巴馬 is another spelling. Is there an extension
model for SPARQL queries? In this case, it wouldn't really work to
just live in the FILTER, since the fundamental selection would be
something like:



?s something:hasName "Barak Obama"

and we want to tamper with how the literal string gets compared. We
have one API that says "how similar are these strings" and another
more complex model in which we build an index that rapidly returns all
the strings that are within some distance of a query. We could, of
course, build our own index by mining  TDB, make our own query, and
then get busy SPARQL-ing starting from a set of URI's thus derived,
but I just wondered about a more integrated approach.

Extending TDB/Fuseki with a fuzzy match primitive/custom index

Reply via email to