Prefixes

Andy Seaborne Tue, 29 Jan 2013 04:50:44 -0800

There aren't many compatibility issues at the moment and preserving thename as the interface would be better compatibility anyway. IMO Theinterface name is more important then implementation name so I thinkthat it gets priority.

1/ shall we make the interface PrefixMap, and call current PrefixFixPrefixFixStd, and have a factory?

Other future implementations include one for XML-valid prefixing forexample as Turtle and XML are diverging.

The cost of finding a perfect Turtle one vs fast may become significantas well. There are escapes in the local par tof a prefix name inTurtle-1.1.

The abbrevKey used in FastPrefixMap is the URI upto the last '#' orfailing that '/'.


This is a prefix look up only when two prefix URI abbrevKeys share a prefix.

This is rare (I claim!). I added some tests toAbstractTestLightweightPrefixMap to investigate ...and the two implementations differ here. The FastPrefixMap does notabbreviate where the old code does.


2/ In the general purpose PrefixMapStd class, we could keep a Map from
registered URI string to prefix to speed up that common case.

A Map is smaller than the Trie (which has a map at each level, sohttp:// is 7 maps). It does not optimize cases where one prefixpossibility is a prefix of another, and it falls back to brute forcehere. Does this case matter? (Using the abbrevKey and stripping backthe possibilities would be a possibility.)

This all gets a lot more complicated for RDF 1.1 where the local partcan have escaped / and # in it.


Over-engineering 1:

Delay the Tries creation until the first call of abbrev or at the end ofparsing (the case of many small files concatenated might have had a lotof prefixes)

Or build the Trie during output only? if there is an abbrevKey, findthe abbreviate old style, and install it. This is using in a cache fashion.


Otherwise the suggestion of OutputPrefixMap looks good.

Over-engineering 2:

I thought a bit about two interfaces - one for parsers, one for writersbut it seem to be quite complicated for little or no value. What's morea prefix map may start by being used for parsing and then used forwriting (if and when old PrefixMapping gets sorted which would be goodat Jena3 - it's XML centric).



I'm happy to put time in to help with changes.

        Andy

Prefixes

Reply via email to