On 9/27/2011 6:17 PM, Jörn Kottmann wrote: > On 9/27/11 6:07 PM, Riccardo Tasso wrote: >> Another question about dictionaries: I'm interested to implement my >> own Dictionary classes (especially for POSDictionary) whith its own >> backend (e.g. Redis instead of memory). It wouldn't be better if >> train method had a Dictionary interface, instead of a class as a >> parameter? > > Be aware of the fact that the POS Tagger is accessing the dictionary > very frequently, the current memory > dictionary does a couple of thousand look ups per second. > > As said in the other mail, you would need to sub-class the Dictionary > class, and the POSModel class. If there are > any issues with that please let us know so we can improve our code. > > Jörn Jorn,
The POS Dictinary class really needs a shakedown, because right now a lot of items are broken. I'm waiting until after this release to re-vamp the class so... it both works correctly (with the case sensitivity flag) and a consistent interface like the Dictionary class already has. James