On 9/27/2011 6:17 PM, Jörn Kottmann wrote:
> On 9/27/11 6:07 PM, Riccardo Tasso wrote:
>> Another question about dictionaries: I'm interested in implementing my
>> own Dictionary classes (especially for POSDictionary) with their own
>> backend (e.g. Redis instead of memory). Wouldn't it be better if the
>> train method took a Dictionary interface, instead of a class, as a
>> parameter?
>
> Be aware that the POS Tagger accesses the dictionary very frequently;
> the current in-memory dictionary does a couple of thousand lookups
> per second.
>
> As said in the other mail, you would need to sub-class the Dictionary
> class, and the POSModel class. If there are
> any issues with that please let us know so we can improve our code.
>
> Jörn
Jörn,

The POSDictionary class really needs a shakedown, because right now a
lot of things in it are broken.  I'm waiting until after this release to
revamp the class so that it both works correctly (with the case
sensitivity flag) and has a consistent interface like the Dictionary
class already has.

James
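
For anyone weighing the interface idea against the lookup rate mentioned
above, here is a minimal sketch of how an external backend could sit
behind a small in-memory cache. Everything in it is an assumption made
for illustration: TagLookup and CachingTagLookup are hypothetical names,
not OpenNLP classes, and the backend is assumed to store lower-cased
keys when case sensitivity is turned off.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical interface (not the OpenNLP API): maps a word to its allowed tags.
interface TagLookup {
    String[] getTags(String word); // returns null when the word is unknown
}

// Wraps a slow backend (for example a Redis client) with a bounded LRU cache,
// so that repeated lookups for the same word are served from memory.
final class CachingTagLookup implements TagLookup {
    // Sentinel stored in the cache for words the backend does not know.
    private static final String[] MISSING = new String[0];

    private final TagLookup backend;
    private final boolean caseSensitive;
    private final Map<String, String[]> cache;

    CachingTagLookup(TagLookup backend, boolean caseSensitive, final int maxEntries) {
        this.backend = backend;
        this.caseSensitive = caseSensitive;
        // accessOrder = true makes the map evict the least recently used entry.
        this.cache = new LinkedHashMap<String, String[]>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<String, String[]> eldest) {
                return size() > maxEntries;
            }
        };
    }

    @Override
    public synchronized String[] getTags(String word) {
        // Assumption: with caseSensitive == false the backend holds lower-cased keys.
        String key = caseSensitive ? word : word.toLowerCase(Locale.ROOT);
        String[] tags = cache.get(key);
        if (tags == null) {
            // Not cached yet: ask the backend once and remember the answer,
            // including a negative answer.
            tags = backend.getTags(key);
            cache.put(key, tags == null ? MISSING : tags);
        }
        return (tags == null || tags == MISSING) ? null : tags;
    }
}
```

The cache size and the eviction policy would need tuning against real
tagging traffic; the sketch only shows where the case sensitivity flag
and the backend call would live.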
