Andy Buerki has recently released some modifications to NSP with some nice features, including better handling of Unicode characters and more efficient use of RAM. He's made a branch of NSP available at https://github.com/buerki/ngramprocessor with a home page at http://buerki.github.io/ngramprocessor
I think this is a great development for NSP users, so if you have found yourself struggling at all with Unicode or RAM utilization, I really encourage you to check out this project. Many thanks to Andy for making this available!

Cordially,
Ted

--
Ted Pedersen
http://www.d.umn.edu/~tpederse