Tom, I noticed the en-default.pynn.gz file is huge, 80Mb, 350Mb uncompressed. I thought neural networks were just small sets of weights. Why are the models so big?
Laurent On Tuesday, December 16, 2014 at 11:26:52 PM UTC+1, Tom wrote: > > I joined Google this year. Google permits me to spend time on the OCRopus > project and contribute. As part of this, I moved the project to Github, > because it's easier to maintain there. > > I just pushed out a new update of ocropy. This includes mainly > faster/smaller saving of models, as well as a C++ implementation of the > LSTM network. The C++ LSTM implementation is a pretty straightforward port > of the Python version and runs much faster. The C++ classes have been > wrapped as Python classes and are callable from Python. There are two new > top-level drivers, ocropus-ltrain and ocropus-lpred, for the C++ > implementation. The C++ implementation appears to be numerically close to > the Python implementation and yield good recognizers when trained, but it > requires more testing. > > As before, this is research-level software with minimal documentation (do > look at the iPython Notebooks, the .ipynb files, since they contain > significant information). Feel free to contribute patches, documentation, > etc. using the usual Github mechanisms of merge requests. I'll try to > incorporate them as time permits. > > Tom > -- You received this message because you are subscribed to the Google Groups "ocropus" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/ocropus/106b6a90-1c61-4c06-bfa9-5bca72967f99%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
