Hi Andrei I'm not sure what you exactly meant by "well-formed" pdf. In fact, we already have some deskewing function in MuseScore's lightweight omr, so we're able to handle some moderate distortion of the page. We also take scanned scores as default input; the test data was acquired from the online score repository IMSLP (see some here: https://github.com/liang-chen/MuseScore/tree/omr_dev_clean/omr/data). But if we can enhance or restore the low-quality pdf's in an efficient way, I believe that'll benefit the later recognition process.
Regarding OCR, the previous version developed by Werner Schweer had already used tesseract. We might need to revisit this part and enable it for the current program. If you're more interested in the current status, just take a look at this folder: https://github.com/musescore/MuseScore/tree/master/omr. I don't know in detail how Audiveris works, but I believe the future development of OMR should depend more on the recent breakthroughs of computer vision (graphical models, deep learning etc.) rather than heuristic-based approach (ad hoc segmentation, pixel projections etc). Traditional OMR used a lot of heuristics and the performance was quite unreliable. Liang -- View this message in context: http://dev-list.musescore.org/OCR-for-Musecscore-tp7579927p7579934.html Sent from the MuseScore Developer mailing list archive at Nabble.com. ------------------------------------------------------------------------------ What NetFlow Analyzer can do for you? Monitors network bandwidth and traffic patterns at an interface-level. Reveals which users, apps, and protocols are consuming the most bandwidth. Provides multi-vendor support for NetFlow, J-Flow, sFlow and other flows. Make informed decisions using capacity planning reports. https://ad.doubleclick.net/ddm/clk/305295220;132659582;e _______________________________________________ Mscore-developer mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/mscore-developer
