There's this article about using node.js to parse PDFs http://www.garysieling.com/blog/parsing-pdfs-at-scale-with-node-js-pdf-js-and-lunr-js
You may also be interested in https://github.com/mozilla/pdf.js/ even though it is meant to run in the browser On Thursday, 3 April 2014 16:02:54 UTC+2, Joseph Koziatek wrote: > > Hello all, > > Is there a way to convert an in memory binary string (pdf content) to text > from within node.js? > I receive pdf content through a web request and have the pdf in memory as > a string object. > I see there are many tools available that operate from system calls but I > was wondering > if there is anything to convert the pdf totally in memory without spawning > a system call.. > > Thanks In Advance > > Joe > -- -- Job Board: http://jobs.nodejs.org/ Posting guidelines: https://github.com/joyent/node/wiki/Mailing-List-Posting-Guidelines You received this message because you are subscribed to the Google Groups "nodejs" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/nodejs?hl=en?hl=en --- You received this message because you are subscribed to the Google Groups "nodejs" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. For more options, visit https://groups.google.com/d/optout.
