On 6/1/06, Jens Kraemer <[EMAIL PROTECTED]> wrote:
> On Thu, Jun 01, 2006 at 09:35:18AM +0200, Tom On wrote:
> >
> > Okay, thanks. I've got it working now for simple text files. Can
> > anybody share any experience/opinion they have on using Ruby to process
> > and index/search Microsoft documents and PDFs ??? Thanks for any help.
>
> In RDig I use the wvText and pdftotext to extract textual content from word
> and pdf documents. Imho there is no Ruby lib yet to do this.

I second that. I've tried the Ruby PDF reader alternatives on RAA without
much luck. If anyone knows of a good open-source C library for reading PDFs,
I'd be happy to write some bindings. But I think wvText and pdftotext are
your best options right now.
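
For what it's worth, shelling out to those two tools from Ruby could look roughly like the sketch below. It is only a sketch under some assumptions: both binaries are on your PATH, files are told apart by extension alone, and anything that is not .pdf or .doc is read as plain text. The helper names extraction_command and extract_text are made up for this example and do not come from RDig or Ferret.

```ruby
require "tempfile"

# Hypothetical helper: pick the external converter by file extension.
# Both tools accept "<input> <output>" and write plain text to <output>.
# Returns nil for anything we do not know how to convert.
def extraction_command(path, out_path)
  case File.extname(path).downcase
  when ".pdf" then ["pdftotext", path, out_path]
  when ".doc" then ["wvText", path, out_path]
  end
end

# Hypothetical helper: return the textual content of a file as a String.
def extract_text(path)
  Tempfile.create("extract") do |tmp|
    cmd = extraction_command(path, tmp.path)

    # Assumption: unknown extensions are already plain text.
    return File.read(path) if cmd.nil?

    # system returns false on a non-zero exit and nil if the
    # command could not be run at all (e.g. tool not installed).
    ok = system(*cmd)
    raise "#{cmd.first} failed for #{path}" unless ok

    File.read(tmp.path)
  end
end
```

The string returned by extract_text("report.pdf") is what you would then hand to the indexer as the document's content field. Passing the command as an argument list rather than a single string keeps odd file names (spaces, quotes) away from the shell.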

