On 6/1/06, Jens Kraemer <[EMAIL PROTECTED]> wrote:
> On Thu, Jun 01, 2006 at 09:35:18AM +0200, Tom On wrote:
> >
> > Okay, thanks. I've got it working now for simple text files.  Can
> > anybody share any experience/opinion they have on using Ruby to process
> > and index/search Microsoft documents and PDFs ???  Thanks for any help.
>
> In RDig I use the wvText and pdftotext to extract textual content from word
> and pdf documents. Imho there is no Ruby lib yet to do this.

I second that. I've tried the Ruby PDF reader alternatives on RAA
without much luck. If anyone knows a good open-source C library for
reading PDFs, I'd be happy to write some bindings. But I think wvText
and pdftotext are your best options right now.
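
For what it's worth, here is a minimal sketch of shelling out to those
two tools from Ruby. It assumes wvText and pdftotext are installed and
on your PATH, and that both are called as "tool input output". The
names EXTRACTORS, extractor_for and extract_text are made up for this
example; this is not how RDig does it.

```ruby
require "open3"
require "tempfile"

# Hypothetical mapping from file extension to external converter.
# Assumes both tools are on the PATH.
EXTRACTORS = { ".pdf" => "pdftotext", ".doc" => "wvText" }.freeze

# Returns the converter name for a path, or nil for anything else.
def extractor_for(path)
  EXTRACTORS[File.extname(path).downcase]
end

# Returns the textual content of a file as a String.
# Files with no known converter are read as plain text.
def extract_text(path)
  tool = extractor_for(path)
  return File.read(path) if tool.nil?

  Tempfile.create(["extracted", ".txt"]) do |out|
    out.close # the external tool writes to this path itself
    _stdout, stderr, status = Open3.capture3(tool, path, out.path)
    raise "#{tool} failed: #{stderr}" unless status.success?
    File.read(out.path)
  end
end
```

The string that extract_text returns can then be indexed the same way
you are already indexing your simple text files. Passing the arguments
to Open3.capture3 separately, rather than as one command string, avoids
shell quoting problems with odd file names.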
_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk