Hi, we are new to Lucene. We would like to use Lucene for our archive project. In this project we have to get some images of documents, get text out of them via OCR and index them using Lucene. In order to see if Lucene is suitable for our project we need to test Lucene with sample data. But we need huge data set that is composed of images of documents. I searched the net but couldn't find something. Could anyone suggest something about this issue?
Thanks in advance, -- Deniz