On 5 September 2013 06:53, Alberto Marques <[email protected]> wrote: > Hello > My question is simple, I believe in a project using Lucene. To be able to > index a website. As > http://boc.cantabria.es/boces/boletines.do?boton=UltimoBOCPublicado, seeking > information on pdf files. Is it possible?
Yes, it is eminently possible. I would suggest using Solr instead of Lucene directly. You should be able to get started by searching Google on the topic, or looking at the Solr Wiki, e.g., http://wiki.apache.org/solr/ExtractingRequestHandler If you need further help, such a question is better addressed to the solr-user mailing list rather than this one, which is meant for discussions related to development. Regards, Gora --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
