Re: Search PDF ???
Hi Eric, Try zilverline http://www.zilverline.org Michael Eric Chow wrote: Hello, 1. Is it possibleto use Lucene to search PDF contents ? 2. Can it search Chinese contents PDF files ??? Eric - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Search PDF ???
--- Eric Chow [EMAIL PROTECTED] wrote: Hello, 1. Is it possibleto use Lucene to search PDF contents ? Yes, you need to use some external tools to extract the text from the PDF file and then pass it to lucene for indexing. If you do a search of this list you will get lot of mails related to that. 2. Can it search Chinese contents PDF files ??? I have used a tool called xpdf (in linux) and it works with both chinese traditional and chinese simplified. It gives language support packages for many of the languages. Please take a look at the URL below. http://www.foolabs.com/xpdf/download.html Now the tool only helps in extracting the text. Whether you can search chinese text or not depends on the analyzer you use in Lucene. Try CJKAnalyzer for CJK text search. Thanks, George ___ALL-NEW Yahoo! Messenger - all new features - even more fun! http://uk.messenger.yahoo.com - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
Re: Search PDF, Excel, Word, RTF files
Eric, Please refer to the FAQ. On Thu, 19 Dec 2002 10:21:05 +0800, Eric Chow said: Hello, Is it possible to search PDF, Excel, Word, RTF files in Lucene ? Would you please to give me a simple example? Best regards, Eric == If you know what you are doing, it is not called RESEARCH! == -- To unsubscribe, e-mail: mailto:[EMAIL PROTECTED] For additional commands, e-mail: mailto:[EMAIL PROTECTED]