Hi Shukla,

Lucene indexes just "text" files. Therefore conversion of a pdf document(or 
word,excel,image etc.) to text is not related with Lucene. Before indexing, you 
should convert them to text.

IFilter provides just a standard approach for this kind of conversions.

Below link may be helpful for you 
http://www.codeproject.com/csharp/IFilter.asp

DIGY



-----Original Message-----
From: shukla dhaval v (JIRA) [mailto:[EMAIL PROTECTED] 
Sent: Monday, June 25, 2007 3:49 PM
To: [email protected]
Subject: [jira] Created: (LUCENENET-44) Indexing of some pdf files doesnt give 
desired result in ver 1.9.0.5 but works fine in ver 1.3.3.1

Indexing of some pdf files doesnt give desired result in ver 1.9.0.5 but works 
fine in ver 1.3.3.1
--------------------------------------------------------------------------------------------------

                 Key: LUCENENET-44
                 URL: https://issues.apache.org/jira/browse/LUCENENET-44
             Project: Lucene.Net
          Issue Type: Bug
         Environment: .NET, Windows XP,lucene.net ver1.9.0.5
            Reporter: shukla dhaval v


Dear Sir,
 
We are using lucene.net ver. 1.9.0.5 for content searching. The problem 
we are facing is with indexing of .pdf files. We have installed the 
ifilters for pdf files. There are certain pdf files which give result 
with the older version of lucene.net 1.3.3.1 but not with the current 
one.  Please advise how to solve this issue.
 
Thank you
Dhaval Shukla
Programmer
Sansun Software Pvt Ltd
 
Product Development Division of:
Easy Data Access
5988 Mid Rivers Mall Drive
St. Charles, MO 63304
www.edausa.com


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to