While it won't fix the problem, as a last ditch effort you can consider using my pdfUtils CFC from RIAForge. It has a function to read the text out of a PDF. You could read that in and then index the text. It is what my Seeker project (CF/Lucene integration) uses to read and index PDF files.
On Mon, Oct 27, 2008 at 7:50 AM, JediHomer <[EMAIL PROTECTED]> wrote: > Hi, > > We're currently having an issue on Linux with Verity. > > We're using CF8 on Debian, ubuntu for dev, where a client has PDFs > that need indexing but contain extended characters in the filename, in > particular the registered symbol. > > On Linux this is not being indexed by the Verity engine, on Windows > (Where we are migrating away from) these files are correctly indexed. > > Using CFDirectory and Java's listFiles() I can confirm that CF and > Java both see the files correctly, a FileExists() also returns true, > however when CFIndexing the paths I get the following: > > "Error E3-0422 (VDK ): Could not find file > /home/jedi/Dev/htdocs/jedi/verity/mypdf.pdf" ( -- Including the > extended char, CF-Talk won't accept the Base64 post) > > In the Messages section of the Status dump (the symbol is shown > correctly) but in the bad keys section of the Status dump the > character does not come out correctly. > > Has anyone come across this before and know of a fix/work around? > > Thanks in advance > > > Jedi > > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| Adobe® ColdFusion® 8 software 8 is the most important and dramatic release to date Get the Free Trial http://ad.doubleclick.net/clk;207172674;29440083;f Archive: http://www.houseoffusion.com/groups/cf-talk/message.cfm/messageid:314396 Subscription: http://www.houseoffusion.com/groups/cf-talk/subscribe.cfm Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=11502.10531.4

