I am faced with exactly the same problem at the moment. The only way round I have managed is pretty much exactly what you suggest. I use pdf2text (I assume this is v. similar to ps2ascii) to convert all the .pdf's to .txt's and then just hide the txt files on the search results page. Its not ideal but I use it conjunction with a Verity database search and UNION up the results.
I have looked at calling pdf2text via the cfexecute tag so that indexing can be completely automated via the CF scheduler but can't get it to work. Sorry I can't be any more help. I am also interested in any ideas on this matter. >>> "Jillian Carroll" <[EMAIL PROTECTED]> 10/09 2:10 pm >>> I can understand that. :) >From what I can tell the reason CF on Linux doesn't index PDFs is because the filter was not purchased from verity... here is my question: - I can run ps2ascii on my Linux box to convert the PDFs to text... which CF will be able to index. Given this, could anybody give me any guidance/suggestions on how I might create some sort of custom tag for CF to do this on the fly? Is that even possible? My only other alternative (that I can see) would be to create a shadow directory and use ps2ascii and shadow ALL of the PDFs on this site (400 or so)... let CF index that directory and then manipulate the search results to point back to the original PDF. I'd rather not have to do this. -- Jillian -----Original Message----- From: Jesse Noller [mailto:[EMAIL PROTECTED]] Sent: Wednesday, October 09, 2002 7:02 AM To: CF-Linux Subject: RE: Indexing PDFs See, this is why I need to finish my coffee before posting. Jesse Noller [EMAIL PROTECTED] Macromedia Server Development "No concept man forms is valid unless he integrates it without contradiction into the sum of his knowledge." - Ayn Rand > -----Original Message----- > From: Jillian Carroll [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, October 09, 2002 8:57 AM > To: CF-Linux > Subject: RE: Indexing PDFs > > Jesse, > > I am well aware of this... hence my asking for alternative suggestions. > > -- > Jillian > > -----Original Message----- > From: Jesse Noller [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, October 09, 2002 6:24 AM > To: CF-Linux > Subject: RE: Indexing PDFs > > > Read the release notes, AFAIK indexing PDFs on Linux is not, and has not > been supported. > > Jesse Noller > [EMAIL PROTECTED] > Macromedia Server Development > > > -----Original Message----- > > From: Jillian Carroll [mailto:[EMAIL PROTECTED]] > > Sent: Tuesday, October 08, 2002 2:34 PM > > To: CF-Linux > > Subject: Indexing PDFs > > > > I'm really running into a problem with the fact that CF on Linux cannot > > index PDFs... even though it works perfectly well on Windows. > > > > Does anybody have any suggestions for me? I'd be VERY appreciative! > > > > -- > > Jillian > > > > > > ______________________________________________________________________ Your ad could be here. Monies from ads go to support these lists and provide more resources for the community. http://www.fusionauthority.com/ads.cfm ------------------------------------------------------------------------------ Archives: http://www.mail-archive.com/cf-linux%40houseoffusion.com/ To Unsubscribe visit http://www.houseoffusion.com/index.cfm?sidebar=lists&body=lists/cf_linux or send a message to [EMAIL PROTECTED] with 'unsubscribe' in the body.
