Throwing in my 2 cents.
cfexecute isn't the greatest thing in world, but it does work. Make sure
you reference the absolute path to any command you're going to execute.
Bad: pdf2text mypdf.pdf mypdf.txt
Good: /usr/bin/pdf2text /opt/docs/pdfs/mypdf.pdf /opt/docs/pdfs/mypdf.txt
Better: If you're going to use cfexecute to run pdf2text (or ps2ascii)
and pass parameters, it's much better to wrap it in a shell script and pass
parameters to the script. It simplifies things greatly.
#!/bin/sh
#######################################################
# pdf2ascii.sh - a shell script wrapper for pdf2text
PDFPATH=/opt/docs/pdfs
FILENAMEIN=$1
FILENAMEOUT=$2
/path/to/pdf2text $PDFPATH/$FILENAMEIN $PDFPATH/$FILENAMEOUT
To execute via CF
<cfexecute name="/path/to/script/pdf2ascii.sh"
arguments="pdffile.pdf pdffile.txt"
timeOut="60"></cfexecute>
At 12:17 PM 10/10/2002 +0100, Adam Bailin wrote:
>I am faced with exactly the same problem at the moment. The only way round
>I have managed is pretty much exactly what you suggest. I use pdf2text (I
>assume this is v. similar to ps2ascii) to convert all the .pdf's to .txt's
>and then just hide the txt files on the search results page. Its not ideal
>but I use it conjunction with a Verity database search and UNION up the
>results.
>
>I have looked at calling pdf2text via the cfexecute tag so that indexing
>can be completely automated via the CF scheduler but can't get it to work.
>
>Sorry I can't be any more help. I am also interested in any ideas on this
>matter.
>
> >>> "Jillian Carroll" <[EMAIL PROTECTED]> 10/09 2:10 pm >>>
>I can understand that. :)
>
> From what I can tell the reason CF on Linux doesn't index PDFs is because
>the filter was not purchased from verity... here is my question:
>
>- I can run ps2ascii on my Linux box to convert the PDFs to text...
>which CF
>will be able to index.
>
>Given this, could anybody give me any guidance/suggestions on how I might
>create some sort of custom tag for CF to do this on the fly? Is that even
>possible?
>
>My only other alternative (that I can see) would be to create a shadow
>directory and use ps2ascii and shadow ALL of the PDFs on this site (400 or
>so)... let CF index that directory and then manipulate the search results to
>point back to the original PDF. I'd rather not have to do this.
>
>--
>Jillian
>
>-----Original Message-----
>From: Jesse Noller [mailto:[EMAIL PROTECTED]]
>Sent: Wednesday, October 09, 2002 7:02 AM
>To: CF-Linux
>Subject: RE: Indexing PDFs
>
>
>See, this is why I need to finish my coffee before posting.
>
>Jesse Noller
>[EMAIL PROTECTED]
>Macromedia Server Development
>
>"No concept man forms is valid unless he
>integrates it without contradiction into the
>sum of his knowledge."
>- Ayn Rand
>
> > -----Original Message-----
> > From: Jillian Carroll [mailto:[EMAIL PROTECTED]]
> > Sent: Wednesday, October 09, 2002 8:57 AM
> > To: CF-Linux
> > Subject: RE: Indexing PDFs
> >
> > Jesse,
> >
> > I am well aware of this... hence my asking for alternative suggestions.
> >
> > --
> > Jillian
> >
> > -----Original Message-----
> > From: Jesse Noller [mailto:[EMAIL PROTECTED]]
> > Sent: Wednesday, October 09, 2002 6:24 AM
> > To: CF-Linux
> > Subject: RE: Indexing PDFs
> >
> >
> > Read the release notes, AFAIK indexing PDFs on Linux is not, and has not
> > been supported.
> >
> > Jesse Noller
> > [EMAIL PROTECTED]
> > Macromedia Server Development
> >
> > > -----Original Message-----
> > > From: Jillian Carroll [mailto:[EMAIL PROTECTED]]
> > > Sent: Tuesday, October 08, 2002 2:34 PM
> > > To: CF-Linux
> > > Subject: Indexing PDFs
> > >
> > > I'm really running into a problem with the fact that CF on Linux cannot
> > > index PDFs... even though it works perfectly well on Windows.
> > >
> > > Does anybody have any suggestions for me? I'd be VERY appreciative!
> > >
> > > --
> > > Jillian
> > >
> > >
> >
> >
>
>
>
______________________________________________________________________
Structure your ColdFusion code with Fusebox. Get the official book at
http://www.fusionauthority.com/bkinfo.cfm
------------------------------------------------------------------------------
Archives: http://www.mail-archive.com/cf-linux%40houseoffusion.com/
To Unsubscribe visit
http://www.houseoffusion.com/index.cfm?sidebar=lists&body=lists/cf_linux or send a
message to [EMAIL PROTECTED] with 'unsubscribe' in the body.