no no,
the external converter is started from aspseek during the index process when aspseek finds a pdf file.
so in your case:
when aspseek indexes www.crazy.com and finds beer.pdf it starts the converter. the converter reads the pdf-document convert it to txt/html. now aspseek indexes this export.
no your users can search also in pdf documents. so when "beer" is in beer.bdf, aspseek will list the link to beer.pdf as a result and even displays the short extract. your users now can click on the link and acrobat reader opens to display the pdf-file.
so external converter means a helper programme for apseek to index pdf-documents.
Markus Rietzler
* kommunikation & online service
* RZF NRW
* Tel: 0211.4572-130
-----Urspr�ngliche Nachricht-----
Von: Diego Montalvo [mailto:[EMAIL PROTECTED]]
Gesendet am: Donnerstag, 21. Februar 2002 16:55
An: [EMAIL PROTECTED]
Betreff: Re: [aseek-users] ASPSeek - PDF / RTF
Kir,
I am somewhat confused, so ASPSeek will crawl and
index .PDF and such files, but will not present them
as .html? Therefore I need a external converter?
Or does an external converter first convert, then I
run ASPSeek?
example: I want to index "www.crazy.com/beer.pdf" i
simply use ASPSeek, to retreive words from "beer.pdf"
but then I mst use an external program to view in
html?
do you have a link to such a search engine using
ASPSeek with external converters?
Diego
--- Kir Kolyshkin <[EMAIL PROTECTED]> wrote:
> Diego Montalvo wrote:
> >
> > Hello,
> >
> > In the ASPSeek Manual pages there is a mention
> that
> > ASPSeek understands PDF, RTF formats with help of
> an
> > external program, what program is that? I would
> like
> > to embed it into ASPSeek.
>
> There's no need to embed. Manual talks about
> External Converters,
> described in
> http://www.aspseek.org/man/aspseek.conf.5.html#lbAM
> So as long as you have program that can convert,
> say, pdf to html,
> you can index pdf documents with aspseek.
>
> Good ps to text (or html) converter is here:
> http://www.nzdl.org/html/prescript.html
> There are also links to other such tools.
>
> As for converter from rtf or doc format, I know of
> word2x: http://word2x.alcom.co.uk/
> antiword: http://www.winfield.demon.nl/index.html
> unrtf: http://www.geocities.com/tuorfa/unrtf.html
> --
> [EMAIL PROTECTED] http://kir.vtx.ru/ ICQ 7551596
> Phone +7 903 6722750
> Hi, I'm a signature virus: copy me to your
> .signature to help me spread!
> --
__________________________________________________
Do You Yahoo!?
Yahoo! Sports - Coverage of the 2002 Olympic Games
http://sports.yahoo.com
