Wonderfull; the indexing is now correct.
In fact I'll not read the r2net output to make the new STDOUT.
Thank you very much David
-----Message d'origine-----
De : David Adams [mailto:[EMAIL PROTECTED]]
Envoy� : mercredi 13 juin 2001 12:26
� : [EMAIL PROTECTED]
Cc : htdig
Objet : Re: [htdig] problem with r2net
You should use doc2html.pl to produce output for htdig to index. Perhaps
this would work:
my $file = "$TMP/rtf.$$";
$cmdl = "(cp $Input $rtf ; $cmd -o $file $rtf ; cat $file; $RM $rtf $file)";
The command copies the file output by htdig to a file with the suffix .rtf
which r2netcmd will read,
then runs r2netcmd with output going to file $file, which is then copied to
STDOUT, and finally the
two files are deleted.
You do _not_ need $ED as r2netcmd should produce a sensible title.
--
David Adams
Computing Services
Southampton University
----- Original Message -----
From: <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Cc: <[EMAIL PROTECTED]>
Sent: Wednesday, June 13, 2001 11:03 AM
Subject: RE: [htdig] problem with r2net
> My problem is that I can't (and i don't know how)input back into htdig the
> r2netcmd output. that's why ,at least, I wanted to index the result
(.html)
> using in the htdig.conf "start_url= http://linuxsrv/test/
> http://linuxsrv/tampon/"
> in fact I don't know how use the cmdl for indexing the r2net output.
>
> Actually the only change I've made in doc2html.pl are bellow, ie copy the
> htdig argv0 in a temporary file .rtf (r2netcmd need a .rtf), apply
r2netcmd
> to this temporary file , but I don't know how htdig index the result.
>
>
> -----Message d'origine-----
> De : David Adams [mailto:[EMAIL PROTECTED]]
> Envoy� : mercredi 13 juin 2001 11:23
> � : [EMAIL PROTECTED]; [EMAIL PROTECTED]
> Objet : Re: [htdig] problem with r2net
>
>
> I have never used r2netcmd, but it seems very configurable. Perhaps you
> need to consult the manual (http://www.logictran.com/CustomRef.html) about
> translation files?
>
> I think that you do not need to use $ED like this with r2netcmd:
>
> $cmdl = "(cp $Input $rtf ; $cmd $rtf | $ED
> 's#^<TITLE>$Input</TITLE>#<TITLE>[$Name]</TITLE>#' ; $RM $rtf)";
>
> because you should be able to get r2netcmd to generate a sensible title
> itself. If you can't, then shouldn't the line be:
>
> $cmdl = "(cp $Input $rtf ; $cmd $rtf | $ED
> 's#^<TITLE>$rtf</TITLE>#<TITLE>[$Name]</TITLE>#' ; $RM $rtf)";
>
> ?
>
> How are you getting r2netcmd to produce its output on STDOUT for input
back
> into htdig?
>
> Sorry, I don't under stand what you mean by "if I index my tampon
directory
> containing "�" but with a wrong directory and a bad extension.( Instead
> http://linuxsrv/test/file.rtf i have http://linuxsrv/tampon/file.html )."
> How are you using doc2html? Are we misunderstanding something about your
> approach?
>
> --
> David Adams
> Computing Services
> Southampton University
>
>
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to
<[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html