My problem is that I can't feed the r2netcmd output back into htdig (and I
don't know how). That's why, at least, I wanted to index the result (.html)
directly, using in the htdig.conf "start_url= http://linuxsrv/test/
http://linuxsrv/tampon/"
In fact I don't know how to use the cmdl for indexing the r2net output.
Actually, the only changes I've made in doc2html.pl are below, i.e. copy the
htdig argv0 to a temporary .rtf file (r2netcmd needs a .rtf) and apply r2netcmd
to this temporary file, but I don't know how htdig indexes the result.
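The copy, convert, and hand-back steps described above could be sketched in shell roughly as follows. This is only a sketch under stated assumptions: `convert_to_stdout` is a hypothetical helper name, the converter argument is a placeholder for however r2netcmd is actually invoked, and the code assumes the converter writes `<name>.html` next to `<name>.rtf`, which may not match r2netcmd's real behaviour. The point it illustrates is that the HTML has to end up on stdout for htdig to read it.

```shell
#!/bin/sh
# convert_to_stdout CONVERTER INPUT   (hypothetical helper)
# Copy INPUT to a temporary .rtf, run CONVERTER on it, write the HTML to
# stdout, then clean up.
# ASSUMPTION: CONVERTER writes <name>.html next to <name>.rtf.
convert_to_stdout() {
    converter=$1
    input=$2
    dir=$(mktemp -d) || return 1
    cp "$input" "$dir/doc.rtf" || { rm -rf "$dir"; return 1; }
    # Send converter messages to stderr so they do not mix with the HTML.
    "$converter" "$dir/doc.rtf" >&2
    status=0
    if [ -f "$dir/doc.html" ]; then
        cat "$dir/doc.html"      # this is what the caller reads on stdout
    else
        status=1                 # no HTML was produced
    fi
    rm -rf "$dir"
    return $status
}

# Possible use (converter path is a placeholder):
#   convert_to_stdout /path/to/converter /some/file.rtf
```

Because the HTML is returned on stdout rather than left in a second directory, the indexed document would keep the URL of the original .rtf file.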
-----Original Message-----
From: David Adams [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, 13 June 2001 11:23
To: [EMAIL PROTECTED]; [EMAIL PROTECTED]
Subject: Re: [htdig] problem with r2net
I have never used r2netcmd, but it seems very configurable. Perhaps you
need to consult the manual (http://www.logictran.com/CustomRef.html) about
translation files?
I think that you do not need to use $ED like this with r2netcmd:
$cmdl = "(cp $Input $rtf ; $cmd $rtf | $ED 's#^<TITLE>$Input</TITLE>#<TITLE>[$Name]</TITLE>#' ; $RM $rtf)";
because you should be able to get r2netcmd to generate a sensible title
itself. If you can't, then shouldn't the line be:
$cmdl = "(cp $Input $rtf ; $cmd $rtf | $ED 's#^<TITLE>$rtf</TITLE>#<TITLE>[$Name]</TITLE>#' ; $RM $rtf)";
?
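The title substitution can be tried on its own, without r2netcmd, to check which name the pattern has to match. A minimal shell sketch, assuming the converter puts the temporary file's name in a `<TITLE>` line of its own; `fix_title` is a hypothetical helper name and the sample paths are made up:

```shell
#!/bin/sh
# fix_title TMPNAME REALNAME   (hypothetical helper)
# Read HTML on stdin; replace a <TITLE> line holding the temporary file
# name with the original document name in square brackets.
fix_title() {
    tmp=$1
    name=$2
    # '#' is the sed delimiter, so slashes in the path need no escaping.
    # Dots in TMPNAME act as "any character" in the pattern, which still
    # matches the literal name.
    sed "s#^<TITLE>${tmp}</TITLE>#<TITLE>[${name}]</TITLE>#"
}

# Example with a made-up temporary name standing in for $rtf:
printf '<TITLE>/tmp/htdig123.rtf</TITLE>\n<P>body</P>\n' |
    fix_title /tmp/htdig123.rtf report.rtf
```

If the title line comes through unchanged, the converter is probably writing something other than the temporary file name into `<TITLE>`, and the pattern needs adjusting to match.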
How are you getting r2netcmd to produce its output on STDOUT for input back
into htdig?
Sorry, I don't understand what you mean by "if I index my tampon directory
containing "�" but with a wrong directory and a bad extension.( Instead
http://linuxsrv/test/file.rtf i have http://linuxsrv/tampon/file.html )."
How are you using doc2html? Are we misunderstanding something about your
approach?
--
David Adams
Computing Services
Southampton University