Hi Gilles!
I already tried the url_rewrite_rules, but i got a strange
behavior.
E.g. the rule (.*)\\?(.*) \\0\\¶m=value should append
my parameter to every url with an already existing parameter.
But the spider indexes infinitely in a recursive manner:
url: abc.com/test?test=something
-> abc.com/test?test=something¶m=value
-> abc.com/test?test=something¶m=value¶m=value
-> abc.com/test?test=something¶m=value¶m=value¶m=value
-> ...
Winfried
> -----Ursprungliche Nachricht-----
> Von: Gilles Detillieux [mailto:[EMAIL PROTECTED]]
> Gesendet: Freitag, 19. April 2002 18:22
> An: [EMAIL PROTECTED]
> Cc: Thieme, Winfried; [EMAIL PROTECTED]
> Betreff: Re: AW: [htdig] add parameter to the url while indexing
>
>
> According to Ted Stresen-Reuter:
> > Well, I'm going to take a crack at it, but here's how I
> suspect Guilles
> > would answer...
> >
> > Rundig should index the sites as it normally does. Use the
> url_rewrite
> > options for htsearch to append arguments to urls only when
> users are
> > conducting the search.
>
> Well, I'm not sure who Guilles is, but if you mean me, I'd
> prefer not to
> have words put into my mouth. If you want to know how I'd
> answer, bear
> these points in mind.
>
> 1) Winfried mentioned not once, but twice, that he wants the
> parameters
> added to the URLs while indexing, not searching. I'd tend to
> take that
> at face value unless I had compelling reasons to question
> that, which I
> don't.
>
> 2) When I don't have exact recall of a particular attribute
> name, or the
> set of programs that use that attribute, I tend to verify my
> assumptions
> by checking attrs.html and/or FAQ.html (and sometimes the source code)
> for confirmation/correction before I post a reply, and include where
> possible a reference to the best source(s) of information.
> In this case,
> attrs.html makes it clear that (for version 3.1.6) url_rewrite_rules
> is used by htdig (at indexing time) and search_rewrite_rules is used
> by htsearch (at search time). As Winfried stated a preference for the
> former, I'd refer to
>
> http://www.htdig.org/attrs.html#url_rewrite_rules
>
> as the most likely solution to the problem. I'd normally leave it at
> that, unless I had reason to think there are likely problems with the
> proposed solution, in which case I might try to anticipate
> these and give
> further information, explanations or workarounds. In this
> case, though,
> as I can't anticipate what problems there might be with this, I would
> tend to wait until Winfried reads up on and tries this
> attribute, and I'd
> expect to see a follow-up post if there were problems with
> the solution.
>
> > It's not exactly what you're looking for, but without
> knowing more about why
> > you need to append args during the dig, it's hard to come up with
> > alternatives.
>
> Were it not for the fact that documented alternatives do exist in this
> case, I would tend to agree with this logic. When I can't
> find anything
> in the documentation that solves the problem directly, and
> after wracking
> my brain a bit to think of some clever but non-obvious
> application of some
> existing feature to solve the problem, I would usually
> propose something
> that's not quite what was asked for, in the hopes of
> prompting the person
> to search for alternative solutions, or I'd ask for more
> information about
> why the person thinks there's only one right way to tackle
> the problem.
>
> That, in a nutshell, is how I suspect I would answer.
>
> > Other than that, I have no idea how to append arguments
> during the dig.
> >
> > Good luck!
> >
> > Ted
> >
> > >From: "Thieme, Winfried" <[EMAIL PROTECTED]>
> > >To: 'Ted Stresen-Reuter' <[EMAIL PROTECTED]>,
> > >"'[EMAIL PROTECTED]'"
> > ><[EMAIL PROTECTED]>
> > >Subject: AW: [htdig] add parameter to the url while indexing
> > >Date: Wed, 17 Apr 2002 09:41:16 +0200
> > >
> > >Hello Ted!
> > >
> > >I don't need to add parameters while searching the database
> > >but while running htdig (indexing the website). Do you have
> > >any idea?
> > >
> > >Winfried
> > >
> > > > -----Ursprungliche Nachricht-----
> > > > Von: Ted Stresen-Reuter [mailto:[EMAIL PROTECTED]]
> ...
> > > > Winifred:
> > > >
> > > > I found myself having to add and strip url arguments to and
> > > > from the query
> > > > string too and have an example of how to do it using PHP here:
> > > >
> http://dev.susansexton.com/htdig/view.php?file=results-object.php
> > > >
> > > > You'll see the custom code near the top of the code.
> For a complete
> > > > description of how we integrated php with mysql you can
> check out
> > > > http://dev.susansexton.com/htdig/
> > > >
> > > > Ted Stresen-Reuter
> > > >
> > > >
> > > > >From: "Thieme, Winfried" <[EMAIL PROTECTED]>
> ...
> > > > >Is there any way to automatically add an url parameter to
> > > > every url the
> > > > >spider
> > > > >is indexing?
> > > > >
> > > > >e.g. calling the url "a/b/c.jsp" with "a/b/c.jsp?param=value"
> > > > >
> > > > >Winfried
>
> --
> Gilles R. Detillieux E-mail: <[EMAIL PROTECTED]>
> Spinal Cord Research Centre WWW:
> http://www.scrc.umanitoba.ca/
> Dept. Physiology, U. of Manitoba Winnipeg, MB R3E 3J7 (Canada)
>
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html