According to Da Ros Fabio:
> Because in the active pages (PHP,ASP) there is html code something, and
> htdig work on html documents, what is the configuration of htidg for
> search the html code in a active pages?
> For example if i have this php pages
>  
> -------------------------------
> <?php
> .........
> ?>
>  /* html code*/
> <?
> ?>
> ---------------------------------
>  
> how can search the word in html code with htdig? How i can configure
> htdig for to do this? I suppose that htdig should search php and asp
> pages in the local url and not from web server.

I assume that when you wrote this, you hadn't read my earlier reply to
you very carefully.  The whole point of my message is that ASP and PHP
do indeed produce HTML code, but they must be server-parsed in order
to do this.  The only way to get to the bottom of the problem is to
run with -vvv or -vvvv and see what's happening when htdig gets to those
files.  The trick then is to correctly interpret the results.

Later you wrote:
> I'am running htdig and htmerge for indexing the file in a site and i 've
> some result.
>  
> Htdig  :
> ------------------------------------------------------------
> pick: www.prova.net, # servers = 1
> 5:5:1:http://www.prova.net/test.php3: Trying local files
>   found existing file [EMAIL PROTECTED]/test.php3
>  no server running

OK, when it says "found existing file", that just means the file exists,
but not necessarily that it can use it.  See the documentation for
local_urls (http://www.htdig.org/attrs.html#local_urls).  Files with
.php3 aren't fetched locally because they need to be parsed by the server.
The problem is you don't have a server running, as evidenced by the "no
server running" message.  Earlier in the htdig output, you should be able
to find the first attempt that htdig made to contact the www.prova.net
HTTP server.  This attempt, or perhaps a later attempt, failed, so htdig
gave up trying the server again.  If you don't have a web server running
on this machine, you're going to need it to index active pages.

> pick: www.prova.net, # servers = 1
> 6:6:2:http://www.prova.net/index.htm: Trying local files
>   tried local file [EMAIL PROTECTED]/index.htm
>  no server running

The "tried local file" message means essentially tried and failed.
Either that file doesn't exist, or it's not a regular file.  Again,
normally htdig would fall back to the HTTP server in this case, but
as the server is not running, it can't.

> pick: www.prova.net, # servers = 1
> 7:7:2:http://www.prova.net/prova2.html: Trying local files
>   found existing file [EMAIL PROTECTED]/prova2.html

Here is an existing local file, with a correct suffix (or extension),
so htdig fetches it locally and puts it in the index.  It would do
the same with .htm files, as long as they exist, but not .asp or .php3
files.

> ---------------------------------------------------------
>  
> htmerge.
> ------------------------------------------------------------------------
> -------------
> Deleted, no excerpt: 6/http://www.prova.net/index.htm
> 0/http://www.prova.net/
> 2/http://www.prova.net/avvertenze.html
> 1/http://www.prova.net/prova.html
> 7/http://www.prova.net/prova2.html
> 3/http://www.prova.net/test.html
> Deleted, no excerpt: 5/http://www.prova.net/test.php3
> ------------------------------------------------------------------------
> ------------
> 
> I suppose that there is a configuration problem becasue htmerge don't
> read htm and php file php.

Well, technically htmerge doesn't read any of these document files.
It's htdig that does, or attempts to, and it simply puts commands into
the wordlist telling htmerge to clean up after it by removing records
for documents it couldn't index.

You need to find out why your HTTP server isn't running.

-- 
Gilles R. Detillieux              E-mail: <[EMAIL PROTECTED]>
Spinal Cord Research Centre       WWW:    http://www.scrc.umanitoba.ca/~grdetil
Dept. Physiology, U. of Manitoba  Phone:  (204)789-3766
Winnipeg, MB  R3E 3J7  (Canada)   Fax:    (204)789-3930

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to