Author: Garvin Hicking
Email: [EMAIL PROTECTED]
Message:
Hi!

I have to index a website that comes in two languages, each of which also has a 
restricted area. That makes four parts which have to be searchable separately, plus 
two additional tags for language-independent searches.

I want to do this with tags, so I just want to use something like:

----------------------------------
Tag German_Normal
Server path http://www.server.com/index.html?lang=ger
Realm String NoMatch *language=en*
# SHOULD deny access to every link which contains the string "language=en", so the 
# spider can't change to English pages.

Tag German_Restricted
Server path http://www.server.com/index.html?lang=ger
Server path http://www.server.com/login.html?lang=ger&user=robot&pass=spider
# Will log in the spider and then redirect to index.html, where it will gain access to 
# different, restricted areas and index those as well.
Realm String NoMatch *language=en*

Tag English_Normal
Server path http://www.server.com/index.html?lang=en
Realm String NoMatch *language=de*

Tag English_Restricted
Server path http://www.server.com/index.html?lang=en
Server path http://www.server.com/login.html?lang=en&user=robot&pass=spider
Realm String NoMatch *language=en*

Tag Normal
Server path http://www.server.com/index.html?lang=en
Server path http://www.server.com/index.html?lang=ger

Tag Restricted
Server path http://www.server.com/index.html?lang=en
Server path http://www.server.com/index.html?lang=ger
Server path http://www.server.com/login.html?lang=en&user=robot&pass=spider
Server path http://www.server.com/login.html?lang=ger&user=robot&pass=spider
----------------------------------

The above does not work as expected. The "Realm String NoMatch *language=en*" line seems 
to have no effect on what mnoGoSearch indexes: URLs containing "language=xx" are 
indexed anyway. If I use a "Disallow" command instead, it works flawlessly with a single 
tag. As soon as two tags come into play, the Disallow commands add up and, in the 
end, not a single file gets indexed.
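
The accumulation effect can be reproduced in a small Python sketch. It assumes that 
Disallow rules are collected into one global list rather than kept per tag, which is 
only my reading of the observed behaviour, and that "*" wildcards behave like shell 
globs as modelled by Python's fnmatch. The two patterns and the is_allowed helper are 
hypothetical, chosen to match the "lang=" parameter in the start URLs.

```python
from fnmatch import fnmatchcase

# Start URLs taken from the configuration above.
URLS = [
    "http://www.server.com/index.html?lang=ger",
    "http://www.server.com/index.html?lang=en",
]


def is_allowed(url, disallow_patterns):
    """Hypothetical helper: a URL passes only if no Disallow pattern matches it."""
    return not any(fnmatchcase(url, p) for p in disallow_patterns)


# Hypothetical per-tag rules: each tag excludes the other language.
german_rules = ["*lang=en*"]
english_rules = ["*lang=ger*"]

# With only the German tag's rule in force, the German page still passes.
german_only = [u for u in URLS if is_allowed(u, german_rules)]

# ASSUMPTION: rules from all tags end up in one global list.
combined = [u for u in URLS if is_allowed(u, german_rules + english_rules)]

print(german_only)
print(combined)
```

Under these assumptions the combined list is empty, which would match the "not a single 
file gets indexed" result.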

With the configuration shown above, all pages get indexed regardless of language, and 
judging by the url.txt file, each file only carries the first tag it appears under.
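
One thing that can be checked offline is whether the NoMatch patterns match the URLs 
at all: the patterns filter on "language=", while the start URLs in the configuration 
use "lang=". The Python sketch below assumes the "*" wildcard behaves like a shell glob 
(modelled with fnmatch, which may differ from mnoGoSearch's own matcher); matching_urls 
is a hypothetical helper. If the links on the real site do use "language=xx", this 
mismatch does not apply.

```python
from fnmatch import fnmatchcase

# Start URLs and patterns copied from the configuration above.
START_URLS = [
    "http://www.server.com/index.html?lang=ger",
    "http://www.server.com/index.html?lang=en",
    "http://www.server.com/login.html?lang=ger&user=robot&pass=spider",
    "http://www.server.com/login.html?lang=en&user=robot&pass=spider",
]
PATTERNS = ["*language=en*", "*language=de*"]


def matching_urls(pattern, urls):
    """Hypothetical helper: return the URLs that a glob-style pattern matches."""
    return [u for u in urls if fnmatchcase(u, pattern)]


for pattern in PATTERNS:
    # Neither pattern matches any start URL, because the URLs use "lang=".
    print(pattern, "->", matching_urls(pattern, START_URLS))
```

For comparison, matching_urls("*lang=en*", START_URLS) picks out the two English start 
URLs.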

Am I using the Tag command the wrong way? I currently use the file support so as not 
to load the SQL server on the machine too much. Would using SQL give different 
results?

Thanks for *ANY* help!

Regards,
Garvin.

Reply: <http://www.mnogosearch.org/board/message.php?id=4924>
