Hi,
Some of the sections of my client's site are directories of various sorts, e.g.:
http://www.edinburgh.gov.uk/environmentaldirectory/Index.jsp
Since this section is effectively a gateway, we wanted to set things up so
that a relevant search might bring up the external sites' home page within
the results, as well as any relevant pages from the main site.
To do this, we are doing one dig of the main site, with unrestricted
depth/hops but restricted URLs.
The URL database that this generates is then fed in as the start_url for a
second dig, with depth/hops limited but unrestricted URLs, feeding into a
second database.
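
To illustrate the step between the two digs, here is a minimal sketch in
Python. It assumes the URLs found by the first dig have already been exported
as a plain list; the function name, the sample URLs and the idea of filtering
by host are illustrative assumptions, not necessarily how our setup does it.

```python
from urllib.parse import urlparse


def external_start_urls(urls, internal_host="www.edinburgh.gov.uk"):
    """Return the unique http(s) URLs whose host is not internal_host.

    Hypothetical helper: the order of first appearance is preserved so the
    resulting list can be pasted into a start_url setting for the second dig.
    """
    seen = set()
    result = []
    for raw in urls:
        url = raw.strip()
        parts = urlparse(url)
        # Skip anything that is not a web URL (mailto:, empty lines, etc.).
        if parts.scheme not in ("http", "https") or parts.hostname is None:
            continue
        # hostname is already lower-cased by urlparse.
        if parts.hostname == internal_host.lower():
            continue
        if url not in seen:
            seen.add(url)
            result.append(url)
    return result


if __name__ == "__main__":
    # Sample input only; example.org stands in for a real external site.
    sample = [
        "http://www.edinburgh.gov.uk/environmentaldirectory/Index.jsp",
        "http://www.example.org/",
        "http://www.example.org/",
        "mailto:someone@example.org",
    ]
    print(" ".join(external_start_urls(sample)))
```

The space-separated output is meant to be used as the start list for the
second, depth-limited dig.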
This part appears to work okay, since I can do a search on this second
database and get some results.
This 'external sites' database is then merged into the main site database, but
it then becomes impossible to get any 'external' results from a search.
I'm running htmerge with -vv but can't see any sign of a problem in the
output. Can anyone suggest what I should be looking for?
I am running ht://Dig 3.1.6 on Windows Server 2003 / Apache 2.

Test page is available at:
http://search.edinburgh.gov.uk/htdig/external.html
My current test string is:
conference on road pricing
Which should get you the TransformScotland site for the first search, but
only internal documents for the second search.

Thanks,
Mike




