Hi,
I noticed that "subcollection" field has multiple space separated
subcollection values for documents that are in more than one
subcollection. I read the sources and tried several syntaxes.
Is there a way to search for membership in multiple subcollections?
A site:
site.foo.com/index.html
site.foo.com/faq.html
site.foo.com/contact.html
site.foo.com/bar/bar1.html
site.foo.com/foo/foo1.html
A subcollections.xml
<subcollections>
<subcollection>
<name>foosite</name>
<id>foosite</id>
<whitelist>http://site.foo.com</whitelist>
<blacklist />
</subcollection>
<subcollection>
<name>foobar</name>
<id>foobar</id>
<whitelist>http://site.foo.com/bar</whitelist>
<blacklist />
</subcollection>
</subcollections>
subcollection field for site.foo.com/bar/bar1.html
"foosite foobar"
subcollection field for site.foo.com/foo/foo1.html
"foosite"
I would like to search for documents in subcollections
foosite AND foobar
foosite OR foobar
Thanks,
Mark
Mark Jones
Sr. Systems Integration Specialist
[EMAIL PROTECTED]
-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general