Will I figured out how to get Luke running on Fedora and I do have a
field subcollection however its empty.
Maybe my subcollection.xml file was to long or not valid.
? do I need to recrawl or can I just build index over to test new
subcollection.xml file
-Bud
On Aug 14, 2006, at 11:11 AM, Sami Siren wrote:
Congratulations! You must be first person (trying) to use
subcollection plug-in.
The correct syntax for querying is
subcollection:<subcollection-name> term
You can check out the index for example with luke. Look for field
named subcollection, if the field is there and contains proper
values then your index is ok.
--
Sami Siren
Bud Witney wrote:
Any one have success with the subcollections plugin in 8 if so how
have you setup and how do you query
I with below settings.
<property>
<name>plugin.includes</name>
<value>protocol-http|urlfilter-regex|parse-(text|html|js|pdf|swf|
msword|mspowerpoint|rss)|index-(basic|more)|query-(basic|site|url|
more)|subcollection|clustering-carrot2|summary-basic|scoring-opic</
value> <description>Regular expression naming plugin directory
names to
include. Any plugin not matching this expression is excluded.
In any case you need at least include the nutch-extensionpoints
plugin. By
default Nutch includes crawling just HTML and plain text via HTTP,
and basic indexing and search plugins.
</description>
</property>
For querying I tried collection:{collection name} term,
subcollection:{collection name} term , and {collection name}: term
the later had best results but did not seem to restrict to only
the collection. It found items outside of the collection
do I need to blacklist all others or is it a query /setup issue
-Bud
--
BEGIN-ANTISPAM-VOTING-LINKS
------------------------------------------------------
Teach CanIt if this mail (ID 27113619) is spam:
Spam: https://antispam.osu.edu/b.php?
c=s&i=27113619&m=962356ec66d9
Not spam: https://antispam.osu.edu/b.php?
c=n&i=27113619&m=962356ec66d9
Forget vote: https://antispam.osu.edu/b.php?
c=f&i=27113619&m=962356ec66d9
------------------------------------------------------
END-ANTISPAM-VOTING-LINKS