i am tried configure nutch with subcollectios
i check with luke my index and the subcollections yes work.
now my question is:
how i can configure my seach page, for see my subcollections and select the
subcollection for do my search?
thanks
Bud Witney wrote:
>
> Will I figured out how to get Luke running on Fedora and I do have a
> field subcollection however its empty.
>
> Maybe my subcollection.xml file was to long or not valid.
>
> ? do I need to recrawl or can I just build index over to test new
> subcollection.xml file
>
> -Bud
>
>
> On Aug 14, 2006, at 11:11 AM, Sami Siren wrote:
>
>> Congratulations! You must be first person (trying) to use
>> subcollection plug-in.
>>
>> The correct syntax for querying is
>> subcollection:<subcollection-name> term
>>
>> You can check out the index for example with luke. Look for field
>> named subcollection, if the field is there and contains proper
>> values then your index is ok.
>>
>> --
>> Sami Siren
>>
>>
>> Bud Witney wrote:
>>> Any one have success with the subcollections plugin in 8 if so how
>>> have you setup and how do you query
>>> I with below settings.
>>> <property>
>>> <name>plugin.includes</name>
>>> <value>protocol-http|urlfilter-regex|parse-(text|html|js|pdf|swf|
>>> msword|mspowerpoint|rss)|index-(basic|more)|query-(basic|site|url|
>>> more)|subcollection|clustering-carrot2|summary-basic|scoring-opic</
>>> value> <description>Regular expression naming plugin directory
>>> names to
>>> include. Any plugin not matching this expression is excluded.
>>> In any case you need at least include the nutch-extensionpoints
>>> plugin. By
>>> default Nutch includes crawling just HTML and plain text via HTTP,
>>> and basic indexing and search plugins.
>>> </description>
>>> </property>
>>> For querying I tried collection:{collection name} term,
>>> subcollection:{collection name} term , and {collection name}: term
>>> the later had best results but did not seem to restrict to only
>>> the collection. It found items outside of the collection
>>> do I need to blacklist all others or is it a query /setup issue
>>> -Bud
>>>
>>
>>
>>
>> --
>> BEGIN-ANTISPAM-VOTING-LINKS
>> ------------------------------------------------------
>> Teach CanIt if this mail (ID 27113619) is spam:
>> Spam: https://antispam.osu.edu/b.php?
>> c=s&i=27113619&m=962356ec66d9
>> Not spam: https://antispam.osu.edu/b.php?
>> c=n&i=27113619&m=962356ec66d9
>> Forget vote: https://antispam.osu.edu/b.php?
>> c=f&i=27113619&m=962356ec66d9
>> ------------------------------------------------------
>> END-ANTISPAM-VOTING-LINKS
>>
>
>
>
--
View this message in context:
http://www.nabble.com/Subcollection-setup-and-use-tp5797884p14373949.html
Sent from the Nutch - User mailing list archive at Nabble.com.