I am trying to configure Nutch with subcollections.

I checked my index with Luke, and the subcollections do work.

Now my question is:

How can I configure my search page so that it shows my subcollections and
lets me select the subcollection to search in?
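
One possible approach, sketched under assumptions: going by the query
syntax quoted below (subcollection:<subcollection-name> term), the search
page could offer the subcollection names in a drop-down and prefix the
user's terms with the chosen name before the query is run. The class and
method names here are hypothetical and not part of Nutch; how the selected
value reaches the page (for example as a request parameter) is also an
assumption.

```java
// Hypothetical helper, not part of Nutch: builds the query string that
// would be handed to the search page's existing query handling.
final class SubcollectionQuery {
    private SubcollectionQuery() {}

    /**
     * Returns "subcollection:<name> <terms>" when a subcollection is chosen,
     * or just the trimmed terms when none is selected.
     */
    static String restrict(String subcollection, String terms) {
        String t = (terms == null) ? "" : terms.trim();
        if (subcollection == null || subcollection.trim().isEmpty()) {
            return t; // no restriction: search the whole index
        }
        return ("subcollection:" + subcollection.trim() + " " + t).trim();
    }
}
```

With a subcollection named docs and the term nutch, this yields the query
string subcollection:docs nutch; with nothing selected, the terms are
passed through unchanged.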

Thanks.


Bud Witney wrote:
> 
> Well, I figured out how to get Luke running on Fedora, and I do have a  
> subcollection field; however, it's empty.
> 
> Maybe my subcollection.xml file was too long or not valid.
> 
> Do I need to recrawl, or can I just rebuild the index to test the new  
> subcollection.xml file?
> 
> -Bud
> 
> 
> On Aug 14, 2006, at 11:11 AM, Sami Siren wrote:
> 
>> Congratulations! You must be the first person (trying) to use the  
>> subcollection plug-in.
>>
>> The correct syntax for querying is
>> subcollection:<subcollection-name> term
>>
>> You can check the index with, for example, Luke. Look for a field  
>> named subcollection; if the field is there and contains proper  
>> values, then your index is OK.
>>
>> --
>>  Sami Siren
>>
>>
>> Bud Witney wrote:
>>> Has anyone had success with the subcollection plugin in 0.8? If so,  
>>> how did you set it up, and how do you query it?
>>> I tried with the settings below.
>>> <property>
>>>   <name>plugin.includes</name>
>>>   <value>protocol-http|urlfilter-regex|parse-(text|html|js|pdf|swf|msword|mspowerpoint|rss)|index-(basic|more)|query-(basic|site|url|more)|subcollection|clustering-carrot2|summary-basic|scoring-opic</value>
>>>   <description>Regular expression naming plugin directory names to
>>>   include.  Any plugin not matching this expression is excluded.
>>>   In any case you need at least include the nutch-extensionpoints plugin. By
>>>   default Nutch includes crawling just HTML and plain text via HTTP,
>>>   and basic indexing and search plugins.
>>>   </description>
>>> </property>
>>> For querying, I tried collection:{collection name} term,  
>>> subcollection:{collection name} term, and {collection name}: term.
>>> The last had the best results but did not seem to restrict results  
>>> to the collection; it found items outside of the collection.
>>> Do I need to blacklist all the others, or is it a query/setup issue?
>>> -Bud
>>>
> 
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Subcollection-setup-and-use-tp5797884p14373949.html
Sent from the Nutch - User mailing list archive at Nabble.com.