If you're using a web container like tomcat you need to make sure the nutch-site.xml configuration it is pointing to includes the query- more plugin, not just your nutch crawer base directory.


On May 15, 2007, at 8:34 AM, Emmanuel JOKE wrote:


Thanks for your advice but it still doesn't work.

Ive downloaded Luke and make a search based on the query type:pdf it gaves me some results. Unfortunetly when i used the webapp it doesn't show any results. However if i input a simple query with only the keyword "pdf" ive got some results but it shows a mix of HTML links and PDF links

Did you ever have the chance to get any result when you use a query "type:pdf" with the webapp on your index ?

I've attached some screenshot which show my results.

Thanks for you help

> Hi,
>
> On 5/14/07, Emmanuel JOKE <[EMAIL PROTECTED]> wrote:
>> Hi Guys,
>>
>> Could you please help me ?
>>
>> Does anybody has ever used a query search based on the filter type ?
>> How do you do ?
>>
>> Thanks for your help
>>
>> ---------- Forwarded message ----------
>> From: Emmanuel JOKE < [EMAIL PROTECTED] >
>> Date: 4 mai 2007 22:26
>> Subject: Type:PDF
>> To: nutch-user < [email protected]>
>>
>> Hi Guys,
>>
>> I've tried to configured the plugin query-more to search only PDF files.
>>
>> I've added the index-more and query-more plugin. I crawled a website
>> with
>> PDF files. I've made a search using the following query "type:pdf
>> apache"
>> but it didn't gave me any results. However if I input the query "apache"
>> it
>> gives me a lot of result including the PDF files.
>>
>> I'm wondering if my query is correct or if i missed any configuration.
>> Coudl you please help me ?
>>
>> Thanks
>> E
>>
>
> For any "Why doesn't this query return correct results" type of
> question, I would suggest using Luke ( http://www.getopt.org/luke/).
> You can view each document to check whether "type" field is indexed
> correctly, then you can do a search in Luke to see if that works.
>
> --
> Doğacan Güney
>


<Doc1.zip>

Reply via email to