Well, so we did add https to protocol-http's plugin.xml. Does your 
plugin.includes actually contain a protocol-* plugin?


 
 
-----Original message-----
> From:KRIS MUSSHORN <[email protected]>
> Sent: Tuesday 6th September 2016 20:39
> To: [email protected]
> Subject: Re: indexing metatags with Nutch 1.12
> 
> Markus, 
> I'm not sure how to answer your question.
> here are 2 xml files for your consideration.
> 
> Kris
> 
> ----------- 
> From: "Markus Jelsma" <[email protected]>
> To: [email protected]
> Sent: Tuesday, September 6, 2016 2:30:39 PM
> Subject: RE: indexing metatags with Nutch 1.12
> 
> Well, this is certainly not an indexing metatags problem. You need to use 
> protocol-httpclient for https, or configure protocol-http's plugin.xml to 
> support https. That's identical to protocol-httpclient's plugin.xml.
> 
> On the other hand, when we added support for https to protocol-http, did we 
> forget to add it to the plugin.xml?
> 
> 
> 
>  
>  
> -----Original message-----
> > From:KRIS MUSSHORN <[email protected]>
> > Sent: Tuesday 6th September 2016 19:29
> > To: [email protected]
> > Subject: indexing metatags with Nutch 1.12
> > 
> > https://wiki.apache.org/nutch/IndexMetatags 
> > <https://wiki.apache.org/nutch/IndexMetatags>
> > 
> > Soon as i switch to nutch-site_v2 nutch throws protocol missing errors 
> > during crawl.
> > 
> > 2016-09-06 12:23:53,102 INFO  fetcher.Fetcher - -activeThreads=50, 
> > spinWaiting=50, fetchQueues.totalSize=442, fetchQueues.getQueueCount=1
> > 2016-09-06 12:23:53,576 INFO  fetcher.FetcherThread - fetching 
> > https://snip/inside/events/events_summary/documents/Harford_Co_Sheriff_Special_Brief.pdf
> >  (queue crawl delay=500ms)
> > 2016-09-06 12:23:53,576 INFO  fetcher.FetcherThread - fetch of 
> > https://snip/inside/events/events_summary/documents/Harford_Co_Sheriff_Special_Brief.pdf
> >  failed with: org.apache.nutch.protocol.ProtocolNotFound: protocol not 
> > found for url=https
> >     at 
> > org.apache.nutch.protocol.ProtocolFactory.getProtocol(ProtocolFactory.java:84)
> >     at org.apache.nutch.fetcher.FetcherThread.run(FetcherThread.java:257) 
> > 
> > how can i fix this?
> > 
> > Kris
> > 
> 

Reply via email to