Hmm, I switched to crawling another site with PDFs linked with target="_blank" and it worked fine. Please disregard my question below.
Markus Källander
Mobile +46 73 622 0547

-----Original Message-----
From: Markus Källander
Sent: 11 February 2014 17:06
To: [email protected]; 'Sebastian Nagel'
Subject: RE: Follow target _blank links

Yes, this is exactly what I meant. However, my Nutch installation follows and parses PDFs at:

<a href="http://mylink.xyz/a.pdf" >

But not:

<a href="http://mylink.xyz/a.pdf" target="_blank">

Markus Källander
Mobile +46 73 622 0547

-----Original Message-----
From: Sebastian Nagel [mailto:[email protected]]
Sent: 11 February 2014 17:01
To: [email protected]
Subject: Re: Follow target _blank links

Hi Markus,

you mean?

<a href="http://mylink.xyz/" target="_blank">

Nutch follows these links the same as any other link. Of course, the target URL (the href value) must pass the URL filters.

Sebastian

On 02/11/2014 03:58 PM, Markus Källander wrote:
> Hi,
>
> How do I configure Nutch to follow and index links with target set to
> "_blank"?
>
> Thanks
> Markus
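
To illustrate Sebastian's point that the target attribute plays no role and only the href value has to pass the URL filters, here is a minimal Python sketch. It is not Nutch code: `LinkCollector`, `RULES`, `passes_filters` and `extract_links` are hypothetical names, and the two rules are made-up examples that only imitate the accept/reject, first-match-wins style of a regex URL filter.

```python
import re
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects href values from <a> tags; the target attribute is never consulted."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


# Hypothetical rules (assumption, not taken from any real configuration):
# each entry is (accept?, pattern); the first matching rule decides,
# and a URL that matches no rule is rejected.
RULES = [
    (False, re.compile(r"\.(gif|jpg|png|css|js)$", re.IGNORECASE)),
    (True, re.compile(r"^https?://mylink\.xyz/")),
]


def passes_filters(url):
    """Return True if the first matching rule accepts the URL."""
    for accept, pattern in RULES:
        if pattern.search(url):
            return accept
    return False


def extract_links(html):
    """Extract all <a href> values from html and keep those that pass the filters."""
    parser = LinkCollector()
    parser.feed(html)
    return [url for url in parser.links if passes_filters(url)]
```

In this sketch, a PDF link with target="_blank" and one without it are treated identically. If a link is skipped, the filter rules (or, in a real crawl, other settings) are the place to look, not the target attribute.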

