Hi,

Injector is normalizing, there is no extra setup required.

In case, you want to have special rules for injected URLs
(e.g., strip "index.html"), it's possible to configure
a special rules files for this scope by:

<property>
  <name>urlnormalizer.regex.file.inject</name>
  <value>regex-normalize-inject.xml</value>
  <description>Name of the config file used by urlnormalizer-regex for scope 
inject.
  </description>
</property>

Sebastian


On 10/09/2015 04:38 PM, Roannel Fernández Hernández wrote:
> Hi all,
> 
> I'm using Nutch 1.9. How can I normalize the URL before the injector step?.
> 
> Thanks,
> 
> Roannel
> 
> 17 de octubre: Final Cubana 2015 del Concurso de Programación ACM-ICPC.
> http://coj.uci.cu/contest/contestview.xhtml?cid=1407

Reply via email to