Hi Gajalakshmi,

The NPE can be thrown because of the file not found on the disk. So in the
working directory/current directory check if you have the file
conf/regex-urlfilter.txt


*Regards*
  Shashanka Balakuntala Srinivasa



On Wed, Oct 7, 2020 at 2:09 PM Gajalakshmi G <gajalakshm...@tcs.com.invalid>
wrote:

> Hi all,
>
> I am trying to crawl dynamic webpage using Nutch 2.4 with Selenium 3.6.0
> with Firefox version 79. I am getting the below error in injector job
> itself.
>
> java.lang.Exception: java.lang.NullPointerException
>     at
> org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
>     at
> org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:522)
> Caused by: java.lang.NullPointerException
>     at java.io.Reader.<init>(Reader.java:78)
>     at java.io.BufferedReader.<init>(BufferedReader.java:101)
>     at java.io.BufferedReader.<init>(BufferedReader.java:116)
>     at
> org.apache.nutch.urlfilter.api.RegexURLFilterBase.readRules(RegexURLFilterBase.java:199)
>     at
> org.apache.nutch.urlfilter.api.RegexURLFilterBase.setConf(RegexURLFilterBase.java:171)
>     at
> org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:163)
>     at org.apache.nutch.net.URLFilters.<init>(URLFilters.java:62)
>     at
> org.apache.nutch.crawl.InjectorJob$UrlMapper.setup(InjectorJob.java:113)
>     at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
>     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:340)
>     at
> org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:243)
>     at
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>     at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>     at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>     at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>     at java.lang.Thread.run(Thread.java:748)
>
> Please guide me on resolving this issue.
>
>
>
> Thanks & Regards,
>
> Gajalakshmi.G
>
> Assistant Consultant
>
> Tata Consultancy Services
> Mailto: gajalakshm...@tcs.com<
> https://mail.tcs.com/owa/redir.aspx?C=15cf4bf65eff4bdab465e0a2dd682f11&URL=mailto%3agajalakshmi.g%40tcs.com
> >
> =====-----=====-----=====
> Notice: The information contained in this e-mail
> message and/or attachments to it may contain
> confidential or privileged information. If you are
> not the intended recipient, any dissemination, use,
> review, distribution, printing or copying of the
> information contained in this e-mail message
> and/or attachments to it are strictly prohibited. If
> you have received this communication in error,
> please notify us by reply e-mail or telephone and
> immediately and permanently delete the message
> and any attachments. Thank you
>
>
>

Reply via email to