Hi all,
I'm getting the following errors when running updatedb. Can someone tell me
what is going wrong and how to fix it?
Thanks.
16/06/04 00:58:42 INFO mapreduce.Job: map 0% reduce 0%
16/06/04 00:59:27 INFO mapreduce.Job: Task Id : attempt_1464314319848_0309_m_000000_0, Status : FAILED
Error: java.net.MalformedURLException: unknown protocol: t00
at java.net.URL.<init>(URL.java:603)
at java.net.URL.<init>(URL.java:493)
at java.net.URL.<init>(URL.java:442)
at org.apache.nutch.util.TableUtil.reverseUrl(TableUtil.java:43)
at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:96)
at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:38)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:146)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:787)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
16/06/04 01:00:14 INFO mapreduce.Job: Task Id : attempt_1464314319848_0309_m_000000_1, Status : FAILED
Error: java.net.MalformedURLException: unknown protocol: t00
at java.net.URL.<init>(URL.java:603)
at java.net.URL.<init>(URL.java:493)
at java.net.URL.<init>(URL.java:442)
at org.apache.nutch.util.TableUtil.reverseUrl(TableUtil.java:43)
at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:96)
at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:38)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:146)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:787)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
16/06/04 01:00:42 INFO mapreduce.Job: Task Id : attempt_1464314319848_0309_m_000001_0, Status : FAILED
Error: java.net.MalformedURLException: unknown protocol: data
I use Apache Nutch 2.3.1 with HBase as the backend.
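Judging by the stack trace, the exception is raised inside the java.net.URL
constructor that TableUtil.reverseUrl ends up calling, so it can probably be
reproduced outside Nutch. Below is a minimal sketch of that. The class name
UrlSchemeCheck, the helper parseError, and the sample strings are all
hypothetical; I have not yet identified the actual URLs in my crawl that
trigger the error.

```java
import java.net.MalformedURLException;
import java.net.URL;

// Hypothetical stand-alone check, not part of Nutch.
class UrlSchemeCheck {

    // Returns null if java.net.URL accepts the string,
    // otherwise the message of the MalformedURLException.
    static String parseError(String spec) {
        try {
            new URL(spec);
            return null;
        } catch (MalformedURLException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        // Made-up samples: one ordinary http URL and two strings whose
        // schemes have no protocol handler in a default JVM.
        String[] samples = {
            "http://example.com/",
            "t00:placeholder",
            "data:text/plain,hello"
        };
        for (String s : samples) {
            String err = parseError(s);
            System.out.println(s + " -> " + (err == null ? "ok" : err));
        }
    }
}
```

If this is indeed the cause, I assume such links would have to be filtered out
before updatedb sees them, but I am not sure where the right place for that is.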
Regards,