Hi Nana,

It seems that your problem may be related to base64 data embedded in data: URLs, which java.net.URL does not recognize as a protocol. Here is a link about it:

http://stackoverflow.com/questions/12458390/embed-java-applet-through-url-data

Could you share the pages for which you get the error?
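To illustrate what the stack trace is complaining about, here is a minimal sketch of how java.net.URL rejects strings whose scheme has no registered handler, and how such strings could be skipped before they reach code that parses them. The class and method names (UrlSchemeCheck, isParseable, keepParseable) and the sample strings are hypothetical and are not part of Nutch; in Nutch itself this kind of filtering would more naturally belong in a URL filter.

```java
import java.net.MalformedURLException;
import java.net.URL;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, not part of Nutch: shows which strings java.net.URL accepts.
public class UrlSchemeCheck {

    // Returns true if java.net.URL can be constructed from the string,
    // i.e. the string is well-formed and a handler exists for its scheme.
    public static boolean isParseable(String url) {
        try {
            new URL(url); // same constructor that fails in the stack trace
            return true;
        } catch (MalformedURLException e) {
            return false;
        }
    }

    // Keeps only the strings that java.net.URL can parse, preserving order.
    public static List<String> keepParseable(List<String> urls) {
        List<String> kept = new ArrayList<>();
        for (String url : urls) {
            if (isParseable(url)) {
                kept.add(url);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // Sample inputs are made up for illustration only.
        List<String> candidates = Arrays.asList(
                "http://example.com/page.html",
                "data:text/plain;base64,SGVsbG8=",
                "t00:example");
        for (String url : candidates) {
            System.out.println(url + " -> parseable: " + isParseable(url));
        }
        System.out.println("kept: " + keepParseable(candidates));
    }
}
```

If the pages you crawl contain links like the second and third samples, they would fail in the same way as in your log, so seeing the actual outlinks would help to confirm this.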
Kind Regards,
Furkan KAMACI

On Mon, Jun 6, 2016 at 4:26 AM, Nana Pandiawan <[email protected]> wrote:

> Hi All,
>
> I'm getting following errors when updatedb. can someone tell me whats going
> wrong and how to solve it.
> thanks.
>
> 16/06/04 00:58:42 INFO mapreduce.Job: map 0% reduce 0%
> 16/06/04 00:59:27 INFO mapreduce.Job: Task Id : attempt_1464314319848_0309_m_000000_0, Status : FAILED
> Error: java.net.MalformedURLException: unknown protocol: t00
>     at java.net.URL.<init>(URL.java:603)
>     at java.net.URL.<init>(URL.java:493)
>     at java.net.URL.<init>(URL.java:442)
>     at org.apache.nutch.util.TableUtil.reverseUrl(TableUtil.java:43)
>     at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:96)
>     at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:38)
>     at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:146)
>     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:787)
>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
>     at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
>     at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
> 16/06/04 01:00:14 INFO mapreduce.Job: Task Id : attempt_1464314319848_0309_m_000000_1, Status : FAILED
> Error: java.net.MalformedURLException: unknown protocol: t00
>     at java.net.URL.<init>(URL.java:603)
>     at java.net.URL.<init>(URL.java:493)
>     at java.net.URL.<init>(URL.java:442)
>     at org.apache.nutch.util.TableUtil.reverseUrl(TableUtil.java:43)
>     at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:96)
>     at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:38)
>     at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:146)
>     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:787)
>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
>     at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at javax.security.auth.Subject.doAs(Subject.java:415)
>     at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
>     at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
> 16/06/04 01:00:42 INFO mapreduce.Job: Task Id : attempt_1464314319848_0309_m_000001_0, Status : FAILED
> Error: java.net.MalformedURLException: unknown protocol: data
>
> I use Apache Nutch 2.3.1 and hbase as backend.
> Regards,
>

