The "GoogleSummerOfCode/SitemapCrawler/weeklyreport" page has been changed by 
LewisJohnMcgibbney:
https://wiki.apache.org/nutch/GoogleSummerOfCode/SitemapCrawler/weeklyreport?action=diff&rev1=9&rev2=10

  ## page was renamed from GoogleSummerOfCode/SitemapCrawler/weeklyreport
- || '''Week :''' 1 (25 May 2015 - 31 May 2015) ||
+ = Week : 1 (25 May 2015 - 31 May 2015) =
  
  '''Title :''' Sitemap url injection is done.
  
@@ -20, +20 @@

  
  Then you can run InjectorJob, which injects the sitemap URLs into the db. 
The injected URLs are flagged as sitemaps.
  
- || '''Week :''' 2 (1 June 2015 - 7 June 2015) ||
+ = Week : 2 (1 June 2015 - 7 June 2015) =
  
  '''Title :''' Sitemap detection is done. 
  
@@ -31, +31 @@

  The stm (sitemap) column has been added to the webpage schema for the 
sitemap crawler. The URLs in the stm column of the db will be parsed on the 
next run.
  
  
- || '''Week :''' 3 & 4 (8 June 2015 - 21 June 2015) ||
+ = Week : 3 & 4 (8 June 2015 - 21 June 2015) =
  
  '''Title :''' Sitemap parser plugin is developed.
  
  A plugin to parse sitemap files has been developed. The plugin makes use of 
the crawler-commons library. The sitemap file is parsed by the parse plugin, 
and the inlinks from the sitemap file are written to the db. The inlinks will 
be parsed on the next run.
  
  
- || '''Week :''' 5 (22 June 2015 - 28 June 2015) ||
+ = Week : 5 (22 June 2015 - 28 June 2015) =
  ...
  
