Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "GoogleSummerOfCode/Giving HTML5 support for Apache Nutch 2.x" page has been changed by LewisJohnMcgibbney: https://wiki.apache.org/nutch/GoogleSummerOfCode/Giving%20HTML5%20support%20for%20Apache%20Nutch%202.x?action=diff&rev1=1&rev2=2 + <<TableOfContents(4)>> + ==== Giving HTML5 support for Apache Nutch 2.x ==== ===== Description ===== The project is aimed at giving Html5 support to Apache Nutch 2.x with using a java library. With this project two goals is aimed. First one is implementation of a new parser which has to follow WHATWG HTML5 specification. Second one is implementation of a new plugin which uses newly implemented parser and extracts new elements of HTML5.

