Hi, I am new to Nutch. My English is not good, so I am writing with the help of a translator.
My question is how Nutch can collect and process all the meta tags of an HTML document so that they can later be indexed on a Solr server. I am also interested in how to collect the ALT attribute of the img tag in HTML. If there is a filter where I can configure which tags are collected, even better. I would be very grateful for your help.
MANP

