Hi, I used NUTCH-185 in our project (Nutch 0.9) to parse XML documents. We recently moved to Nutch 1.0-dev for Solr integration.
However, building the XMLParser plugin against the latest (1.0-dev) source throws errors for the following classes because of changes in the base interfaces:

org\apache\nutch\parse\xml\config\XMLIndexer.java :
org\apache\nutch\parse\xml\XMLIndexer.java:40: org.apache.nutch.parse.xml.XMLIndexer is not abstract and does not override abstract method addIndexBackendOptions(org.apache.hadoop.conf.Configuration) in org.apache.nutch.indexer.IndexingFilter

org\apache\nutch\parse\xml\XMLParser.java :
org\apache\nutch\parse\xml\XMLParser.java:64: org.apache.nutch.parse.xml.XMLParser is not abstract and does not override abstract method getParse(org.apache.nutch.protocol.Content) in org.apache.nutch.parse.Parser
org\apache\nutch\parse\xml\XMLParser.java:75: getParse(org.apache.nutch.protocol.Content) in org.apache.nutch.parse.xml.XMLParser cannot implement getParse(org.apache.nutch.protocol.Content) in org.apache.nutch.parse.Parser; attempting to use incompatible return type
org\apache\nutch\parse\xml\XMLParser.java:111: cannot find symbol
[javac] symbol : method setConf(org.apache.hadoop.conf.Configuration)
[javac] location: class org.apache.nutch.parse.ParseData
[javac] parseData.setConf(this.conf);
[javac] ^

Can anybody advise me on the changes needed in XMLIndexer.java and XMLParser.java to make them compatible with Nutch 1.0?

Thanks,
