when execute the step 8
http://www.nabble.com/Enabling-Spell-Check-plugin-in-contrib-to11097966.html#a11688850
8. build your spell check index ( bin/nutch plugin
web-query-propose-spellcheck org.apache.nutch.spell.NGramSpeller -i
<indexdir> -f content -o spelling
showme this
$ ./bin/nutch plugin web-query-propose-spellcheck
org.apache.nutch.spell.NGramS
peller -i index -f content -o spelling
[Fatal Error] nutch-site.xml:1:7: The processing instruction target matching
"[xX][mM][lL]" is not allowed.
Exception in thread "main" java.lang.RuntimeException:
org.xml.sax.SAXParseException: The processing instruction target matching
"[xX][mM][lL]" is not allowed.
at
org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:450)
at
org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:384)
at
org.apache.hadoop.conf.Configuration.getProps(Configuration.java:373)
at org.apache.hadoop.conf.Configuration.get(Configuration.java:132)
at
org.apache.hadoop.conf.Configuration.getBoolean(Configuration.java:208)
at
org.apache.nutch.plugin.PluginRepository.<init>(PluginRepository.java:78)
at
org.apache.nutch.plugin.PluginRepository.main(PluginRepository.java:399)
Caused by: org.xml.sax.SAXParseException: The processing instruction target
matching "[xX][mM][lL]" is not allowed.
at org.apache.xerces.parsers.DOMParser.parse(Unknown Source)
at org.apache.xerces.jaxp.DocumentBuilderImpl.parse(Unknown Source)
at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:146)
at
org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:398)
... 6 more
payo wrote:
>
> Hi to all
>
> how i can enable spell check in nutch 0.8.1?
>
> exist some plugin for this?
>
> or i have do some configuration different?
>
> i found this
>
> http://www.nabble.com/Enabling-Spell-Check-plugin-in-contrib-to11097966.html#a11688850
>
> but i dont have contrib directory
>
> i am work on Centos.
>
> thanks
>
--
View this message in context:
http://www.nabble.com/spell-check-in-nutch-0.8.1-tp14296861p14426766.html
Sent from the Nutch - User mailing list archive at Nabble.com.