[ https://issues.apache.org/jira/browse/NUTCH-1527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-1527:
---------------------------------

    Attachment: NUTCH-1527.patch

New patch. You need a config/names.txt file in your runtime/local (I don't 
know why). I also had to update Solr's dependencies so that all Lucene jars 
are at 4.3.0; otherwise everything fails. After adding indexer-elastic to 
plugin.includes, you can index with:

bin/nutch index -Delastic.cluster=nutch crawl//crawdb/ crawl/segments/20130613162613/


There's one problem I can't figure out right now:
{code}
2013-06-13 17:51:40,205 INFO  elasticsearch.node - [nutch] {0.90.1}[1001]: initializing ...
2013-06-13 17:51:40,275 WARN  mapred.LocalJobRunner - job_local1865023617_0001
java.lang.LinkageError: loader constraint violation: loader (instance of sun/misc/Launcher$AppClassLoader) previously initiated loading for a different type with name "org/elasticsearch/env/Environment"
        at java.lang.ClassLoader.defineClass1(Native Method)
        at java.lang.ClassLoader.defineClass(ClassLoader.java:787)
        at java.security.SecureClassLoader.defineClass(SecureClassLoader.java:142)
        at java.net.URLClassLoader.defineClass(URLClassLoader.java:447)
        at java.net.URLClassLoader.access$100(URLClassLoader.java:71)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:361)
        at java.net.URLClassLoader$1.run(URLClassLoader.java:355)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:354)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:423)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:308)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:356)
        at org.elasticsearch.plugins.PluginsHelper.sitePlugins(PluginsHelper.java:39)
        at org.elasticsearch.plugins.PluginsService.<init>(PluginsService.java:94)
        at org.elasticsearch.node.internal.InternalNode.<init>(InternalNode.java:128)
        at org.elasticsearch.node.NodeBuilder.build(NodeBuilder.java:159)
        at org.elasticsearch.node.NodeBuilder.node(NodeBuilder.java:166)
        at org.apache.nutch.indexwriter.elastic.ElasticIndexWriter.open(ElasticIndexWriter.java:73)
        at org.apache.nutch.indexer.IndexWriters.open(IndexWriters.java:78)
        at org.apache.nutch.indexer.IndexerOutputFormat.getRecordWriter(IndexerOutputFormat.java:39)
        at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.<init>(ReduceTask.java:449)
        at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:491)
        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
2013-06-13 17:51:40,732 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
        at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123)
        at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195)
{code}
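
A LinkageError like this typically means that two class loaders each ended up 
with their own copy of the same class. That has not been confirmed as the cause 
here. As a minimal diagnostic sketch, the class below lists every location on 
the application classpath that provides a given class file. The class name 
DuplicateClassFinder and its methods are made up for illustration and are not 
part of Nutch or Elasticsearch. It only sees what the system class loader sees, 
so jars reachable only through the Nutch plugin class loader will not show up.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.net.URL;
import java.util.Collections;
import java.util.List;

// Hypothetical helper for illustration only; not part of Nutch or Elasticsearch.
class DuplicateClassFinder {

    // Converts a binary class name (dots) to the resource path of its class file.
    public static String resourcePath(String className) {
        return className.replace('.', '/') + ".class";
    }

    // Returns every location visible to the given loader that provides the class file.
    public static List<URL> locate(String className, ClassLoader loader) {
        try {
            return Collections.list(loader.getResources(resourcePath(className)));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        // Defaults to the class named in the LinkageError above.
        String name = args.length > 0 ? args[0] : "org.elasticsearch.env.Environment";
        List<URL> hits = locate(name, ClassLoader.getSystemClassLoader());
        System.out.println(name + ": " + hits.size() + " location(s)");
        for (URL url : hits) {
            System.out.println("  " + url);
        }
    }
}
```

Run it with the same classpath that bin/nutch builds. More than one location 
for org.elasticsearch.env.Environment would suggest duplicate Elasticsearch 
jars. A single location would suggest that the second copy comes from the 
plugin's own lib directory instead.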

Any pointers are much appreciated!
                
> Port nutch-elasticsearch-indexer to Nutch
> -----------------------------------------
>
>                 Key: NUTCH-1527
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1527
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 1.6, 2.1
>            Reporter: Lewis John McGibbney
>            Assignee: Markus Jelsma
>            Priority: Minor
>             Fix For: 2.4
>
>         Attachments: NUTCH-1527.patch, NUTCH-1527.patch, NUTCH-1527.patch
>
>
> The source repository for this can be found at [0].
> This issue should be in line with the work already done by Julien and others 
> over at NUTCH-1047.
> [0] https://github.com/ctjmorgan/nutch-elasticsearch-indexer
