Severe NoClassDefFoundError Spell StringDistance Nightly 20101207

2010-12-07 Thread Dan Hertz (Insight 49, LLC)
While running java -jar start.jar from the example directory of the latest
nightly build, I get the following error. Any ideas how to fix this? Thanks! Dan.


Dec 7, 2010 8:46:56 AM org.apache.solr.common.SolrException log
SEVERE: java.lang.NoClassDefFoundError: org/apache/lucene/search/spell/StringDistance
    at org.apache.solr.search.ValueSourceParser.<clinit>(ValueSourceParser.java:297)
    at org.apache.solr.core.SolrCore.initValueSourceParsers(SolrCore.java:1517)
    at org.apache.solr.core.SolrCore.<init>(SolrCore.java:554)
    at org.apache.solr.core.CoreContainer.create(CoreContainer.java:660)
    at org.apache.solr.core.CoreContainer.load(CoreContainer.java:412)
    at org.apache.solr.core.CoreContainer.load(CoreContainer.java:294)
    at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:243)
    at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:86)
    at org.mortbay.jetty.servlet.FilterHolder.doStart(FilterHolder.java:97)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
    at org.mortbay.jetty.servlet.ServletHandler.initialize(ServletHandler.java:713)
    at org.mortbay.jetty.servlet.Context.startContext(Context.java:140)
    at org.mortbay.jetty.webapp.WebAppContext.startContext(WebAppContext.java:1282)
    at org.mortbay.jetty.handler.ContextHandler.doStart(ContextHandler.java:518)
    at org.mortbay.jetty.webapp.WebAppContext.doStart(WebAppContext.java:499)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
    at org.mortbay.jetty.handler.HandlerCollection.doStart(HandlerCollection.java:152)
    at org.mortbay.jetty.handler.ContextHandlerCollection.doStart(ContextHandlerCollection.java:156)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
    at org.mortbay.jetty.handler.HandlerCollection.doStart(HandlerCollection.java:152)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
    at org.mortbay.jetty.handler.HandlerWrapper.doStart(HandlerWrapper.java:130)
    at org.mortbay.jetty.Server.doStart(Server.java:224)
    at org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:50)
    at org.mortbay.xml.XmlConfiguration.main(XmlConfiguration.java:985)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.mortbay.start.Main.invokeMain(Main.java:194)
    at org.mortbay.start.Main.start(Main.java:534)
    at org.mortbay.start.Main.start(Main.java:441)
    at org.mortbay.start.Main.main(Main.java:119)
Caused by: java.lang.ClassNotFoundException: org.apache.lucene.search.spell.StringDistance
    at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
    at java.security.AccessController.doPrivileged(Native Method)
    at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
    at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
    at org.mortbay.jetty.webapp.WebAppClassLoader.loadClass(WebAppClassLoader.java:401)
    at org.mortbay.jetty.webapp.WebAppClassLoader.loadClass(WebAppClassLoader.java:363)
    ... 33 more
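
A NoClassDefFoundError like this usually means that no jar visible to the webapp class loader contains the class. One way to check is to scan the build for the class file. The sketch below is only an illustration: the function name find_class and the directory you point it at are assumptions, and it looks only at .jar/.war files plus jars nested one level inside them.

```python
import io
import zipfile
from pathlib import Path


def _has_entry(source, entry):
    """Return True if the zip archive `source` (path or file object) contains `entry`."""
    try:
        with zipfile.ZipFile(source) as zf:
            return entry in zf.namelist()
    except zipfile.BadZipFile:
        # Not a readable archive; treat as "class not found here".
        return False


def find_class(root, class_name):
    """List the archives under `root` that contain the dotted Java `class_name`."""
    entry = class_name.replace(".", "/") + ".class"
    hits = []
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file() or path.suffix not in (".jar", ".war"):
            continue
        if _has_entry(path, entry):
            hits.append(str(path))
        # Also look inside jars packed into the archive (e.g. WEB-INF/lib of a war).
        try:
            with zipfile.ZipFile(path) as outer:
                for name in outer.namelist():
                    if name.endswith(".jar") and _has_entry(
                        io.BytesIO(outer.read(name)), entry
                    ):
                        hits.append(f"{path}!/{name}")
        except zipfile.BadZipFile:
            continue
    return hits
```

Calling find_class on the example directory with org.apache.lucene.search.spell.StringDistance returning an empty list would suggest that the jar providing the class was left out of the build.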


Escaping options for tika/solr cell extract-only output

2010-03-03 Thread Dan Hertz (Insight 49, LLC)

Looking at http://wiki.apache.org/solr/ExtractingRequestHandler:

Extract Only
the output includes XML generated by Tika (and is hence further escaped 
by Solr's XML)


...is there an option to NOT have the resulting TIKA output escaped?

so &lt;head&gt; would come back as <head/>

If no, what would need to be done to enable this option? Looked into 
SOLR-1274.patch, but didn't see a parameter for such a thing.


Thanks,

Dan
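
If no such option exists, one client-side workaround is to let an XML parser undo the escaping: the text content of the response element is the original Tika markup. The sketch below assumes a heavily trimmed, hypothetical response shape (a single str element holding the escaped output); a real response has more elements, and the function name tika_output is made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical, trimmed extract-only response. Real responses carry more
# elements, and the Tika output is namespaced XHTML.
SAMPLE_RESPONSE = (
    "<response>"
    '<str name="mylocalfile.htm">'
    "&lt;html&gt;&lt;head&gt;&lt;title&gt;Hi&lt;/title&gt;&lt;/head&gt;"
    "&lt;body/&gt;&lt;/html&gt;"
    "</str>"
    "</response>"
)


def tika_output(response_xml):
    """Return the Tika markup from a Solr response as an unescaped string."""
    root = ET.fromstring(response_xml)
    node = root.find("str")
    if node is None or node.text is None:
        raise ValueError("no <str> element with text found")
    # The parser has already turned &lt; and &gt; back into < and >.
    return node.text


if __name__ == "__main__":
    print(tika_output(SAMPLE_RESPONSE))
```

The returned string can then be parsed a second time as XML if the document structure is needed.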


Re: ConcurrentModificationException

2010-02-27 Thread Dan Hertz (Insight 49, LLC)

On 2010-02-26 12:55 PM, Yonik Seeley wrote:

Yep, definitely a bug.
It looks like resourceLoader.newInstance() is fundamentally not thread safe.
   

Could you open a JIRA issue for this?


JIRA Issue opened: http://issues.apache.org/jira/browse/SOLR-1797

Dan


ConcurrentModificationException

2010-02-26 Thread Dan Hertz (Insight 49, LLC)

Hi guys,

Solr 1.4 (final) and the 1.5 nightly work fine on a Windows box, but on our
CentOS 5 box, we're getting a ConcurrentModificationException when
starting Tomcat 6.


Any tips on how to solve this and/or troubleshoot?

Made sure there are no duplicate libs in Tomcat and solr/lib, and tried 
to cut down contrib stuff to see if it helped, but no luck.


Thanks, Dan.

= = =  Log Below: = = =

INFO   | jvm 1| 2010/02/24 21:27:04 | SEVERE: java.util.ConcurrentModificationException
INFO   | jvm 1| 2010/02/24 21:27:04 | at java.util.AbstractList$Itr.checkForComodification(AbstractList.java:372)
INFO   | jvm 1| 2010/02/24 21:27:04 | at java.util.AbstractList$Itr.next(AbstractList.java:343)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.solr.core.SolrResourceLoader.inform(SolrResourceLoader.java:507)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.solr.core.SolrCore.<init>(SolrCore.java:606)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.solr.core.CoreContainer.create(CoreContainer.java:429)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.solr.core.CoreContainer.load(CoreContainer.java:285)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.solr.core.CoreContainer$Initializer.initialize(CoreContainer.java:117)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.solr.servlet.SolrDispatchFilter.init(SolrDispatchFilter.java:86)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.ApplicationFilterConfig.getFilter(ApplicationFilterConfig.java:275)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.ApplicationFilterConfig.setFilterDef(ApplicationFilterConfig.java:397)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.ApplicationFilterConfig.<init>(ApplicationFilterConfig.java:108)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.StandardContext.filterStart(StandardContext.java:3800)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.StandardContext.start(StandardContext.java:4450)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.ContainerBase.addChildInternal(ContainerBase.java:791)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.ContainerBase.addChild(ContainerBase.java:771)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.StandardHost.addChild(StandardHost.java:526)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.startup.HostConfig.deployDescriptor(HostConfig.java:630)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.startup.HostConfig.deployDescriptors(HostConfig.java:556)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.startup.HostConfig.deployApps(HostConfig.java:491)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.startup.HostConfig.start(HostConfig.java:1206)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.startup.HostConfig.lifecycleEvent(HostConfig.java:314)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.util.LifecycleSupport.fireLifecycleEvent(LifecycleSupport.java:119)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.ContainerBase.start(ContainerBase.java:1053)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.StandardHost.start(StandardHost.java:722)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.ContainerBase.start(ContainerBase.java:1045)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.StandardEngine.start(StandardEngine.java:443)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.StandardService.start(StandardService.java:516)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.core.StandardServer.start(StandardServer.java:710)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.startup.Catalina.start(Catalina.java:583)
INFO   | jvm 1| 2010/02/24 21:27:04 | at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
INFO   | jvm 1| 2010/02/24 21:27:04 | at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
INFO   | jvm 1| 2010/02/24 21:27:04 | at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
INFO   | jvm 1| 2010/02/24 21:27:04 | at java.lang.reflect.Method.invoke(Method.java:597)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.startup.Bootstrap.start(Bootstrap.java:288)
INFO   | jvm 1| 2010/02/24 21:27:04 | at org.apache.catalina.startup.Bootstrap.main(Bootstrap.java:413)
INFO   | jvm 1| 2010/02/24 21:27:04 | at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
INFO   | jvm 1| 2010/02/24 21:27:04 | at 

Re: ConcurrentModificationException

2010-02-26 Thread Dan Hertz (Insight 49, LLC)

On 2010-02-26 12:55 PM, Yonik Seeley wrote:

Yep, definitely a bug.
It looks like resourceLoader.newInstance() is fundamentally not thread safe.

-Yonik

On Fri, Feb 26, 2010 at 2:48 PM, Yonik Seeley
yo...@lucidimagination.com  wrote:
   

Could you open a JIRA issue for this?

Yonik,

Do you still need me to open a JIRA issue, or has one been opened?
(I'm having trouble connecting to issues.apache.org)

Thanks, Dan



Re: Solr Cell on web-based files?

2009-10-31 Thread Insight 49, LLC

markus.rietz...@rzf.fin-nrw.de wrote:

curl reads from a local file or stdin, so if it is only a single file from a
web server, you could do something like:


curl http://someserver/file.html/ | curl 
http://localhost:8983/solr/update/extract?extractOnly=true; -F na...@-


but this way there is no crawling, no link following, etc.
  


Thanks! Will give it a shot.

Are local file URIs a limitation of Solr Cell, or just of curl? If the
former, are there any plans to add other schemes (http or ftp) to the code base?
It just seems that one should be able to access a file locally or over the
Internet interchangeably.


Dan
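
The same fetch-then-post idea can be written without a shell pipe. The sketch below mirrors the working curl call from the original question (raw body plus a Content-Type header); the function names and the default Solr base URL are assumptions for illustration, and like the pipe it does no crawling or link following.

```python
import urllib.request


def build_extract_request(solr_base, body, content_type="text/html"):
    """Build (but do not send) a POST of `body` to the extract-only handler."""
    url = solr_base.rstrip("/") + "/update/extract?extractOnly=true"
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )


def extract_remote(file_url, solr_base="http://localhost:8983/solr"):
    """Download `file_url`, post its bytes to Solr Cell, return the response text."""
    with urllib.request.urlopen(file_url) as src:
        body = src.read()
        # Reuse the content type reported by the remote server.
        content_type = src.headers.get_content_type()
    request = build_extract_request(solr_base, body, content_type)
    with urllib.request.urlopen(request) as resp:
        return resp.read().decode("utf-8")
```

extract_remote("http://myweb.com/mylocalfile.htm") would then stand in for the second curl example, with the download done on the client side.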


Solr Cell on web-based files?

2009-10-27 Thread Insight 49, LLC

Hi,

If I use the ExtractingRequestHandler 
http://wiki.apache.org/solr/ExtractingRequestHandler on a local file 
(as shown in http://wiki.apache.org/solr/TikaExtractOnlyExampleOutput), 
all works well, but how do I do this for files located on a server?


e.g. (works)
curl http://localhost:8983/solr/update/extract?extractOnly=true 
--data-binary @mylocalfile.htm -H Content-type:text/html


e.g. (doesn't work)
curl http://localhost:8983/solr/update/extract?extractOnly=true 
--data-binary @http://myweb.com/mylocalfile.htm -H Content-type:text/html


Thanks,

Dan



Re: Solr Cell on web-based files?

2009-10-27 Thread Insight 49, LLC

Andrzej Bialecki wrote:

Grant Ingersoll wrote:
You might try remote streaming with Solr (see 
http://wiki.apache.org/solr/SolrConfigXml).  Otherwise, look into a 
crawler such as Nutch or Droids or Heritrix.


Additionally, Nutch can be configured to send the crawled/parsed 
documents to Solr for indexing.


I'm using nutch/solr, but wanted to use Cocoon to do some other 
transformations before adding the data to solr's index.


- Dan
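
For reference, remote streaming is normally driven by a stream.url request parameter and has to be switched on in solrconfig.xml (the enableRemoteStreaming attribute of requestParsers). The sketch below only builds such a request URL; the function name is made up, and whether the extract handler in a given Solr version accepts stream.url is an assumption to verify.

```python
from urllib.parse import urlencode


def remote_stream_url(solr_base, file_url):
    """Build an extract-only URL that asks Solr to fetch `file_url` itself."""
    params = {
        "extractOnly": "true",
        # Solr downloads this URL server-side when remote streaming is enabled.
        "stream.url": file_url,
    }
    return solr_base.rstrip("/") + "/update/extract?" + urlencode(params)


if __name__ == "__main__":
    print(remote_stream_url("http://localhost:8983/solr",
                            "http://myweb.com/mylocalfile.htm"))
```

urlencode percent-encodes the nested URL, so its own ? and & characters cannot be mistaken for Solr parameters.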


Re: CSV Update - Need help mapping csv field to schema's ID

2009-09-16 Thread Insight 49, LLC

Thanks guys...

Yonik and Grant commented on this thread in the dev group.

Dan

Chris Hostetter wrote:

: I would like to add an additional name:value pair for every line, mapping the
: sku field to my schema's id field:
: 
: .map={sku.field}:{id}


the map param is for replacing a *value* with a different value ... it's 
useful for things like numeric codes in CSV files that you want to replace 
with strings in your index.


: I would prefer NOT to change the schema by adding a <copyField source="sku"
: dest="id"/>.

that's the only solution i can think of unless you want to write an 
UpdateProcessor.



-Hoss
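
To make the distinction concrete: value mapping rewrites cell contents and never renames or copies a column. The sketch below simulates that behaviour locally with Python's csv module; it is not Solr's implementation, and the status column and its codes are invented for illustration.

```python
import csv
import io


def apply_value_map(csv_text, field, mapping):
    """Replace values of `field` according to `mapping`; other values pass through."""
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        # Only the cell content changes; the column name stays the same.
        row[field] = mapping.get(row[field], row[field])
        rows.append(row)
    return rows


if __name__ == "__main__":
    sample = "sku,status\nA1,1\nB2,2\nC3,9\n"
    for row in apply_value_map(sample, "status", {"1": "in stock", "2": "sold out"}):
        print(row)
```

This is why a value map cannot route the sku column into the id field: it only substitutes one value for another within a field.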





Re: CSV Update - Need help mapping csv field to schema's ID

2009-09-15 Thread Insight 49, LLC

Bump. Can anyone help guide me in the right direction?

Want to map each sku field to the schema unique id field using update/csv.

Thanks. Dan.


Insight 49, LLC wrote:
Using http://localhost:8983/solr/update/csv?stream.file, is there any 
way to map one of the csv fields to one's schema unique id?


e.g. A file with 3 fields (sku, product, price):
http://localhost:8983/solr/update/csv?stream.file=products.csv&stream.contentType=text/plain;charset=utf-8&header=true&separator=%2c&encapsulator=%22&escape=%5c&fieldnames=sku,product,price



I would like to add an additional name:value pair for every line, 
mapping the sku field to my schema's id field:


.map={sku.field}:{id}

I would prefer NOT to change the schema by adding a <copyField
source="sku" dest="id"/>.


I read: http://wiki.apache.org/solr/UpdateCSV, but can't quite get it.

Thanks!

Dan



CSV Update - Need help mapping csv field to schema's ID

2009-09-13 Thread Insight 49, LLC
Using http://localhost:8983/solr/update/csv?stream.file, is there any 
way to map one of the csv fields to one's schema unique id?


e.g. A file with 3 fields (sku, product, price):
http://localhost:8983/solr/update/csv?stream.file=products.csv&stream.contentType=text/plain;charset=utf-8&header=true&separator=%2c&encapsulator=%22&escape=%5c&fieldnames=sku,product,price

I would like to add an additional name:value pair for every line, 
mapping the sku field to my schema's id field:


.map={sku.field}:{id}

I would prefer NOT to change the schema by adding a <copyField
source="sku" dest="id"/>.


I read: http://wiki.apache.org/solr/UpdateCSV, but can't quite get it.

Thanks!

Dan
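
Long update/csv URLs like the one above are easy to get wrong by hand, since the separator, encapsulator and escape characters all need percent-encoding. A minimal sketch that assembles the same parameters programmatically follows; the function name and the default base URL are assumptions, and it only builds the URL without sending anything.

```python
from urllib.parse import urlencode


def csv_update_url(stream_file, fieldnames, solr_base="http://localhost:8983/solr"):
    """Build an update/csv URL with the parameters used in the post above."""
    params = {
        "stream.file": stream_file,
        "stream.contentType": "text/plain;charset=utf-8",
        "header": "true",
        "separator": ",",       # encoded as %2C
        "encapsulator": '"',    # encoded as %22
        "escape": "\\",         # encoded as %5C
        "fieldnames": ",".join(fieldnames),
    }
    return solr_base.rstrip("/") + "/update/csv?" + urlencode(params)


if __name__ == "__main__":
    print(csv_update_url("products.csv", ["sku", "product", "price"]))
```

urlencode joins the parameters with & and encodes each value, which avoids the hand-written %2c/%22/%5c escapes.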