Re: Partition Question

2012-05-08 Thread Yuval Dotan
Hi, can someone please guide me to the right way to partition the Solr index? On Mon, May 7, 2012 at 11:41 AM, Yuval Dotan yuvaldo...@gmail.com wrote: Hi All, Jan, thanks for the reply; answers to your questions are below. Please update me if you have ideas that can solve my problems.

Re: Nested CachedSqlEntityProcessor running for each entity row with Solr 3.6?

2012-05-08 Thread not interesting
In the meantime, I am hoping each of you can test these fixes with your installation. The best way to do this is to get a fresh SVN checkout of the 3.6.1 branch (http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_3_6/), switch to the solr directory, then run ant dist. I

RE: Advanced search with results matrix

2012-05-08 Thread Gnanakumar
Hi Mikhail, have you considered putting your subqueries into a disjunction (BooleanClause.Occur.SHOULD) and requesting http://wiki.apache.org/solr/SimpleFacetParameters#facet.query_:_Arbitrary_Query_Faceting? Thanks for pointing me in the right direction at the right time about Solr Facet

Re: Bridge between Solr and NoSQL

2012-05-08 Thread Michael Kuhlmann
On 08.05.2012 04:13, Jeff Schmidt wrote: Francois: Check out DataStax Enterprise 2.0, Solr integrated with Cassandra: http://www.datastax.com/docs/datastax_enterprise2.0/search/index And, Solbase, Solr integrated with HBase: https://github.com/Photobucket/Solbase I'm sure there are others,

How to change the default format for tstamp?

2012-05-08 Thread forwardswing
When I use Nutch 1.2 with Solr 1.4, I run the following command from Nutch to commit the index to Solr: bin/nutch crawl urls -dir ... But there is an error in Solr: ERROR : org.apache.solr.core.SolrCore - org.apache.solr.common.SolrException: Error while creating field

RE: How to change the default format for tstamp?

2012-05-08 Thread Agnesrani Nadar
SELECT DATE_FORMAT( ts.insertDate,'%Y-%m-%d-%k:%i:%S') as insertDate, Try the DATE_FORMAT above; you will get the date as 2012-03-24-14:07:59 -----Original Message----- From: forwardswing [mailto:wangweiz...@sohu.com] Sent: Tuesday, May 08, 2012 2:22 PM To: solr-user@lucene.apache.org

Solr Webapps and JVM code cache

2012-05-08 Thread Sujatha Arun
Hello, I see that the code cache in the JVM is nearing its memory limit: 47 MB used of the 50 MB assigned. If I deploy more Solr webapps to the server, will I get out-of-memory exceptions, and will the JVM freeze? How should this be handled? Regards Sujatha

SOLR UIMA - posTag

2012-05-08 Thread hemantverm...@gmail.com
Hi All, I have integrated UIMA and Solr and successfully indexed the documents. My requirement is to boost nouns over verbs. (Let's take for example field: description.) Below is the sample index contents. <doc> <arr name="concept"> <str>Water Curses</str> <str>Prospect Hummer</str> </arr> <str

Solr Spellcheck index is not replicated to the slave

2012-05-08 Thread ilanh
I am using apache-solr-1.4.1. I have a master and a slave. When I do a “Full import”, the index is replicated to the slave, but the spellcheck index is not. In the master I have: str

RE: Nested CachedSqlEntityProcessor running for each entity row with Solr 3.6?

2012-05-08 Thread Dyer, James
Kellen, I appreciate your trying this out. Is there any way you can provide your data-config.xml file? I'd really like to get to the bottom of this. Thanks. James Dyer E-Commerce Systems Ingram Content Group (615) 213-4311 -Original Message- From: not interesting

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Michael Della Bitta
Hi Sujatha, You will likely have to increase the JVM permgen space for your container when you launch it. This is normal. How you do this depends on the container you're using and how you launch it. Michael On Tue, 2012-05-08 at 16:54 +0530, Sujatha Arun wrote: Hello , I see that the code

Re: Lucene FieldCache - Out of memory exception

2012-05-08 Thread Rahul R
An update on the things I tried today. Since multiValued fields do not use the fieldCache, I changed my schema to define all my fields as multiValued fields. Although these fields only need to be single valued, I made this change, recreated the index and tested with it. Observations : - force GC

[MAVEN] Heads up: build changes

2012-05-08 Thread Steven A Rowe
If you use the Lucene/Solr Maven POMs to drive the build, I committed a major change last night (see https://issues.apache.org/jira/browse/LUCENE-3948 for more details): * 'ant get-maven-poms' no longer places pom.xml files under the lucene/ and solr/ directories. Instead, they are placed in

RE: How to change the default format for tstamp?

2012-05-08 Thread forwardswing
I am new to Nutch; can you please describe this in more detail? For example, where and how do I execute the SQL? -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-change-the-default-format-for-tstamp-tp3970751p3971251.html Sent from the Solr - User mailing list archive

RE: How to change the default format for tstamp?

2012-05-08 Thread Markus Jelsma
You should upgrade your Nutch version. Nutch 1.4 writes dates in the format Solr understands. On Tue, 8 May 2012 06:03:25 -0700 (PDT), forwardswing wangweiz...@sohu.com wrote: I am new to Nutch; can you please describe this in more detail? For example, where and how do I execute the SQL?
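
For reference, Solr date fields accept UTC timestamps in the ISO 8601 form `YYYY-MM-DDThh:mm:ssZ`. The following is a minimal Python sketch, not something posted in the thread, that converts a string in the `2012-03-24-14:07:59` shape mentioned earlier into that form. The function name `to_solr_date` is hypothetical, and the sketch assumes the input is already in UTC.

```python
from datetime import datetime


def to_solr_date(value: str, fmt: str = "%Y-%m-%d-%H:%M:%S") -> str:
    """Reformat a timestamp string into the form Solr date fields accept.

    Assumption: the input time is already UTC, so no zone conversion is done.
    """
    parsed = datetime.strptime(value, fmt)
    # Solr expects a literal 'T' between date and time and a trailing 'Z'.
    return parsed.strftime("%Y-%m-%dT%H:%M:%SZ")


print(to_solr_date("2012-03-24-14:07:59"))  # prints 2012-03-24T14:07:59Z
```

If the source timestamps use a different layout, pass a matching `fmt` pattern instead of changing the function.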

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Sujatha Arun
We have given the permgen space 2 GB, and currently 650 MB of it is used. However, the code cache defaults to 49 MB and more than 47 MB has been used. I would like to know what happens when we deploy more webapps to the container. - Is unused memory removed from the cache to free up more memory?

Re: Advanced search with results matrix

2012-05-08 Thread Mikhail Khludnev
On Tue, May 8, 2012 at 11:38 AM, Gnanakumar gna...@zoniac.com wrote: My question is, is it possible to run multiple combinations of search queries to get only the result counts in a single trip without using facet.query? No, AFAIK. -- Sincerely yours Mikhail Khludnev Tech Lead Grid
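
For reference, the facet.query approach suggested earlier in this thread can return several counts in one request. Below is a minimal Python sketch that only builds such a request URL. The base URL and the field names `status` and `priority` are assumptions for illustration; `rows=0` suppresses the document list so that only the counts come back.

```python
from urllib.parse import urlencode


def build_count_url(base_url, queries):
    """Build a Solr select URL that returns one facet count per query."""
    params = [
        ("q", "*:*"),       # match all documents
        ("rows", "0"),      # no documents in the response, counts only
        ("facet", "true"),  # enable faceting
    ]
    # One facet.query parameter per sub-query; Solr reports a count for each.
    params += [("facet.query", query) for query in queries]
    return base_url + "?" + urlencode(params)


# Hypothetical host and field names.
url = build_count_url(
    "http://localhost:8983/solr/select",
    ["status:open", "status:closed AND priority:high"],
)
print(url)
```

Each count then appears under `facet_counts` / `facet_queries` in the response, keyed by the query string.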

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Michael Della Bitta
I'm sorry, apparently I'm unfamiliar with issues that result in needing 2 GB of permgen space. I've never run into code cache issues before. I found this page via Google: http://jasig.275507.n4.nabble.com/Java-code-cache-memory-td271140.html ...but perhaps someone who has experience with this

CLASSPATH

2012-05-08 Thread Tolga
Hi, Probably off-topic, but what directory should I export to the CLASSPATH environment variable so that I can begin using Nutch? Regards,

Re: CLASSPATH

2012-05-08 Thread Otis Gospodnetic
Tolga - you should ask on the Nutch mailing list, not the Solr one. :) Otis Performance Monitoring for Solr / ElasticSearch / HBase - http://sematext.com/spm From: Tolga to...@ozses.net To: solr-user@lucene.apache.org Sent: Tuesday, May 8, 2012 4:30 PM

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Otis Gospodnetic
Hi, Did you really mean 47 *MB*? Yes, if any limits are reached and GC cannot reclaim enough space, you will get OOM. You can use a couple of JVM params to catch this and dump heap to a file if you want to analyze it and see what was using memory and how much. Otis  Performance Monitoring
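
The message above does not name the JVM parameters. On HotSpot JVMs, `-XX:+HeapDumpOnOutOfMemoryError` and `-XX:HeapDumpPath` are the usual pair for writing a heap dump on OOM, and `-XX:ReservedCodeCacheSize` raises the code cache limit. The Python sketch below just assembles such an option string; the helper name, the dump path and the 128m value are assumptions, and a heap dump describes the Java heap, not the code cache itself.

```python
def jvm_diagnostic_opts(dump_path, code_cache_size=None):
    """Return HotSpot options that write a heap dump when an OOM occurs.

    code_cache_size is optional, e.g. "128m" (an illustrative value only).
    """
    opts = [
        "-XX:+HeapDumpOnOutOfMemoryError",  # dump the heap on OutOfMemoryError
        f"-XX:HeapDumpPath={dump_path}",    # where the .hprof file is written
    ]
    if code_cache_size is not None:
        # Raises the limit of the JIT code cache discussed in this thread.
        opts.append(f"-XX:ReservedCodeCacheSize={code_cache_size}")
    return opts


# Hypothetical path and size; pass the result to the container's JVM options.
print(" ".join(jvm_diagnostic_opts("/var/tmp/solr-heap.hprof", "128m")))
```

How the options reach the JVM depends on the servlet container and how it is launched.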

Re: SOLR UIMA - posTag

2012-05-08 Thread Lance Norskog
2 quick ideas: 1) The nouns and verbs have to be in separate fields. So, you would make two analysis stacks. One only stores terms marked as 'noun' and the other only stores terms marked as 'verb'. 2) Store all terms in one field. Create a Similarity class that scores from the payloads. (There is

Re: Bridge between Solr and NoSQL

2012-05-08 Thread Lance Norskog
The SolrCloud project is making Solr be a NoSQL DB by itself, without connections to other systems. On Tue, May 8, 2012 at 2:01 AM, Michael Kuhlmann k...@solarier.de wrote: On 08.05.2012 04:13, Jeff Schmidt wrote: Francois: Check out DataStax Enterprise 2.0, Solr integrated with Cassandra:

Re: Partition Question

2012-05-08 Thread Lance Norskog
Lucene does not support more than 2^32 unique documents, so you need to partition. In Solr this is done with Distributed Search: http://www.lucidimagination.com/search/link?url=http://wiki.apache.org/solr/DistributedSearch First, you have to decide a policy for which documents go to which 'shard'. It
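
One possible policy, shown here only as a sketch and not as what the poster goes on to recommend, is to hash each document's unique key and take the result modulo the number of shards. The Python below illustrates that, plus building the comma-separated value of Solr's `shards` request parameter. The function names and host names are hypothetical.

```python
import hashlib


def shard_for(doc_id: str, num_shards: int) -> int:
    """Pick a shard index for a document by hashing its unique key."""
    digest = hashlib.md5(doc_id.encode("utf-8")).hexdigest()
    # A stable hash keeps the same document on the same shard across runs.
    return int(digest, 16) % num_shards


def shards_param(hosts):
    """Join shard addresses into the value of the 'shards' query parameter."""
    return ",".join(hosts)


# Hypothetical shard addresses (host:port/path, without the http:// prefix).
hosts = ["solr1:8983/solr", "solr2:8983/solr", "solr3:8983/solr"]
target = hosts[shard_for("doc-42", len(hosts))]
print("index doc-42 on:", target)
print("query with shards=" + shards_param(hosts))
```

A hash policy spreads documents evenly but changing the shard count moves most documents; a date- or tenant-based policy trades evenness for easier growth.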

Re: PDF indexing

2012-05-08 Thread Lance Norskog
post.jar and curl do the same thing. Look at post.sh, which uses curl. On Mon, May 7, 2012 at 12:57 PM, Tolga to...@ozses.net wrote: On 05/07/2012 10:35 PM, Jack Krupansky wrote: Try SolrCell (ExtractingRequestHandler). See: http://wiki.apache.org/solr/ExtractingRequestHandler -- Jack

Re: Solritas in production

2012-05-08 Thread Bill Bell
I would not use Solritas except for very rudimentary solutions and prototypes. Sent from my Mobile device 720-256-8076 On May 6, 2012, at 6:02 AM, András Bártházi and...@barthazi.hu wrote: Hi, We're currently evaluating Solr as a Sphinx replacement. Our site has 1.000.000+ pageviews a

Re: Solr Webapps and JVM code cache

2012-05-08 Thread Sujatha Arun
Yes, 47 MB. Does CMS permgen sweeping take care of code cache cleanup? Thanks, Michael and Otis. Regards Sujatha On Wed, May 9, 2012 at 2:27 AM, Otis Gospodnetic otis_gospodne...@yahoo.com wrote: Hi, Did you really mean 47 *MB*? Yes, if any limits are reached and GC cannot reclaim enough