Solr Indexing error in this function

2012-04-22 Thread vasuj
I am getting a Solr indexing error in this function. I am using Windows 8 x32, with Xampp to configure Solr and Tomcat. I have tried many other forums too, but they were not helpful. I even tried configuring many of the XML files in Xampp/solr and still could not get it working. Any hints would be helpful. Here is my function for solr indexing and

Re: Storing the md5 hash of pdf files as a field in the index

2012-04-22 Thread kuchenbrett
Hi Lance, sounds interesting. The idea was to use a message digest (e.g. an MD5 hash) of a file to be indexed as a unique identifier to avoid duplicates. I wasn't aware of the de-duplication feature you mention. This feature seems to be the exact solution for my problem. In the solr wiki I
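A minimal sketch of the digest-as-identifier idea described above, assuming the unique key field is called "id" and using the hypothetical helper names file_md5 and build_doc. It only computes the hash on the client side; it does not use Solr's built-in de-duplication feature.

```python
import hashlib


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large PDF files are never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_doc(path):
    """Build a document dict keyed by the file's digest.

    Assumption: the schema's unique key field is named "id"; "path" is a
    hypothetical extra field used only for illustration.
    """
    return {"id": file_md5(path), "path": path}
```

Two byte-identical files then produce the same "id", so the second one overwrites the first instead of creating a duplicate.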

Re: Solr Indexing error in this function

2012-04-22 Thread vasuj
Log is:
Apr 22, 2012 2:55:17 AM org.apache.solr.update.processor.LogUpdateProcessor finish
INFO: {add=[(null)]} 0 17
Apr 22, 2012 2:55:17 AM org.apache.solr.common.SolrException log
SEVERE: org.apache.solr.common.SolrException: [doc=null] missing required field: id at
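The log above says the document reached Solr without a value for the required "id" field. A minimal sketch, assuming "id" is the only required field in the schema and using the hypothetical helper name build_add_xml, of checking for that on the client before building the XML update message:

```python
import xml.etree.ElementTree as ET

# Assumption: mirrors the fields marked required="true" in schema.xml.
REQUIRED_FIELDS = ("id",)


def build_add_xml(doc):
    """Return a Solr XML <add> message for `doc`, a dict of field -> value.

    Raises ValueError when a required field is absent or empty, so the
    problem shows up before the request is sent.
    """
    missing = [name for name in REQUIRED_FIELDS if not doc.get(name)]
    if missing:
        raise ValueError("missing required field(s): " + ", ".join(missing))
    add = ET.Element("add")
    doc_el = ET.SubElement(add, "doc")
    for name, value in doc.items():
        field = ET.SubElement(doc_el, "field", name=name)
        field.text = str(value)
    return ET.tostring(add, encoding="unicode")
```

The returned string is what would be posted to the update handler; the field names in any real document depend on the schema in use.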

Re: How can I get the top term in solr?

2012-04-22 Thread Dan Tuffery
1) The TermsComponent will return the top terms: http://wiki.apache.org/solr/TermsComponent
2) Add 'debugQuery=on' to your query and look at the 'explain' section in the results to see how many times the term appears in the document (tf).
On Fri, Apr 20, 2012 at 5:31 PM,
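A minimal sketch of a TermsComponent request for the top terms of one field. It assumes a "/terms" request handler is configured in solrconfig.xml; the base URL and field name in the usage line are placeholders, and terms_url is a hypothetical helper name.

```python
from urllib.parse import urlencode


def terms_url(base_url, field, limit=10):
    """Build a TermsComponent URL returning the `limit` most frequent terms."""
    params = {
        "terms": "true",         # enable the component
        "terms.fl": field,       # field to read terms from
        "terms.limit": limit,    # how many terms to return
        "terms.sort": "count",   # most frequent terms first
    }
    return base_url.rstrip("/") + "/terms?" + urlencode(params)
```

For example, terms_url("http://localhost:8983/solr", "text", 5) builds the request for the five most frequent terms in a field named "text".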

Re: Solr Indexing error in this function

2012-04-22 Thread Gora Mohanty
On 22 April 2012 15:33, vasuj vasu.j...@live.in wrote:
Log is:
Apr 22, 2012 2:55:17 AM org.apache.solr.update.processor.LogUpdateProcessor finish
INFO: {add=[(null)]} 0 17
Apr 22, 2012 2:55:17 AM org.apache.solr.common.SolrException log
SEVERE: org.apache.solr.common.SolrException:

Re: Solr Indexing error in this function

2012-04-22 Thread vasuj
Yes, it worked. Thanks, Gora. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-Indexing-error-in-this-function-tp3929446p3929673.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: EmbeddedSolrServer and StreamingUpdateSolrServer

2012-04-22 Thread Mikhail Khludnev
To be honest, I have no idea. Can you try shutting down the first process's JVM after it has completed indexing, and start the second JVM only after that? Does that work? Which version of Solr are you running? On Fri, Apr 20, 2012 at 8:14 AM, pcrao purn...@gmail.com wrote: Hi, Any update? Thanks, PC

Re: Large Index and OutOfMemoryError: Map failed

2012-04-22 Thread Michael McCandless
Is it possible you are hitting this (just opened) Solr issue?: https://issues.apache.org/jira/browse/SOLR-3392 Mike McCandless http://blog.mikemccandless.com On Fri, Apr 20, 2012 at 9:33 AM, Gopal Patwa gopalpa...@gmail.com wrote: We cannot avoid auto soft commit, since we need Lucene NRT

Re: SolrCloud: Programmatically create multiple collections?

2012-04-22 Thread Mark Miller
Hey Ravi - yeah, I know this is kind of confusing. The issue is that the true state is actually the advertised state in clusterstate.json *and* whether or not a node is listed on live_nodes. The reason this is the case is that if a node just dies, it may have left its current in *any* state.
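A minimal sketch of the rule described above: the state to trust is the advertised one only while the node is still listed under live_nodes. The function name, the argument shapes, and the "down" label returned for a missing node are assumptions for illustration, not SolrCloud API.

```python
def effective_state(node_name, advertised_state, live_nodes):
    """Combine the advertised state with live_nodes membership.

    advertised_state: the state string read from clusterstate.json.
    live_nodes: the set of node names currently listed under live_nodes.
    """
    if node_name not in live_nodes:
        # The node died and could not update its advertised state,
        # so whatever clusterstate.json says is stale.
        return "down"
    return advertised_state
```

A node advertised as "active" but absent from live_nodes is therefore treated as down.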

RE: Crawling an SCM to update a Solr index

2012-04-22 Thread Van Tassell, Kristian
Otis, Thanks for the input! Were it not for the metadata I need to extract and the slight possibility that a sync error, file system error, or inconsistency could occur, I would take that same route. -Kristian -Original Message- From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] Sent:

searcher leak on trunk after 2/1/2012

2012-04-22 Thread Yonik Seeley
Folks, If you're using a trunk version after 2/1/2012 in conjunction with the shipped solrconfig.xml (which uses openSearcher=false in an autoCommit by default), then you should upgrade to a new version. There's a searcher leak when openSearcher=false is used with a commit that leads to files not

'Error 404: missing core name in path' in Solr

2012-04-22 Thread vasuj
I used the server.deleteByQuery( *:* ); // CAUTION: deletes everything! query in my Solr indexing program (screenshot: http://lucene.472066.n3.nabble.com/file/n3931194/Screenshot_%2847%29.png). Since then I have been receiving the error whenever I go to http://localhost:8080/solr/admin/ and press search with query

Re: How can I get the top term in solr?

2012-04-22 Thread neosky
You are very helpful. Thanks a lot!

The index speed in the solr

2012-04-22 Thread neosky
It takes me 50 hours to index a 9 GB file (about 2,000,000 documents) with an n-gram filter (min=6, max=10). My tokens before the n-gram filter are long (not words; at most 300,000 bytes, including white space). I split the input into 4 files and use post.sh to update them at the same time. I also tried to write a
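A rough sketch of why this configuration is expensive, assuming the n-gram filter emits every gram of each size from min to max and that token length is counted in characters (one per byte). The helper name ngram_count is hypothetical.

```python
def ngram_count(token_length, min_gram, max_gram):
    """Count the n-grams produced from one token of `token_length` characters.

    For each gram size n there are token_length - n + 1 grams, or none
    when the token is shorter than n.
    """
    return sum(
        max(token_length - n + 1, 0)
        for n in range(min_gram, max_gram + 1)
    )


# Worst case from the message: one 300,000-character token, min=6, max=10.
print(ngram_count(300_000, 6, 10))  # 1499965
```

Under these assumptions a single maximum-length token expands to roughly 1.5 million grams, so the index work grows about five times faster than the raw input size.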