Re: Setting termInfosIndexDivisor and Interval?

2009-07-20 Thread Shalin Shekhar Mangar
On Mon, Jul 20, 2009 at 8:04 AM, Jason Rutherglen jason.rutherg...@gmail.com wrote: Are we currently supporting this or in 1.4? (i.e. IndexReader.open and IndexWriter.setTermIndexInterval) It's useful for trie range, shingles, etc, where many terms are potentially created. No, we don't

Help needed with Solr maxBooleanClauses

2009-07-20 Thread dipanjan_pramanick
Hi, We have scenario where we need to send more than 1024 ids in the Solr url as OR condition. I have changed the value of maxBooleanClauses in solrconfig.xml, to 2048, but it is failing after handling 1024 OR conditions. Solr is throwing SEVERE: org.apache.solr.common.SolrException: Bad Request

How to configure Solr in Glassfish ?

2009-07-20 Thread huenzhao
I want use glassfish as the solr search server, but I don't know how to configure. Anybody knows? enzhao...@gmail.com Thanks! -- View this message in context: http://www.nabble.com/How-to-configure-Solr--in-Glassfish---tp24565758p24565758.html Sent from the Solr - User mailing list archive

Re: Help needed with Solr maxBooleanClauses

2009-07-20 Thread Shalin Shekhar Mangar
On Mon, Jul 20, 2009 at 1:37 PM, dipanjan_pramanick dipanjan_praman...@infosys.com wrote: Hi, We have scenario where we need to send more than 1024 ids in the Solr url as OR condition. I have changed the value of maxBooleanClauses in solrconfig.xml, to 2048, but it is failing after handling

Re: Help needed with Solr maxBooleanClauses

2009-07-20 Thread dipanjan_pramanick
Hi Shalin, Thanks for your time to respond to this issue. Its true that there is a design flaw, because of what we need to support a huge list of OR conditions through Solr. But still I would like to know if there is any other configuration other than the one in solrConfig.xml, through which we

Re: Help needed with Solr maxBooleanClauses

2009-07-20 Thread Shalin Shekhar Mangar
On Mon, Jul 20, 2009 at 2:12 PM, dipanjan_pramanick dipanjan_praman...@infosys.com wrote: Its true that there is a design flaw, because of what we need to support a huge list of OR conditions through Solr. But still I would like to know if there is any other configuration other than the one

Confusion around Binary/XML in SolrJ

2009-07-20 Thread Code Tester
I am using solr 1.4 dev in a multicore way. Each of my core's solrconfig.xml has the following lines requestHandler name=/update class=solr.XmlUpdateRequestHandler / requestHandler name=/update/javabin class=solr.BinaryUpdateRequestHandler / I am using SolrJ as EmbeddedSolrServer. When I try to

Re: Confusion around Binary/XML in SolrJ

2009-07-20 Thread Code Tester
Another observation: I am even unable to delete documents using the EmbeddedSolrServer ( on a specific core ) Steps: 1) I have 2 cores ( core0 , core1 ) Each of them have ~10 records. 2) System.setProperty(solr.solr.home, /home/user/projects/solr/example/multi); File home = new

Re: Help needed with Solr maxBooleanClauses

2009-07-20 Thread dipanjan_pramanick
Hi Shalin, We just found that there is no limit on Solr side about the maximum boolean condition. We have set the maxBooleanClauses2048/maxBooleanClauses and we are able to send about 1574 OR conditions. Over that limit, we are getting HTTP/1.1 400 Bad Request. You are correct, it's not a Solr

post error - ERROR:unknown field 'title'

2009-07-20 Thread rossputin
Hi guys. I have two different solr versions as I am evaluating nightly builds. On a more recent one.. I think 15th July I am getting the following error : ERROR:unknown field 'title' I am posting to 'solr/update/extract' with the following: curl

Re: Confusion around Binary/XML in SolrJ

2009-07-20 Thread Code Tester
Sorry everyone. Found the issue. It was because of a very stupid assumption. My code and solr were running as 2 different processes! ( Weird part is that when I run the code using EmbeddedSolrServer, it did not throw any exception that there was already a server running on that port. ) Thanks!

Solr and UIMA

2009-07-20 Thread JCodina
We are starting to use UIMA as a platform to analyze the text. The result of analyzing a document is a UIMA CAS. A Cas is a generic data structure that can contain different data. UIMA processes single documents, They get the documents from a CAS producer, process them using a PIPE that the

Wildcards at the Beginning of a Search.

2009-07-20 Thread Jörg Agatz
Hallo Solr Users... I tryed to search with a Wildcard at the beginning from a search. for example, i will search for *est and get test, vogelnest, fest, But it dosent work, i alsways get an error... Now my Big brother GOOGLE tolds me, that it can work but a search with a Wildcad at the

Re: Help needed with Solr maxBooleanClauses

2009-07-20 Thread Avlesh Singh
If yours is a JAVA stack of application, I would recommend moving to SolrJ. It is a client API which lets you talk to Solr. Know more about it here - http://wiki.apache.org/solr/Solrj Clients API's for other languages can be found here -

Re: Confusion around Binary/XML in SolrJ

2009-07-20 Thread Erik Hatcher
On Jul 20, 2009, at 6:11 AM, Code Tester wrote: I am even unable to delete documents using the EmbeddedSolrServer ( on a specific core ) Steps: 1) I have 2 cores ( core0 , core1 ) Each of them have ~10 records. 2) System.setProperty(solr.solr.home, /home/user/projects/solr/example/multi);

Re: Wildcards at the Beginning of a Search.

2009-07-20 Thread Erik Hatcher
See http://issues.apache.org/jira/browse/SOLR-218 - Solr currently does not have leading wildcard support enabled. Erik On Jul 20, 2009, at 8:09 AM, Jörg Agatz wrote: Hallo Solr Users... I tryed to search with a Wildcard at the beginning from a search. for example, i will search

Posting multiple documents at once - clarification

2009-07-20 Thread Vannia Rajan
Hi, When we post a file with a number of documents of the format shown below to solr, if there is some 'error' in one of the doc, then all the docs in the file are error-ed out and not added to the Solr-index. ?xml add doc ...

Re: Posting multiple documents at once - clarification

2009-07-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
if the error is an xml parsing error there is no way of continuing from that point. even otherwise , solr assumes that if the whole payload is not correct it is to be discarded On Mon, Jul 20, 2009 at 6:32 PM, Vannia Rajankvanniara...@gmail.com wrote: Hi,  When we post a file with a number of

RE: Word frequency count in the index

2009-07-20 Thread Daniel Alheiros
Hi Wunder, Thanks for your reply! I take your point. It has to be appropriate to your content... In the cases I deal with, using stop words wouldn't be a big deal because the documents we handle are usually a proper article (although titles could still be impacted by it). I based my stop words

method inform of SolrCoreAware callled 2 times

2009-07-20 Thread Marc Sturlese
Hey there, I have implemented a custom component wich extends SearchComponent and implements SolrCoreAware. I have decalred it in solrconfig.xml as: searchComponent name=mycomp class=solr.MyCustomComponent And added it in my Searchhandler as: arr name=last-components strmycomp/str

Re: Wildcards at the Beginning of a Search.

2009-07-20 Thread Jeff Newburn
There is a hacky way to do it if you can pull it off. You can prepend some known prefix to the field then strip it off when you get the results back. An example would be putting Phone: in front of every value in a phone number field then instead of searching like this *-111- (which won't

Re: Wildcards at the Beginning of a Search.

2009-07-20 Thread Reza Safari
Add setAllowLeadingWildcard(true); to the constructor of org.apache.solr.search.SolrQueryParser.java Gr, Reza On Jul 20, 2009, at 4:00 PM, Jeff Newburn wrote: There is a hacky way to do it if you can pull it off. You can prepend some known prefix to the field then strip it off when you

Solr tika and posting .pst files

2009-07-20 Thread S.Selvam
Hi, I am using Solr-Tika to post various files.When i try to post .pst file(outlook express), the file is being posted but it does not contain any data.I could not found anything useful after googling. Regarding solrschema , i use 1) id 2) content(this is the default field) Do i need to

RE: Wildcards at the Beginning of a Search.

2009-07-20 Thread Brian Klippel
Depending on how you are sending docs in for indexing, you could also add an additional field who's value was a string reverse of the primary value. Then search that field with a trialing wildcard. -Original Message- From: Jeff Newburn [mailto:jnewb...@zappos.com] Sent: Monday, July

Re: Posting multiple documents at once - clarification

2009-07-20 Thread Vannia Rajan
2009/7/20 Noble Paul നോബിള്‍ नोब्ळ् noble.p...@corp.aol.com if the error is an xml parsing error there is no way of continuing from that point. even otherwise , solr assumes that if the whole payload is not correct it is to be discarded Thank you for your response -- Thanks, Vanniarajan

Implementing related tags

2009-07-20 Thread James T
Hi, I have a specific requirement for searching and looking for some help from the community on how to achieve it using solr: I need to index 1million + documents. Each document contains ( among other fields ) 3 fields representing the category which that doc belongs to. For example ( a very

Recommended Articles

2009-07-20 Thread Jeff Newburn
Does anyone have links or books to recommended reading on search in general. Would like to see some literature on larger search concepts and ideas. -- Jeff Newburn Software Engineer, Zappos.com jnewb...@zappos.com - 702-943-7562

Re: Recommended Articles

2009-07-20 Thread darren
http://www.amazon.com/s/ref=nb_ss_gw?url=search-alias%3Dstripbooksfield-keywords=searchx=0y=0 Does anyone have links or books to recommended reading on search in general. Would like to see some literature on larger search concepts and ideas. -- Jeff Newburn Software Engineer, Zappos.com

Re: Recommended Articles

2009-07-20 Thread Mark Miller
dar...@ontrenet.com wrote: http://www.amazon.com/s/ref=nb_ss_gw?url=search-alias%3Dstripbooksfield-keywords=searchx=0y=0 Does anyone have links or books to recommended reading on search in general. Would like to see some literature on larger search concepts and ideas. -- Jeff Newburn

Solr JMX and Cacti

2009-07-20 Thread Edward Capriolo
Hey all, We have several deployments of Solr across our enterprise. Our largest one is a several GB and when enough documents are added an OOM exception is occurring. To debug this problem I have enable JMX. My goal is to write some cacti templates similar to the ones I have done for hadoop.

Re: Solr JMX and Cacti

2009-07-20 Thread Ryan McKinley
On Jul 20, 2009, at 8:47 AM, Edward Capriolo wrote: Hey all, We have several deployments of Solr across our enterprise. Our largest one is a several GB and when enough documents are added an OOM exception is occurring. To debug this problem I have enable JMX. My goal is to write some cacti

SolrJ embedded server : error while adding document

2009-07-20 Thread Gérard Dupont
Hi SolR guys, I'm starting to play with SolR after few years with classic Lucene. I'm trying to index a single document using the embedded server, but I got a strange error which looks like XML parsing problem (see trace hereafter). To add details, this is a simple Junit which create single

Re: SolrJ embedded server : error while adding document

2009-07-20 Thread Gérard Dupont
my mistake, pb with the buffer I added. But it raises a question : does solr (using embedded server) has its own buffer mechanism in indexing or not ? I guess not but I might be wrong. 2009/7/20 Gérard Dupont ger.dup...@gmail.com Hi SolR guys, I'm starting to play with SolR after few years

Exception searching PhoneticFilterFactory field with number

2009-07-20 Thread Robert Petersen
Reposting in hopes of an answer... Hello all, I am getting the following exception whenever a user includes a numeric term in their search, and the search includes a field defined with a PhoneticFilterFactory and further it occurs whether I use the DoubleMetaphone encoder or any other.

Re: Implementing related tags

2009-07-20 Thread Avlesh Singh
Have a look at the MoreLikeThis component - http://wiki.apache.org/solr/MoreLikeThis Cheers Avlesh On Mon, Jul 20, 2009 at 8:05 PM, James T codetester.codetes...@gmail.comwrote: Hi, I have a specific requirement for searching and looking for some help from the community on how to achieve it

Re: SolrJ embedded server : error while adding document

2009-07-20 Thread Ryan McKinley
not sure what you mean... yes, i guess... you send a bunch of requests with add( doc/collection ) and they are not visible until you send commit() On Jul 20, 2009, at 9:07 AM, Gérard Dupont wrote: my mistake, pb with the buffer I added. But it raises a question : does solr (using

Re: SolrJ embedded server : error while adding document

2009-07-20 Thread Gérard Dupont
On Mon, Jul 20, 2009 at 18:35, Ryan McKinley ryan...@gmail.com wrote: you send a bunch of requests with add( doc/collection ) and they are not visible until you send commit() That's what I meant thanks. -- Gérard Dupont Information Processing Control and Cognition (IPCC) - EADS DS

Indexing issue with XML control characters

2009-07-20 Thread Rupert Fiasco
During indexing I will often get this error: SEVERE: com.ctc.wstx.exc.WstxUnexpectedCharException: Illegal character ((CTRL-CHAR, code 3)) at [row,col {unknown-source}]: [2,1] at com.ctc.wstx.sr.StreamScanner.throwInvalidSpace(StreamScanner.java:675) By looking at this list and

Re: How to configure Solr in Glassfish ?

2009-07-20 Thread Mark Miller
What have you tried? Deploying the Solr war should be pretty straightforward. The main issue is likely setting solr.home. You likely have a lot of options there though. You can set a system property in the startup script, set a system property in the webapp context xml (if you can locate it), or I

Re: Solr JMX and Cacti

2009-07-20 Thread Edward Capriolo
On Mon, Jul 20, 2009 at 12:31 PM, Ryan McKinleyryan...@gmail.com wrote: On Jul 20, 2009, at 9:16 AM, Edward Capriolo wrote: On Mon, Jul 20, 2009 at 11:53 AM, Ryan McKinleyryan...@gmail.com wrote: On Jul 20, 2009, at 8:47 AM, Edward Capriolo wrote: Hey all, We have several deployments of

RE: multi-word synonyms with multiple matches

2009-07-20 Thread Ensdorf Ken
You haven't given us the full details on how you are using the SynonymFilterFactory (expand true or false?) but in general: yes the SynonymFilter finds the longest match it can. Sorry - doing expansion at index time: filter class=solr.SynonymFilterFactory synonyms=title_synonyms.txt

Re: Implementing related tags

2009-07-20 Thread James T
That does not seem to work fine. To further simplify the issue, assuming there is a multi valued tag field and number of docs is 1 million. By constrainting on a given tag, I need to search on the related tags. So Doc 1: tags: tagA, tagB, tagC, ball Doc 2: tags: tagA, bat Now constrainting

Re: Implementing related tags

2009-07-20 Thread Bill Au
Faceting on tags will give you all the related tags, including the original tag (tagA in your case). You will have to filter out the original tag on the client side if you don't want to show that. With Solar 1.4, you will be able to use localParam to exclude the original tag in the results. If

index version on slave

2009-07-20 Thread solr jay
If you ask for the index version of a slave instance, you always get version number being 0. Is it expected behavior? I am using this url http://slave_host:8983/solr/replication?command=indexversion This request returns correct version on master. If you use the 'details' command, you get the

Re: unable to run the solr in tomcat 5.0

2009-07-20 Thread aligu
try this: java -Durl=http://localhost:8080/solr/update -jar post.jar filename.xml it should work. HH uday kumar maddigatla wrote: hi you mis understood my question. When i try to use the command java -post.jar *.*. It is trying to Post files in Solr which is there in 8983 port. If

Re: hierarchical faceting discussion

2009-07-20 Thread Erik Hatcher
I was particularly surprised by the SOLR-64 numbers. What makes it's response so huge (and thus slow) to return the entire tree of facet counts? Erik On Jul 19, 2009, at 5:35 PM, Erik Hatcher wrote: I've posted the details of some experiments I just did comparing/ contrasting

Re: Recommended Articles

2009-07-20 Thread Óscar Marín Miró
I personally love this book: http://www.amazon.com/Building-Search-Applications-Lucene-LingPipe/dp/0615204252 It intermixes search with analysis: sentiment, named entity recognition, NLP Pipelines and so on... There's a little Nutch cameo too... On Mon, Jul 20, 2009 at 4:56 PM, Mark Miller

Re: Obtaining SOLR index size on disk

2009-07-20 Thread Peter Wolanin
Actually, if you have a server enabled as a replication master, the stats.jsp page reports the index size, so that information is available in some cases. -Peter On Sat, Jul 18, 2009 at 8:14 AM, Erik Hatchere...@ehatchersolutions.com wrote: On Jul 17, 2009, at 8:45 PM, J G wrote: Is it

Re: Exception searching PhoneticFilterFactory field with number

2009-07-20 Thread Otis Gospodnetic
Robert, Can you narrow things down by simplifying the query? For example, I see allDoublemetaphone:2226, which looks suspicious in the give me phonetic version of the input context, but if you could narrow it down, we could probably be able to help more. Otis -- Sematext --

Re: method inform of SolrCoreAware callled 2 times

2009-07-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
it is not normal to get the inform() called twice for a single object. which version of solr are you using? On Mon, Jul 20, 2009 at 7:17 PM, Marc Sturlesemarc.sturl...@gmail.com wrote: Hey there, I have implemented a custom component wich extends SearchComponent and implements SolrCoreAware.

DocList Pagination

2009-07-20 Thread pof
Hi, I am try to get the next DocList page in my custom search component. Could I get a code example of this? Cheers. -- View this message in context: http://www.nabble.com/DocList-Pagination-tp24581850p24581850.html Sent from the Solr - User mailing list archive at Nabble.com.