Re: MoreLikeThis - How to pass in external text?

2010-01-22 Thread David Stuart
The MoreLikeThisHandler allows external text to be streamed to it see http://wiki.apache.org/solr/MoreLikeThisHandler#Using_ContentStreams. The url feature is quite good if you have a lot of text and start hitting the character limit in the url Regards, Dave On 22 Jan 2010, at 05:24, Otis

Re: Find newly added documents

2010-01-22 Thread Sandeep Tagore
Thanks a lot Erik. Is there any other alternate way? Thanks a lot for your response. Regards, Sandeep You'll be able to find them only after a commit. One way to do this is index a timestamp with every document, and find the latest ones using that field. There's an example of an automatic

Re: Find newly added documents

2010-01-22 Thread Erik Hatcher
You can do a search, sort by the special _docid_ field (underscores mandatory) descending and the top documents listed will be the latest added. Like this, un-url-encoded: q=*:*sort=_docid_ desc Erik On Jan 22, 2010, at 3:39 AM, Sandeep Tagore wrote: Thanks a lot Erik. Is

RE: Solr vs. Compass

2010-01-22 Thread Minutello, Nick
Agree with everything you said. -Original Message- From: Uri Boness [mailto:ubon...@gmail.com] Sent: 22 January 2010 01:25 To: solr-user@lucene.apache.org Subject: Re: Solr vs. Compass There seems to be an implication that compass wont scale as well as solr - and I'm not sure

RE: Solr vs. Compass

2010-01-22 Thread Minutello, Nick
I would tend to agree. -Original Message- From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] Sent: 22 January 2010 05:18 To: solr-user@lucene.apache.org Subject: Re: Solr vs. Compass Hi Ken, Based on this, Solr sounds like the way to go. Otis -- Sematext --

Re: What is this error means?

2010-01-22 Thread Oleg Burlaca
Hello, I've just installed Solr1.4 on a CentOS machine and wanted to index 300-500 documents from a PHP script. I run the script from a WindowsXP host. Sporadically I'm getting the same error as Ellery Leung: Communication Error' in C:\httpd_docs\www\lib\Apache\Solr\Service.php:385 Stack

Re: ArrayIndexOutOfBoundsException when highlighting (Solr 1.4)

2010-01-22 Thread Koji Sekiguchi
Tim Underwood wrote: I'm seeing an java.lang.ArrayIndexOutOfBoundsException when trying to highlight for certain queries. The error seems to be an issue with the combination of the ShingleFilterFactory, PositionFilterFactory and the LengthFilterFactory. Here's my fieldType definition:

solr with tomcat in cluster mode

2010-01-22 Thread ZAROGKIKAS,GIORGOS
Hi I'm using solr 1.4 with tomcat in a single pc and I want to turn it in cluster mode with 2 nodes and load balancing But I can't find info how to do Is there any manual or a recorded procedure on the internet to do that Or is there anyone to help me ?

Re: solr with tomcat in cluster mode

2010-01-22 Thread Matt Mitchell
We have a similar setup and I'd be curious to see how folks are doing this as well. Our setup: A few servers and an F5 load balancer. Each Solr instance points to a shared index. We use a separate server for indexing. When the index is complete, we do some juggling using the Core Admin SWAP

request handler defaults

2010-01-22 Thread Lukas Kahwe Smith
Hi, I have noticed that atm there doesnt seem to be a way to inherit request handler definitions. This would be nice to be able to define some basic requesthandlers (maybe even with the option of defining them abstract) which could be extended. For example I have one dismax request handler for

Re: How to store a dense field value efficiently

2010-01-22 Thread Erick Erickson
I'm surprised by a 30% increase. The approach of adding a special token for not present is one of the standard ones So just to check, when you say stored, are you really storing the missing value? As in Field.Store.YES? As opposed to Field.Index.###? Because theres no need to Store this

Re: How to store a dense field value efficiently

2010-01-22 Thread Erick Erickson
Oops, that's a Lucene bit (got confused which list I was on). You can still control storing the raw text in SOLR, so my question is still relevant, but the solution may be different. Do you store the fields? Erick On Fri, Jan 22, 2010 at 10:27 AM, Erick Erickson erickerick...@gmail.comwrote:

Re: Improvising solr queries

2010-01-22 Thread Erick Erickson
Take a look at the Wiki, here's a bit to start... http://lucene.apache.org/solr/features.html http://lucene.apache.org/solr/features.htmlThe short form is that when an index is first opened, there are various caches that are initialized. The first few queries that run against a new searcher are

Re: solr blocking on commit

2010-01-22 Thread Jerome L Quinn
Otis Gospodnetic otis_gospodne...@yahoo.com wrote on 01/22/2010 12:20:45 AM: I'm missing the bigger context of this thread here, but from the snippet below - sure, commits cause in-memory index to get written to disk, that causes some IO, and that *could* affect search *if* queries are

Re: performance issue

2010-01-22 Thread Matthieu Labour
Hi Thank you for your reponse Which version of solr? I inherited the project so not exactly sure ... in CHANGES.txt it says Apache Solr Version 1.4-dev $Id: CHANGES.txt 793090 2009-07-10 19:40:33Z yonik $ What garbage collection parameters? ulimit -n 10 ; nohup java -server

Re: performance issue

2010-01-22 Thread Grant Ingersoll
You likely are running into problems by having 1000 cores and simply don't have enough memory to handle that much. You should look at what you are filtering, sorting, faceting on and see if there are ways you can tune these. Take a look at your cache configurations as well. What are your

Re: ArrayIndexOutOfBoundsException when highlighting (Solr 1.4)

2010-01-22 Thread Tim Underwood
Issue created: https://issues.apache.org/jira/browse/SOLR-1731 On Fri, Jan 22, 2010 at 5:42 AM, Koji Sekiguchi k...@r.email.ne.jp wrote: Tim Underwood wrote: I'm seeing an java.lang.ArrayIndexOutOfBoundsException when trying to highlight for certain queries. The error seems to be an issue

Is there limit on size of query string?

2010-01-22 Thread Fuad Efendi
Is there limit on size of query string? Looks like I have exceptions when query string is higher than 400 characters (average) Thanks!

Re: Is there limit on size of query string?

2010-01-22 Thread Yonik Seeley
On Fri, Jan 22, 2010 at 11:48 AM, Fuad Efendi f...@efendi.ca wrote: Is there limit on size of query string? Looks like I have exceptions when query string is higher than 400 characters (average) This is most likely going to be a servlet container issue. Are you using the bundled Jetty or

CoreContainer / getCore and create ?

2010-01-22 Thread Matthieu Labour
Hi Would it make sense to modify/ add a method to CoreContainer that creates a core if the core doesn't exist ? something like public SolrCore getCore(String name) { synchronized(cores) { SolrCore core = cores.get(name); if (core != null) core.open(); // increment the

understanding termVector output

2010-01-22 Thread Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS]
Hi, I'm trying to see if I can use termVectors for a use case I have. Essentially I want to know is: where in the indexed value does the query hit occur? I think either tv.positions or tv.offsets would provide that info but I don't really grok the result. Below I've pasted the URL and part

How to Index multiple tables using SOLR

2010-01-22 Thread JavaGuy84
Hi, Eventhough I am new to SOLR I was able to successfully index a single table in a very short span of time. Now we have a requirement where the search needs to happen on multiple tables (mutiple table indexes) at the same time. I couldnt figure out a way to index more than one table in SOLR

Re: solr blocking on commit

2010-01-22 Thread Erick Erickson
Wait, wait, wait What about near-real-time? See: http://issues.apache.org/jira/browse/SOLR-1606 (Integrate near real time) http://issues.apache.org/jira/browse/SOLR-1606I don't think this is production-ready yet, or if you're willing to run patched code, but it sure seems like you want to

RE: How to store a dense field value efficiently

2010-01-22 Thread Dallan Quass
Sorry - I meant indexed. I don't store the fields. --dallan -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Friday, January 22, 2010 9:30 AM To: solr-user@lucene.apache.org Subject: Re: How to store a dense field value efficiently Oops, that's a

RE: Is there limit on size of query string?

2010-01-22 Thread Jennifer Luo
From my test, they query doesn't have limit. The maximum query length I used is more than 1M. However if you use Http Get, there is a length limit on querystring. You can try to use HTTP POST, using ContentType application/x-www-form-urlencoded Jennifer Luo -Original Message- From:

Re: Is there limit on size of query string?

2010-01-22 Thread Jérôme Charron
Hi, I recently had the same problem. The solution was to increase the maxHttpHeaderSize in tomcat server.xml file and everything works fine now. Best regards Jérôme On Fri, Jan 22, 2010 at 19:33, Jennifer Luo jenni...@talenttech.com wrote: From my test, they query doesn't have limit. The

RE: commit fails on weblogic

2010-01-22 Thread Joe Kessel
Within the weblogic console I have unchecked the Enable Keepalives and have been able to get by this error on commit, but it now fails on optimize. Using TCPMon it was noticed that multiple request where on the same connection, including the commit. As I've read that Solr runs fine on

RE: Is there limit on size of query string?

2010-01-22 Thread Ankit Bhatnagar
Try using POST Ankit -Original Message- From: ysee...@gmail.com [mailto:ysee...@gmail.com] On Behalf Of Yonik Seeley Sent: Friday, January 22, 2010 12:22 PM To: solr-user@lucene.apache.org Subject: Re: Is there limit on size of query string? On Fri, Jan 22, 2010 at 11:48 AM, Fuad

Re: performance issue

2010-01-22 Thread Otis Gospodnetic
Can you run 32-bit Java there? Will use less memory! :) Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message From: Matthieu Labour matth...@strateer.com To: solr-user@lucene.apache.org Sent: Fri, January 22, 2010 11:07:45 AM Subject: Re:

Re: solr with tomcat in cluster mode

2010-01-22 Thread Otis Gospodnetic
This should work fine. But why are you indexing to a separate index/core? Why not index in the very same index you are searching? Slaves won't see changes until their searchers re-open. Otis -- Sematext -- http://sematext.com/ -- Solr - Lucene - Nutch - Original Message From: Matt

Re: solr with tomcat in cluster mode

2010-01-22 Thread Matt Mitchell
Hey Otis, We're indexing on a separate machine because we want to keep our production nodes away from processes like indexing. The indexing server also has a ton of resources available, more so than the production nodes. We set it up as an indexing server at one point and have decided to stick

Re: MoreLikeThis - How to pass in external text?

2010-01-22 Thread ldung
David, thank you. This was what I was looking for! David Stuart-6 wrote: The MoreLikeThisHandler allows external text to be streamed to it see http://wiki.apache.org/solr/MoreLikeThisHandler#Using_ContentStreams. The url feature is quite good if you have a lot of text and start hitting

CoreAdmin CREATE instancedir recommendation needed

2010-01-22 Thread Joe Kessel
I have been using Solr 1.4 on Tomcat and had been making use of the current working directory unknowingly. I programmatically create cores as needed. I create a core via the CoreAdmin by passing in the core name ( CORE_01 ), instancedir ( CORE_01 ), config ( solr/conf/solrconfig.xml ), and

Re: Solr under tomcat - UTF-8 issue

2010-01-22 Thread Frank Wesemann
Glock, Thomas schrieb: My flex client httpservice by default only sets the content-type request header to application/x-www-form-urlencoded what it needed to do for tomcat is set the content-type request header to content-type = application/x-www-form-urlencoded; charset=UTF-8; As some

determine which value produced a hit in multivalued field type

2010-01-22 Thread Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS]
Hi, If I have a multiValued field type of text, and I put values [cat,dog,green,blue] in it. Is there a way to tell when I execute a query against that field for dog, that it was in the 1st element position for that multiValued field? Thanks! Tim

Re: determine which value produced a hit in multivalued field type

2010-01-22 Thread Lance Norskog
No, the return order is not guaranteed. On Fri, Jan 22, 2010 at 2:52 PM, Harsch, Timothy J. (ARC-TI)[PEROT SYSTEMS] timothy.j.har...@nasa.gov wrote: Hi, If I have a multiValued field type of text, and I put values [cat,dog,green,blue] in it.  Is there a way to tell when I execute a query

RE: Solr vs. Compass

2010-01-22 Thread Fuad Efendi
Yes, transactional, I tried it: do we really need transactional? Even if commit takes 20 minutes? It's their selling point nothing more. HBase is not transactional, and it has specific use case; each tool has specific use case... in some cases Compass is the best! Also, note that Compass

RE: Solr vs. Compass

2010-01-22 Thread Fuad Efendi
Of course, I understand what transaction means; have you guys been thinking some about what may happen if we transfer $123.45 from one banking account to another banking account, and MySQL forgets to index decimal during transaction, or DBA was weird and forgot to create an index? Absolutely

RE: SOLR Performance Tuning: Fuzzy Searches, Distance, BK-Tree

2010-01-22 Thread Fuad Efendi
http://issues.apache.org/jira/browse/LUCENE-2230 Enjoy! -Original Message- From: Fuad Efendi [mailto:f...@efendi.ca] Sent: January-19-10 11:32 PM To: solr-user@lucene.apache.org Subject: SOLR Performance Tuning: Fuzzy Searches, Distance, BK-Tree Hi, I am wondering: will SOLR

Re: determine which value produced a hit in multivalued field type

2010-01-22 Thread Erik Hatcher
Wait, let's be clear here Lance... multivalued fields are indeed guaranteed to return in the order they were indexed. However, you can't really tell, using this example, that a query for dog matches which instance of that field. You maybe kinda can using highlighting though?

Re: Hindi language support in solr

2010-01-22 Thread Ranveer kumar
Hi Robert, Thanks for reply. As you write, I used textgen but still not able to search hindi text. Might be missing some important configuration. following is my schema.xml configuration fieldType name=textgen class=solr.TextField positionIncrementGap=100 analyzer type=index