[jira] [Commented] (SOLR-2341) Shard distribution policy

2011-07-19 Thread Alex Cowell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13067651#comment-13067651 ] Alex Cowell commented on SOLR-2341: --- bq. Where would this be plugged into Solr

Re: Load balancing within distributed search

2011-03-01 Thread Alex Cowell
Hi Upayavira, I believe there is a basic implementation of load balancing for distributed search. The submit() method in the HttpCommComponent class within SearchHandler contains code which splits a pipe character delimited string of shards and uses a LoadBalancedHttpSolrServer to send requests

[jira] Commented: (SOLR-2358) Distributing Indexing

2011-02-16 Thread Alex Cowell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12995570#comment-12995570 ] Alex Cowell commented on SOLR-2358: --- bq. Since this functionality is core to Solr

Re: Distributed Indexing

2011-02-14 Thread Alex Cowell
I've uploaded a patch of what we've done so far: https://issues.apache.org/jira/browse/SOLR-2358 It's still very much work in progress and there are some obvious issues which are being resolved at the moment (such as the inefficient method of waiting for all the docs to be processed before

[jira] Updated: (SOLR-2358) Distributing Indexing

2011-02-14 Thread Alex Cowell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Cowell updated SOLR-2358: -- Attachment: SOLR-2358.patch Added a patch which handles distributed add and commit update requests

Re: Distributed Indexing

2011-02-06 Thread Alex Cowell
Hey, We're making good progress, but our DistributedUpdateRequestHandler is having a bit of an identity crisis, so we thought we'd ask what other people's opinions are. The current situation is as follows: We've added a method to ContentStreamHandlerBase to check if an update request is

Re: Distributed Indexing

2011-02-03 Thread Alex Cowell
Hi all, Just a couple of questions that have arisen. 1. For handling non-distributed update requests (shards param is not present or is invalid), our code currently - assumes the user would like the data indexed, so gets the request handler assigned to /update - executes the request

Re: Distributed Indexing

2011-02-01 Thread Alex Cowell
Your code looks fine to me, except it should take in a SolrDocument object or list of, rather than strings. Then, for your Hash version, you can take a hash of the id field. As far as I can see I have access to a ListContentStream that represents all of the files being POSTed. Do I want to

Re: Distributed Indexing

2011-02-01 Thread Alex Cowell
I noticed there is a comment in the org.apache.solr.servlet.DirectSolrConnection class which reads, //Find a way to turn ListContentStream into File/SolrDocument. Did anyone find a way to do this? Turns out that comment was left over from some experimenting one of our team was doing. But I

Re: Distributed Indexing

2011-01-28 Thread Alex Cowell
Hi Yonik and Upayavira, Thank you both for your insightful responses. We now have a much better understanding of how to implement distributed indexing, although no doubt more issues will emerge along the way. Just to clarify (and for critique), our approach goes something like this: We will use

Re: Distributed Indexing

2011-01-27 Thread Alex Cowell
Making it easy for clients I think is key... one should be able to update any node in the solr cluster and have solr take care of the hard part about updating all relevant shards. This will most likely involve an update processor. This approach allows all existing update methods (including

Re: Distributed Indexing

2011-01-26 Thread Alex Cowell
Hi Soheb, Sounds good! A few things I thought of: With regard to #1, would the list of shards to index to (if present) be exclusive or would we assume that the shard the update request was sent to should also be included? For example, say, using the example you gave, an update request was sent

[jira] Updated: (SOLR-2287) (SolrCloud) Allow users to query by multiple collections

2011-01-19 Thread Alex Cowell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Cowell updated SOLR-2287: -- Attachment: SOLR-2287.patch Removed unnecessary XML solr config files from previous patch and added

[jira] Updated: (SOLR-2287) (SolrCloud) Allow users to query by multiple collections

2011-01-17 Thread Alex Cowell (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Cowell updated SOLR-2287: -- Attachment: SOLR-2287.patch Added a test class which tests basic functionality for 3 collections

Allow user to query by (multiple) collections

2010-11-22 Thread Alex Cowell
are currently in the process of developing unit tests for our code. We're open to feedback and we'll keep this list updated with our progress. Alex Cowell, Soheb Mahmood, William Mayor, Konrad Kowalski, Ahillen Kuganesan, Andy Gamblin, Hamza Sethi