[jira] Commented: (SOLR-1044) Use Hadoop RPC for inter Solr communication

Noble Paul (JIRA) Wed, 04 Mar 2009 19:50:20 -0800

    [ 
https://issues.apache.org/jira/browse/SOLR-1044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12679061#action_12679061
 ]


Noble Paul commented on SOLR-1044:
----------------------------------

bq.But the system quickly reaches steady state, right? That new connection will 
be persistent and hang around for a while to be reused again when needed.
the system reaches a steady state where the no:of connections would be slightly 
greater than the maximum no:of parallel requests. whereas a system using a 
message based RPC will still have only a single connection between 2 Solrs. 

bq.For a high-fanout distributed search, a more important part might actually 
be message parsing (independent of transport used)

As Solr move towards other applications such as mapreduce/mahout where the 
operations do not involve disk IO and where the payload is small there can be a 
problem.

My tests with hadoop RPC showed it outperforming tomcat when I used a small 
payload (5 bytes)

> Use Hadoop RPC for inter Solr communication
> -------------------------------------------
>
>                 Key: SOLR-1044
>                 URL: https://issues.apache.org/jira/browse/SOLR-1044
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>            Reporter: Noble Paul
>
> Solr uses http for distributed search . We can make it a whole lot faster if 
> we use an RPC mechanism which is more lightweight/efficient. 
> Hadoop RPC looks like a good candidate for this.  
> The implementation should just have one protocol. It should follow the Solr's 
> idiom of making remote calls . A uri + params +[optional stream(s)] . The 
> response can be a stream of bytes.
> To make this work we must make the SolrServer implementation pluggable in 
> distributed search. Users should be able to choose between the current 
> CommonshttpSolrServer, or a HadoopRpcSolrServer . 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (SOLR-1044) Use Hadoop RPC for inter Solr communication

Reply via email to