[ 
https://issues.apache.org/jira/browse/SOLR-4916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13683270#comment-13683270
 ] 

Mark Miller commented on SOLR-4916:
-----------------------------------

Thanks for taking a look AB!

bq. Re. Hadoop dependencies: the patch adds a hard dependency on Hadoop and its 
dependencies directly to Solr core. I wonder if it's possible to refactor it so 
that it could be optional and the functionality itself moved to contrib/ - this 
way only users who want to use HdfsDirectory would need Hadoop deps.

Yeah, I don't really beleive in Solr contribs - they are not so useful IMO - 
it's a pain to actually pull them out and it has to be done after the fact. 
Given the size of the dependencies is such a small percentage of the current 
size, that we don't want to support the UpdateLog as actually pluggable, and 
that it would be nice that hdfs was supported out of the box just as local 
filesystem, I don't see being a contrib being much of a win. It saves a few 
megabytes when we are already well over 100 - and that's if you are willing to 
pull it apart after you download it. From what I've seen, even with the *huge* 
extract contrib, most people don't bother repackaging. It's hard to imagine 
they would for a few megabytes. 

bq. Cache and BlockCache imple

We have done some casual benchmarking - loading tweets at a high rate of speed 
while sending queries at a high rate of speed with 1 second NRT - essentially 
the worst case NRT scenerio. By and large, performance has been similiar to 
local filesystem performance. We will likely share some numbers when we have 
some less casual results. You do of course have to warm up the block cache 
before it really kicks in.

In terms of impl, as I mentioned, the orig HdfsDirectory comes from the Blur 
guys - we tried not to change it too much currently - not until we figure out 
if we might evolve it with them in the future - eg as a Lucene module or 
something. 
                
> Add support to write and read Solr index files and transaction log files to 
> and from HDFS.
> ------------------------------------------------------------------------------------------
>
>                 Key: SOLR-4916
>                 URL: https://issues.apache.org/jira/browse/SOLR-4916
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Mark Miller
>            Assignee: Mark Miller
>         Attachments: SOLR-4916.patch
>
>


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to