[ 
https://issues.apache.org/jira/browse/SOLR-7256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15318574#comment-15318574
 ] 

Hari Sekhon commented on SOLR-7256:
-----------------------------------

FYI this was co-located on a Hadoop cluster where Raid would have meant 
destroying the existing hdfs data and making it unsuitable for Hadoop cluster 
node usage and conversely storing the indices on HDFS resulted in severe 
performance degradation, eg. SOLR-7393 - which is why the Elastic.co folks 
never wanted to put their indices on HDFS as they had reported similar 
performances issues.

> Multiple data dirs
> ------------------
>
>                 Key: SOLR-7256
>                 URL: https://issues.apache.org/jira/browse/SOLR-7256
>             Project: Solr
>          Issue Type: New Feature
>    Affects Versions: 4.10.3
>         Environment: HDP 2.2 / HDP Search
>            Reporter: Hari Sekhon
>
> Request to support multiple dataDirs as indexing a large collection fills up 
> only one of many disks in modern servers (think colocating on Hadoop servers 
> with many disks).
> While HDFS is another alternative, it results in poor performance and index 
> corruption under high online indexing loads (SOLR-7255).
> While it should be possible to do multiple cores with different dataDirs, 
> that could be very difficult to manage and not humanly scale well, so I think 
> Solr should support use of multiple dataDirs natively.
> Regards,
> Hari Sekhon
> http://www.linkedin.com/in/harisekhon



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to