[
https://issues.apache.org/jira/browse/SOLR-7256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15319166#comment-15319166
]
Jan Høydahl commented on SOLR-7256:
-----------------------------------
Hari, I closed this issue due to inactivity for a year, and it appeared you
were happy with switching to ES. But feel free to re-open if you want to revive
this issue and contribute towards finding elegant ways to utilize multiple
disks natively.
If Solr was to spread the data, a first step could be to automatically spread
new cores across multiple disks, if multiple disks are configured. Another more
complicated way could perhaps be a {{MultiFSDirectoryFactory}} which always
writes new index segments to the disk with the most free space, and does the
book-keeping of where it put the files. If that is at all practically doable,
it would be a non-intrusive solution that people could plug in if needed and
otherwise leave out?
> Multiple data dirs
> ------------------
>
> Key: SOLR-7256
> URL: https://issues.apache.org/jira/browse/SOLR-7256
> Project: Solr
> Issue Type: New Feature
> Affects Versions: 4.10.3
> Environment: HDP 2.2 / HDP Search
> Reporter: Hari Sekhon
>
> Request to support multiple dataDirs as indexing a large collection fills up
> only one of many disks in modern servers (think colocating on Hadoop servers
> with many disks).
> While HDFS is another alternative, it results in poor performance and index
> corruption under high online indexing loads (SOLR-7255).
> While it should be possible to do multiple cores with different dataDirs,
> that could be very difficult to manage and not humanly scale well, so I think
> Solr should support use of multiple dataDirs natively.
> Regards,
> Hari Sekhon
> http://www.linkedin.com/in/harisekhon
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]