elangelo commented on code in PR #4023: URL: https://github.com/apache/solr/pull/4023#discussion_r3200324979
########## solr/solr-ref-guide/modules/deployment-guide/pages/backup-restore.adoc: ########## @@ -396,6 +396,39 @@ Any children under the `<repository>` tag are passed as additional configuration Information on each of the repository implementations provided with Solr is provided below. +=== Parallel File Transfers + +Backup and restore operations can transfer multiple index files in parallel to improve throughput, especially when using cloud storage repositories like S3 or GCS where latency is higher. +The parallelism is controlled via system properties or environment variables: + +`solr.backup.maxparalleluploads`:: ++ +[%autowidth,frame=none] +|=== +|Optional |Default: `1` +|=== ++ +Maximum number of index files to upload in parallel during backup operations. +Can also be set via the `SOLR_BACKUP_MAXPARALLELUPLOADS` environment variable. +Increasing this value can significantly improve backup throughput when using cloud storage (S3, GCS), but too high a value will increase IOPS and bandwidth pressure on your cluster. +Start small and increase based on observed throughput and available resources. + +`solr.backup.maxparalleldownloads`:: ++ +[%autowidth,frame=none] +|=== +|Optional |Default: `1` +|=== ++ +Maximum number of index files to download in parallel during restore operations. +Can also be set via the `SOLR_BACKUP_MAXPARALLELDOWNLOADS` environment variable. +Increasing this value can significantly improve restore throughput when using cloud storage (S3, GCS), but too high a value will increase IOPS and bandwidth pressure on your cluster. +Start small and increase based on observed throughput and available resources. + +TIP: Both settings share a single global thread pool per property, so the configured limit applies across all concurrent backup or restore operations on the node. Review Comment: fixed ########## solr/solr-ref-guide/modules/deployment-guide/pages/backup-restore.adoc: ########## @@ -396,6 +396,39 @@ Any children under the `<repository>` tag are passed as additional configuration Information on each of the repository implementations provided with Solr is provided below. +=== Parallel File Transfers + +Backup and restore operations can transfer multiple index files in parallel to improve throughput, especially when using cloud storage repositories like S3 or GCS where latency is higher. +The parallelism is controlled via system properties or environment variables: + +`solr.backup.maxparalleluploads`:: ++ +[%autowidth,frame=none] +|=== +|Optional |Default: `1` +|=== ++ +Maximum number of index files to upload in parallel during backup operations. +Can also be set via the `SOLR_BACKUP_MAXPARALLELUPLOADS` environment variable. +Increasing this value can significantly improve backup throughput when using cloud storage (S3, GCS), but too high a value will increase IOPS and bandwidth pressure on your cluster. +Start small and increase based on observed throughput and available resources. Review Comment: fixed -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
