[ 
https://issues.apache.org/jira/browse/FLINK-5129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17472435#comment-17472435
 ] 

lupan commented on FLINK-5129:
------------------------------

Does blob.storage.directory already support AWS S3 ?

When I use the following configuration:
{code:java}
blob.storage.directory: s3://iceberg-bucket/flink/blob {code}
I get the following error:
{code:java}
taskmanager    | 2022-01-11 02:41:11,460 ERROR 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner      [] - Terminating 
TaskManagerRunner with exit code 1.
taskmanager    | org.apache.flink.util.FlinkException: Failed to start the 
TaskManagerRunner.
taskmanager    |        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManager(TaskManagerRunner.java:374)
 ~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.lambda$runTaskManagerProcessSecurely$3(TaskManagerRunner.java:413)
 ~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.security.contexts.NoOpSecurityContext.runSecured(NoOpSecurityContext.java:28)
 ~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManagerProcessSecurely(TaskManagerRunner.java:413)
 [flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManagerProcessSecurely(TaskManagerRunner.java:396)
 [flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.main(TaskManagerRunner.java:354)
 [flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    | Caused by: java.io.IOException: Could not create storage 
directory for BLOB store in 's3:/iceberg-bucket/flink/blob'.
taskmanager    |        at 
org.apache.flink.runtime.blob.BlobUtils.initLocalStorageDirectory(BlobUtils.java:139)
 ~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.blob.AbstractBlobCache.<init>(AbstractBlobCache.java:89)
 ~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.blob.PermanentBlobCache.<init>(PermanentBlobCache.java:93)
 ~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.blob.BlobCacheService.<init>(BlobCacheService.java:55) 
~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.<init>(TaskManagerRunner.java:169)
 ~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        at 
org.apache.flink.runtime.taskexecutor.TaskManagerRunner.runTaskManager(TaskManagerRunner.java:367)
 ~[flink-dist_2.12-1.13.3.jar:1.13.3]
taskmanager    |        ... 5 more{code}

> make the BlobServer use a distributed file system
> -------------------------------------------------
>
>                 Key: FLINK-5129
>                 URL: https://issues.apache.org/jira/browse/FLINK-5129
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Network
>            Reporter: Nico Kruber
>            Assignee: Nico Kruber
>            Priority: Major
>             Fix For: 1.3.0
>
>         Attachments: image-2022-01-11-11-27-59-280.png
>
>
> Currently, the BlobServer uses a local storage and, in addition when the HA 
> mode is set, a distributed file system, e.g. hdfs. This, however, is only 
> used by the JobManager and all TaskManager instances request blobs from the 
> JobManager. By using the distributed file system there as well, we would 
> lower the load on the JobManager and increase scalability.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to