[ 
https://issues.apache.org/jira/browse/HDDS-9898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17796740#comment-17796740
 ] 

Mohammad Arafat Khan commented on HDDS-9898:
--------------------------------------------

I tried replicating this locally on a docker setup. I created an EC container 
*Container 1* which was initially failing to register but later registering 
successfully due to the absence of a required pipeline. Here's a summary:
 # Initially, there were multiple warnings and errors indicating that Container 
1 couldn't be added due to a missing pipeline 
({*}PipelineID=68426b76-65ec-4663-be73-64ac4e1f8412{*}). The system recognized 
the need to register Container 1 but couldn't do so because the pipeline was 
missing.

 # Later in the log, a new pipeline with the required ID 
({*}PipelineID=68426b76-65ec-4663-be73-64ac4e1f8412{*}) was created or 
recognized by the system.

 # After the pipeline was established, Container 1 was successfully added to 
the system.

In summary, the issue was resolved when the missing pipeline was created, 
allowing Container 1 to register successfully. This might have been caused by a 
delay in pipeline creation by the *{{PipeLineSyncTask}}* of R{*}econ{*} for EC 
containers in recon.

What we need to investigate now is that why is the delay happening for EC 
container pipelines and not the ratis pipelines.

> Recon SCM doesn't store EC containers
> -------------------------------------
>
>                 Key: HDDS-9898
>                 URL: https://issues.apache.org/jira/browse/HDDS-9898
>             Project: Apache Ozone
>          Issue Type: Bug
>            Reporter: Christos Bisias
>            Assignee: Mohammad Arafat Khan
>            Priority: Major
>
> While working on an integration test for 
> https://issues.apache.org/jira/browse/HDDS-9645
> I created an EC container and then tried to compare SCM to Recon. Recon-SCM 
> wasn't picking up the container and I noticed this ERROR in the logs.
> {code:java}
> 2023-12-11 15:57:38,663 [Recon-FixedThreadPoolWithAffinityExecutor-0-0] ERROR 
> container.ContainerReportHandler 
> (ContainerReportHandler.java:processSingleReplica(228)) - Received container 
> report for an unknown container 3 from datanode 
> 7501f622-8ca7-4a47-a65e-e2b3d77723c4(localhost/127.0.0.1).{code}
> I tested it with the docker env on master and Recon doesn't store EC 
> containers. When creating EC containers Recon's SCM containers table is empty.
> {code:java}
> bash-4.2$ ozone freon ockg -t 10 -n 10 --type=EC -r=rs-3-2-1024k
> ...
> ...
> ...
> bash-4.2$ ozone debug ldb --db=/data/metadata/recon-scm.db scan 
> --cf=containers
> {  } {code}
> Recon UI also displays 0 containers. When creating Ratis containers, it works
> {code:java}
> bash-4.2$ ozone freon ockg -t 10 -n 10 
> ...
> ...
> ...
> Total execution time (sec): 2
> Failures: 0
> Successful executions: 10
> bash-4.2$ ozone debug ldb --db=/data/metadata/recon-scm.db scan 
> --cf=containers
> { {
>   "id" : 6
> }: {
>   "state" : "OPEN",
>   "stateEnterTime" : 1702302641.257000000,
>   "replicationConfig" : {
>     "replicationFactor" : "ONE",
>     "replicationType" : "RATIS"
>   },
>   "usedBytes" : 0,
>   "numberOfKeys" : 0,
>   "lastUsed" : 1702302648.723516000,
>   "owner" : "omServiceIdDefault",
>   "containerID" : 6,
>   "deleteTransactionId" : 0,
>   "sequenceId" : 0
> }
> , {
> ...{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to