jerrypeng commented on a change in pull request #7255:
URL: https://github.com/apache/pulsar/pull/7255#discussion_r445346075

##########
File path: pulsar-functions/worker/src/main/java/org/apache/pulsar/functions/worker/FunctionMetaDataManager.java
##########
@@ -174,89 +181,124 @@ public synchronized boolean containsFunction(String tenant, String namespace, St
     }
 
     /**
-     * Sends an update request to the FMT (Function Metadata Topic)
-     * @param functionMetaData The function metadata that needs to be updated
-     * @return a completable future of when the update has been applied
+     * Called by the worker when we are in the leader mode. In this state, we update our in-memory
+     * data structures and then write to the metadata topic.
+     * @param functionMetaData The function metadata in question
+     * @param delete Is this a delete operation
+     * @throws IllegalStateException if we are not the leader
+     * @throws IllegalArgumentException if the request is out of date.
      */
-    public synchronized CompletableFuture<RequestResult> updateFunction(FunctionMetaData functionMetaData) {
-
-        FunctionMetaData existingFunctionMetadata = null;
-        if (containsFunction(functionMetaData.getFunctionDetails().getTenant(),
-                functionMetaData.getFunctionDetails().getNamespace(),
-                functionMetaData.getFunctionDetails().getName())) {
-            existingFunctionMetadata = getFunctionMetaData(functionMetaData.getFunctionDetails().getTenant(),
-                    functionMetaData.getFunctionDetails().getNamespace(),
-                    functionMetaData.getFunctionDetails().getName());
+    public synchronized void updateFunctionOnLeader(FunctionMetaData functionMetaData, boolean delete)
+            throws IllegalStateException, IllegalArgumentException {
+        if (exclusiveLeaderProducer == null) {
+            throw new IllegalStateException("Not the leader");
+        }
+        boolean needsScheduling;
+        if (delete) {
+            needsScheduling = proccessDeregister(functionMetaData);
+        } else {
+            needsScheduling = processUpdate(functionMetaData);
+        }
+        Request.ServiceRequest serviceRequest = Request.ServiceRequest.newBuilder()
+                .setServiceRequestType(delete ? Request.ServiceRequest.ServiceRequestType.DELETE : Request.ServiceRequest.ServiceRequestType.UPDATE)
+                .setFunctionMetaData(functionMetaData)
+                .setWorkerId(workerConfig.getWorkerId())
+                .setRequestId(UUID.randomUUID().toString())
+                .build();
+        try {
+            lastMessageSeen = exclusiveLeaderProducer.send(serviceRequest.toByteArray());
+        } catch (Exception e) {

Review comment:
       We should return the error to the worker making the call to the leader; otherwise the worker might have to wait for a timeout. I think we should just return an error and let the user retry. There is no guarantee that restarting the worker or electing another leader will solve the issue, since all the workers have the same configuration. Restarting can also be heavy, and I would prefer to keep the number of forced restarts to a minimum.
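
To make the suggestion concrete, here is a minimal sketch of what returning the error could look like. It is not the code in this PR: `LeaderUpdateSketch`, `MetadataWriter`, and `MetadataWriteException` are hypothetical names that stand in for the manager, the exclusive leader producer, and whatever error type ends up being reported back to the calling worker. The only point it illustrates is that a failed write is rethrown to the caller right away, so the caller can retry instead of waiting for a timeout or triggering a restart.

```java
final class LeaderUpdateSketch {

    /** Hypothetical stand-in for the exclusive leader producer. */
    interface MetadataWriter {
        long send(byte[] payload) throws Exception;
    }

    /** Hypothetical error type that is propagated back to the calling worker. */
    static final class MetadataWriteException extends RuntimeException {
        MetadataWriteException(String message, Throwable cause) {
            super(message, cause);
        }
    }

    private final MetadataWriter writer; // null means "not the leader" in this sketch
    private long lastMessageSeen = -1L;

    LeaderUpdateSketch(MetadataWriter writer) {
        this.writer = writer;
    }

    synchronized void updateOnLeader(byte[] serializedRequest) {
        if (writer == null) {
            throw new IllegalStateException("Not the leader");
        }
        try {
            lastMessageSeen = writer.send(serializedRequest);
        } catch (Exception e) {
            // Report the failure to the caller instead of forcing a restart;
            // the caller decides whether and when to retry.
            throw new MetadataWriteException("Failed to write to the metadata topic", e);
        }
    }

    synchronized long lastMessageSeen() {
        return lastMessageSeen;
    }

    public static void main(String[] args) {
        // A writer that always fails, to show the error reaching the caller.
        LeaderUpdateSketch manager = new LeaderUpdateSketch(payload -> {
            throw new java.io.IOException("broker unavailable");
        });
        try {
            manager.updateOnLeader(new byte[] {1, 2, 3});
        } catch (MetadataWriteException e) {
            System.out.println("update failed: " + e.getMessage());
        }
    }
}
```

In the real manager the in-memory state is updated before the write, so an implementation along these lines would also need to decide what happens to that state when the write fails; the sketch leaves that question open.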