AnandInguva commented on code in PR #24062:
URL: https://github.com/apache/beam/pull/24062#discussion_r1020777669


##########
sdks/python/apache_beam/ml/inference/pytorch_inference.py:
##########
@@ -100,6 +112,21 @@ def _convert_to_result(
   return [PredictionResult(x, y) for x, y in zip(batch, predictions)]
 
 
+def default_tensor_inference_fn(
+    batch: Sequence[torch.Tensor],
+    model: torch.nn.Module,
+    device: str,
+    inference_args: Optional[Dict[str,
+                                  Any]] = None) -> Iterable[PredictionResult]:
+  # torch.no_grad() mitigates GPU memory issues
+  # https://github.com/apache/beam/issues/22811
+  with torch.no_grad():
+    batched_tensors = torch.stack(batch)
+    batched_tensors = _convert_to_device(batched_tensors, device)
+    predictions = model(batched_tensors, **inference_args)
+    return _convert_to_result(batch, predictions)

Review Comment:
   Whatever we do, we should make sure we don't instantiate the model before 
pipeline startup. If the model is large, instantiating the model object takes 
a lot of time, and sometimes the pipeline won't start at all because we hit a 
quota limit 
(https://cloud.google.com/dataflow/quotas#:~:text=The%20default%20disk%20size%20is,for%20Dataflow%20Shuffle%20batch%20pipelines).
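
   For context, here is a minimal sketch (not part of this PR; the model class and 
weights path are made up) of how model instantiation stays out of pipeline 
construction: only the model class, its constructor params, and a path to the 
weights are passed when the handler is built, and the `torch.nn.Module` is only 
created by the handler's `load_model()` on the workers.

```python
import apache_beam as beam
import torch
from apache_beam.ml.inference.base import RunInference
from apache_beam.ml.inference.pytorch_inference import PytorchModelHandlerTensor


# Hypothetical example model; the real model class would come from user code.
class LinearRegression(torch.nn.Module):
  def __init__(self, input_dim=1, output_dim=1):
    super().__init__()
    self.linear = torch.nn.Linear(input_dim, output_dim)

  def forward(self, x):
    return self.linear(x)


# No torch.nn.Module instance is created here, so the submitted pipeline stays
# small; the handler instantiates the model from the state dict on the workers.
model_handler = PytorchModelHandlerTensor(
    state_dict_path='gs://my-bucket/linear_regression.pt',  # hypothetical path
    model_class=LinearRegression,  # the class, not an instance
    model_params={'input_dim': 1, 'output_dim': 1})

with beam.Pipeline() as p:
  _ = (
      p
      | beam.Create([torch.tensor([1.0]), torch.tensor([2.0])])
      | RunInference(model_handler))
```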


