uros-b commented on code in PR #56473:
URL: https://github.com/apache/spark/pull/56473#discussion_r3444956457


##########
python/docs/source/tutorial/sql/python_data_source.rst:
##########
@@ -238,7 +238,7 @@ This is a dummy streaming data reader that generates 2 rows 
in every microbatch.
             """
             return {"offset": 0}
 
-        def latestOffset(self) -> dict:
+        def latestOffset(self, start: dict, limit: ReadLimit) -> dict:

Review Comment:
   Minor note - please check: the introductory FakeStreamReader.latestOffset is 
now annotated limit: ReadLimit, but no preceding tutorial code block imports 
ReadLimit (the shared "putting it together" import block at lines 120-126 omits 
it; only the later Admission-Control block at line 350 and the new AvailableNow 
block at line 407 import it). A reader copy-running this introductory example 
standalone hits NameError: name 'ReadLimit' is not defined. That example's body 
never uses limit (it just increments self.current), so either import ReadLimit 
in its preamble or leave the introductory signature parameterless / use a plain 
annotation.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to