amol- commented on a change in pull request #49:
URL: https://github.com/apache/arrow-cookbook/pull/49#discussion_r700212307
##########
File path: python/source/io.rst
##########
@@ -497,3 +497,40 @@ the parquet file as :class:`ChunkedArray`
pyarrow.Table
col1: int64
ChunkedArray = 0 .. 99
+
+Reading Line Delimited JSON
+===========================
+
+Arrow has builtin support for line-delimited JSON.
+Each line represents a row of data as a JSON object.
+
+Given some data in a file where each line is a JSON object
+containing a row of data:
+
+.. testcode::
+
+ import tempfile
+
+ with tempfile.NamedTemporaryFile(delete=False, mode="w+") as f:
+ f.write('{"a": 1, "b": 2.0, "c": 1}\n')
+ f.write('{"a": 3, "b": 3.0, "c": 2}\n')
+ f.write('{"a": 5, "b": 4.0, "c": 3}\n')
+ f.write('{"a": 7, "b": 5.0, "c": 4}\n')
Review comment:
I think that when nulls change the behaviour of something and you need
special code to deal with them we should have dedicated recipes. But in this
case having nulls in the data shouldn't change how you deal with it, so I don't
think there is much value in adding nulls
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]