rdblue commented on a change in pull request #1390:
URL: https://github.com/apache/iceberg/pull/1390#discussion_r478764395



##########
File path: site/docs/spark-structured-streaming.md
##########
@@ -76,31 +76,9 @@ Each micro-batch written to a table produces a new snapshot, 
which are tracked i
 
 ### Compacting data files
 
-The amount of data written in a micro batch is typically small, which can 
cause the table metadata to track lots of small files. Compacting small files 
into larger files reduces the metadata needed by the table, and increases query 
efficiency.
+The amount of data written in a micro batch is typically small, which can 
cause the table metadata to track lots of small files. [Compacting small files 
into larger files](../maintenance#compact-data-files) reduces the metadata 
needed by the table, and increases query efficiency.

Review comment:
       This looks good, thank you!




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to