rdblue commented on a change in pull request #1390:
URL: https://github.com/apache/iceberg/pull/1390#discussion_r478764395
##########
File path: site/docs/spark-structured-streaming.md
##########
@@ -76,31 +76,9 @@ Each micro-batch written to a table produces a new snapshot,
which are tracked i
### Compacting data files
-The amount of data written in a micro batch is typically small, which can
cause the table metadata to track lots of small files. Compacting small files
into larger files reduces the metadata needed by the table, and increases query
efficiency.
+The amount of data written in a micro batch is typically small, which can
cause the table metadata to track lots of small files. [Compacting small files
into larger files](../maintenance#compact-data-files) reduces the metadata
needed by the table, and increases query efficiency.
Review comment:
This looks good, thank you!
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]