This is an automated email from the ASF dual-hosted git repository. alamb pushed a commit to branch main in repository https://gitbox.apache.org/repos/asf/datafusion-site.git
The following commit(s) were added to refs/heads/main by this push: new b48fd04 Fix typo in opening paragraph (#68) b48fd04 is described below commit b48fd04d64c9f1101f5b6db31638a930b1393759 Author: Andrew Lamb <and...@nerdnetworks.org> AuthorDate: Fri Apr 11 13:37:08 2025 -0400 Fix typo in opening paragraph (#68) Scale Factor 100 is 36GB not 3.6GB --- content/blog/2025-04-10-fastest-tpch-generator.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/blog/2025-04-10-fastest-tpch-generator.md b/content/blog/2025-04-10-fastest-tpch-generator.md index 7bf45d4..dc92097 100644 --- a/content/blog/2025-04-10-fastest-tpch-generator.md +++ b/content/blog/2025-04-10-fastest-tpch-generator.md @@ -42,7 +42,7 @@ faster than any other implementation we know of. It is now possible to create the TPC-H SF=100 dataset in 72.23 seconds (1.4 GB/s 😎) on a Macbook Air M3 with 16GB of memory, compared to the classic `dbgen` which takes 30 minutes<sup>1</sup> (0.05GB/sec). On the same machine, it takes less than -2 minutes to create all 3.6 GB of SF=100 in [Apache Parquet] format, which takes 44 minutes using [DuckDB]. +2 minutes to create all 36 GB of SF=100 in [Apache Parquet] format, which takes 44 minutes using [DuckDB]. It is finally convenient and efficient to run TPC-H queries locally when testing analytical engines such as DataFusion. --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@datafusion.apache.org For additional commands, e-mail: commits-h...@datafusion.apache.org