How Good Is Parquet for Wide Tables (Machine Learning Workloads) Really?

In this blog post, we quantify the metadata overhead of Apache Parquet files for storing thousands of columns, as well as space and decode time using parquet-rs, implemented in Rust.

Read more here: External Link