Apache Parquet is a columnar storage format for Hadoop.
Parquet was created to make the advantages of compressed, efficient columnar data representation available to any project in the Hadoop ecosystem, regardless of the choice of data processing framework, data model, or programming language.
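As a rough illustration of why a columnar representation can be compact and efficient, the sketch below pivots a few records from a row layout into a column layout using only the Python standard library. It is a toy model, not Parquet's actual on-disk format or API: the records, the field names, and the helpers `to_columns` and `run_length_encode` are all hypothetical and exist only for this example.

```python
# Hypothetical records; the field names are illustrative only.
rows = [
    {"id": 1, "country": "US", "clicks": 10},
    {"id": 2, "country": "US", "clicks": 7},
    {"id": 3, "country": "DE", "clicks": 3},
]


def to_columns(records):
    """Pivot a list of row dicts into a dict of column lists."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


def run_length_encode(values):
    """Collapse consecutive repeated values into [value, count] pairs."""
    runs = []
    for value in values:
        if runs and runs[-1][0] == value:
            runs[-1][1] += 1
        else:
            runs.append([value, 1])
    return runs


columns = to_columns(rows)

# An aggregate over one field touches only that column's values.
total_clicks = sum(columns["clicks"])

# Values of the same type sit next to each other, so repeats compress well.
country_runs = run_length_encode(columns["country"])
```

In this toy layout, summing `clicks` never reads `id` or `country`, and the repeated `country` values collapse into two runs. Those are the kinds of advantages a columnar layout is meant to provide, though a real format involves considerably more machinery than this sketch.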