Tag: Apache Kudu

Performance comparison of different file formats and storage engines in the Apache Hadoop ecosystem

Performance comparison of different file formats and storage engines in the Apache Hadoop ecosystem

TOPIC This post presents a performance comparison of few popular data formats and storage engines available in the Apache Hadoop ecosystem: Apache Avro, Apache Parquet, Apache HBase and Apache Kudu on the field of space efficiency, ingestion performance, analytic scans and random data lookup. This should help in understanding how (and when) each of them … Continue reading Performance comparison of different file formats and storage engines in the Apache Hadoop ecosystem

Advertisements