Apache Iceberg is a remarkable open-source format designed for massive analytic tablesIt brings the reliability and simplicity of SQL tables to big data, while allowing engines like Spark, Trino, Flink, Presto, Hive, Impala, StarRocks, Doris, and Pig to safely work with the same tables simultaneously123.

Here are some key features of Apache Iceberg:

  1. High-Performance Format:
  2. SQL-Like Commands:
  3. Schema Evolution:
  4. Hidden Partitioning:
  5. Time Travel and Rollback:
  6. Data Compaction:

Configuration

Troubleshooting