BigQuery - serverless, scalable, cost-effective data warehouse

Quick SQL (Standard SQL)

-- table: myproj.mydataset.groceries
-- alias g, return all columns
SELECT g.*
FROM `myproj.mydataset.groceries` AS g
LIMIT 100;

Dataflow - serverless data processing (batch + streaming)

Typical pipeline (concept)

Sources: Datastore, Pub/Sub, Kafka/Avro, etc. → Dataflow (Beam) transform/enrich → Sinks: BigQuery, Vertex AI, Cloud Bigtable → Data Studio (real-time dashboards)
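
Sketch of the source → transform/enrich → sink shape, in plain Python. This is NOT the Beam or Dataflow API, just a stand-in for the three stages. The sample events, field names (item, qty, bulk) and function names are all hypothetical; a dict stands in for a sink table.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator

# Hypothetical events, standing in for messages read from a source such as Pub/Sub.
RAW_EVENTS = [
    {"item": "apple", "qty": "3"},
    {"item": "milk", "qty": "1"},
    {"item": "apple", "qty": "2"},
    {"item": "bread", "qty": "oops"},  # malformed record, dropped in transform
]


def read_source(events: Iterable[dict]) -> Iterator[dict]:
    """Source stage: yield records one at a time, the way a stream would deliver them."""
    yield from events


def transform(records: Iterable[dict]) -> Iterator[dict]:
    """Transform/enrich stage: parse qty, skip malformed records, add a derived field."""
    for rec in records:
        try:
            qty = int(rec["qty"])
        except (KeyError, ValueError):
            continue  # skip records that cannot be parsed
        yield {"item": rec["item"], "qty": qty, "bulk": qty >= 3}


def write_sink(records: Iterable[dict]) -> Dict[str, int]:
    """Sink stage: total qty per item; the returned dict stands in for a table."""
    totals: Dict[str, int] = defaultdict(int)
    for rec in records:
        totals[rec["item"]] += rec["qty"]
    return dict(totals)


# Chain the stages; generators keep it record-at-a-time until the sink aggregates.
totals = write_sink(transform(read_source(RAW_EVENTS)))
print(totals)
```

In a real Dataflow job each stage would presumably be a Beam transform (read, Map/Filter, write) and the runner would handle scaling and streaming; the generator chain above only mirrors the data flow.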

Dataprep (Trifacta)

Dataproc - managed Spark/Hadoop