Presentation: Snowflake Architecture: Building a Data Warehouse for the Cloud
This presentation is now available to view on InfoQ.com
Abstract
At Snowflake, we wanted to architect a data warehouse from the ground up to leverage all the benefits of the cloud. Unlike shared-storage architectures that tie storage and compute together, we built a single integrated system with fully independent scaling for compute, storage and services. In the storage layer, we split data into micro-partitions and extract metadata for efficient query processing. At the compute layer, multiple virtual warehouses in separate compute clusters can simultaneously operate on the same data, giving high availability, performance isolation, scalability and concurrency. Virtual warehouses can also be automatically scaled up and down based on workload and performance.
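The idea of extracting metadata from micro-partitions for efficient query processing can be illustrated with a small sketch. This is not Snowflake's implementation: it assumes each micro-partition records only the minimum and maximum of a single integer column, and the names `MicroPartition`, `build_partitions`, and `scan_range` are hypothetical. A range query consults that metadata first and skips any partition whose value range cannot overlap the filter.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MicroPartition:
    """Hypothetical immutable chunk of one column plus its min/max metadata."""
    rows: tuple
    min_value: int
    max_value: int


def build_partitions(values, size):
    """Split values into fixed-size chunks and record min/max for each chunk."""
    partitions = []
    for start in range(0, len(values), size):
        chunk = tuple(values[start:start + size])
        partitions.append(MicroPartition(chunk, min(chunk), max(chunk)))
    return partitions


def scan_range(partitions, lo, hi):
    """Return values in [lo, hi] and the number of partitions actually read."""
    matches = []
    scanned = 0
    for part in partitions:
        # Metadata check: skip partitions whose range cannot overlap [lo, hi].
        if part.max_value < lo or part.min_value > hi:
            continue
        scanned += 1
        matches.extend(v for v in part.rows if lo <= v <= hi)
    return matches, scanned


partitions = build_partitions(list(range(100)), size=10)
matches, scanned = scan_range(partitions, 25, 34)
print(f"read {scanned} of {len(partitions)} partitions, {len(matches)} rows matched")
```

In this toy data set the values are sorted, so the min/max ranges do not overlap and most partitions are skipped. With unsorted data the ranges overlap more and pruning is less effective.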
This talk will cover the three pillars of the Snowflake architecture:
- Separating compute and storage to leverage abundant cloud compute resources
- Building an ACID-compliant database system on immutable storage
- Delivering a scalable multi-tenant data warehouse system as a service
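The second pillar, transactional behavior on top of immutable storage, can be sketched as follows. This is a simplified illustration, not the mechanism described in the talk: it assumes a table version is an immutable set of file identifiers, that files are never modified after being written, and that every change publishes a new version. The `Table` class and its methods are hypothetical names.

```python
class Table:
    """Toy table whose data files are never modified after being written."""

    def __init__(self):
        self._files = {}                # file id -> tuple of rows (immutable)
        self._versions = [frozenset()]  # each version is a set of file ids
        self._next_id = 0

    def _write_file(self, rows):
        file_id = self._next_id
        self._next_id += 1
        self._files[file_id] = tuple(rows)
        return file_id

    def snapshot(self):
        """Return the current version number; readers can pin it."""
        return len(self._versions) - 1

    def insert(self, rows):
        """Write a new file and publish a version that also includes it."""
        file_id = self._write_file(rows)
        self._versions.append(self._versions[-1] | {file_id})

    def delete_where(self, predicate):
        """Rewrite only the affected files; old files stay untouched."""
        new_version = set()
        for file_id in self._versions[-1]:
            rows = self._files[file_id]
            kept = [r for r in rows if not predicate(r)]
            if len(kept) == len(rows):
                new_version.add(file_id)                 # unchanged, reuse as is
            elif kept:
                new_version.add(self._write_file(kept))  # copy-on-write
        self._versions.append(frozenset(new_version))

    def read(self, version=None):
        """Read all rows visible in the given (or latest) version."""
        if version is None:
            version = self.snapshot()
        return sorted(r for f in self._versions[version] for r in self._files[f])


table = Table()
table.insert([1, 2, 3])
pinned = table.snapshot()
table.delete_where(lambda r: r == 2)
print(table.read(pinned), table.read())
```

Because a reader that pinned an earlier version keeps seeing exactly the files that version lists, a concurrent delete does not disturb it. A real system would also need atomic commit of the version list, conflict detection between writers, and cleanup of unreferenced files, none of which are modeled here.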