Apache Iceberg: The Definitive Guide: Data Lakehouse Functionality, Performance, and Scalability on the Data Lake
by: Tomer Shiran (Author),Jason Hughes(Author),Alex Merced(Author)&0more
Publisher:O’Reilly Media
Edition:1st
Publication Date: June 11, 2024
Language:English
Print Length:341 pages
ISBN-10:1098148622
ISBN-13:9781098148621
Book Description
Traditional data architecture patterns are severely limited. To use these patterns, you have to ETL data into each tool—a cost-prohibitive process for making warehouse features available to all of your data. The lack of flexibility with these patterns requires you to lock into a set of priority tools and formats, which creates data silos and data drift. This practical book shows you a better way. Apache Iceberg provides the capabilities, performance, scalability, and savings that fulfill the promise of an open data lakehouse. By following the lessons in this book, you’ll be able to achieve interactive, batch, machine learning, and streaming analytics with this high-performance open source format. Authors Tomer Shiran, Jason Hughes, and Alex Merced from Dremio show you how to get started with Iceberg. With this book, you’ll learn: The architecture of Apache Iceberg tables What happens under the hood when you perform operations on Iceberg tables How to further optimize Iceberg tables for maximum performance How to use Iceberg with popular data engines such as Apache Spark, Apache Flink, and Dremio Discover why Apache Iceberg is a foundational technology for implementing an open data lakehouse.
About the Author
Traditional data architecture patterns are severely limited. To use these patterns, you have to ETL data into each tool—a cost-prohibitive process for making warehouse features available to all of your data. The lack of flexibility with these patterns requires you to lock into a set of priority tools and formats, which creates data silos and data drift. This practical book shows you a better way. Apache Iceberg provides the capabilities, performance, scalability, and savings that fulfill the promise of an open data lakehouse. By following the lessons in this book, you’ll be able to achieve interactive, batch, machine learning, and streaming analytics with this high-performance open source format. Authors Tomer Shiran, Jason Hughes, and Alex Merced from Dremio show you how to get started with Iceberg. With this book, you’ll learn: The architecture of Apache Iceberg tables What happens under the hood when you perform operations on Iceberg tables How to further optimize Iceberg tables for maximum performance How to use Iceberg with popular data engines such as Apache Spark, Apache Flink, and Dremio Discover why Apache Iceberg is a foundational technology for implementing an open data lakehouse. Read more
Apache Iceberg: The Definitive Guide: Data Lakehouse Functionality, Performance, and Scalability on the Data Lake
未经允许不得转载:电子书百科大全 » Apache Iceberg: The Definitive Guide: Data Lakehouse Functionality, Performance, and Scalability on the Data Lake
相关推荐
- 100 Facts About Artificial Intelligence: English to Spanish (100 Facts Language Learning Series) (Spanish Edition)
- Cyber Security From Beginner To Expert Cyber Security Made Easy For Absolute Beginners
- SPSS For Beginners: An Illustrative Step-by-Step Approach to Analyzing Statistical data
- Learn to Code: Learn HTML, CSS and JavaScript and build a website, an app and a game
- Pro Angular 16
- Oracle Linux Cookbook: Embrace Oracle Linux and master Linux Server management
- Developing Blockchain Solutions in the Cloud: Design and develop blockchain-powered Web3 apps on AWS, Azure, and GCP
- Android Programming for Beginners: Learn All the Java and Android Skills You Need to Start Making Powerful Mobile Applications
电子书百科大全
评论前必须登录!
立即登录 注册