内容简介

Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and analysts looking for a comprehensive view of this practice. With this practical book, you'll learn how to plan and build systems to serve the needs of your organization and customers by evaluating the best technologies available in the framework of the data engineering lifecycle.

Authors Joe Reis and Matt Housley walk you through the data engineering lifecycle and show you how to stitch together a variety of cloud technologies to serve the needs of downstream data consumers. You'll understand how to apply the concepts of data generation, ingestion, orchestration, transformation, storage, governance, and deployment that are critical in any data environment regardless of the underlying technology.

This book will help you:

Assess data engineering problems using an end-to-end data framework of best practices

Cut through marketing hype when choosing data technologies, architecture, and processes

Use the data engineering lifecycle to design and build a robust architecture

Incorporate data governance and security across the data engineering lifecycle


Joe Reis is a business-minded data nerd who’s worked in the data industry for 20 years, with responsibilities ranging from statistical modeling, forecasting, machine learning, data engineering, data architecture, and almost everything else in between. Joe is the CEO and Co-Founder of Ternary Data, a data engineering and architecture consulting firm based in Salt Lake City, Utah...

下载地址

豆瓣评论

  • Lo
    当overview看看还行,细节太浅了2024-02-11
  • elfish
    全面也不过时,但纯理论概念总则介绍没有case study实例设计。也许更适合有相关工作经验的人阅读。先看Part II再看Part I更好些。推荐油管上CMU的15-721 Ad DBS视频:https://15721.courses.cs.cmu.edu/spring2023/。2023-07-27
  • 3点一直线
    这本书很详细。 很高屋建瓴, 主要讲了一些大方向和技术选型 技术栈的选择, 而且很新, 和业界很贴切。 唯一的问题就是有点抽象。 可能适合没在这一行工作的人, 了解日常工作内容? 或者作为tech lead 做技术选型。 对于一线工作的, 这个有点太overview了, 没实例内容, 都要靠自己感悟。2023-02-27

猜你喜欢

大家都喜欢