Название: Data Pipelines with Apache Airflow (Final) Автор: Bas P. Harenslak, Julian Rutger de Ruiter Издательство: Manning Publications Co Год: 2021 Формат: True PDF Страниц: 482 Размер: 15.7 Mb Язык: English
Data Pipelines with Apache Airflow teaches you the ins-and-outs of the Directed Acyclic Graphs (DAGs) that power Airflow, and how to write your own DAGs to meet the needs of your projects. With complete coverage of both foundational and lesser-known features, when you’re done you’ll be set to start using Airflow for seamless data pipeline development and management. Pipelines can be challenging to manage, especially when your data has to flow through a collection of application components, servers, and cloud services. Airflow lets you schedule, restart, and backfill pipelines, and its easy-to-use UI and workflows with Python scripting has users praising its incredible flexibility. Data Pipelines with Apache Airflow takes you through best practices for creating pipelines for multiple tasks, including data lakes, cloud deployments, and data science. Data Pipelines with Apache Airflow teaches you the ins-and-outs of the Directed Acyclic Graphs (DAGs) that power Airflow, and how to write your own DAGs to meet the needs of your projects. With complete coverage of both foundational and lesser-known features, when you’re done you’ll be set to start using Airflow for seamless data pipeline development and management.
Data Pipelines with Apache Airflow (MEAP) V4 Название: Data Pipelines with Apache Airflow (MEAP) Автор: Bas P. Harenslak and Julian Rutger de Ruiter Издательство: Manning Publications Год: 2020...
Modern Big Data Processing with Hadoop Название: Modern Big Data Processing with Hadoop Автор: V. Naresh Kumar, Prashant Shindgikar Издательство: Packt Publishing Год: 2018 ISBN:...