Data Pipelines Pocket Reference: Moving and Processing Data for Analytics
Title: Data Pipelines Pocket Reference: Moving and Processing Data for Analytics (Final)
Author: James Densmore
Publisher: O’Reilly Media
Year: 2021-02-10
Format: epub/mobi/pdf(conv.)
Pages: 276
Size: 10.3 Mb
Language: English
Data pipelines are the foundation for success in data analytics and machine learning. Moving data from many diverse sources and processing it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in the modern data stack.
You’ll learn common considerations and key decision points when implementing pipelines, such as data pipeline design patterns, data ingestion implementation, data transformation, the orchestration of pipelines, and build-versus-buy decision making. This book addresses the most common decisions made by data professionals and discusses foundational concepts that apply to open source frameworks, commercial products, and homegrown solutions.
You’ll learn:
- What a data pipeline is and how it works
- How data is moved and processed on modern data infrastructure, including cloud platforms
- Common tools and products used by data engineers to build pipelines
- How pipelines support machine learning and analytics needs
- Considerations for pipeline maintenance, testing, and alerting
Data Analysis with Python and PySpark (MEAP) Title: Data Analysis with Python and PySpark (MEAP) Author: Jonathan Rioux Publisher: Manning Publications Year: 2020 Pages: 259 Language:...
Data Pipelines with Apache Airflow (MEAP) V4 Title: Data Pipelines with Apache Airflow (MEAP) Author: Bas P. Harenslak and Julian Rutger de Ruiter Publisher: Manning Publications Year: 2020...
Modern Big Data Processing with Hadoop Title: Modern Big Data Processing with Hadoop Author: V. Naresh Kumar, Prashant Shindgikar Publisher: Packt Publishing Year: 2018 ISBN:...