Hands-On Machine Learning Recommender Systems with Apache Spark


Title: Hands-On Machine Learning Recommender Systems with Apache Spark: Build a real Artificial Intelligence solution with real data
Author: Ernesto Lee
Publisher: Consultants Network
Year: 2020
Pages: 118
Language: English
Format: pdf, epub
Size: 12.1 MB

This book is an introduction to recommender systems using Apache Spark and Machine Learning. Before turning to recommender systems, we define Big Data and Machine Learning. We then dive directly into our use case and show you, step by step, how to build a recommender system with Apache Spark and Machine Learning.

Apache Spark is an open-source, fast, unified engine for parallel, large-scale data processing. It provides a programming framework for distributed, high-speed processing of large datasets. Spark offers APIs in popular programming languages such as Java, Python, Scala, and R. Spark is scalable: it can run on anything from a single desktop machine or laptop to a cluster of thousands of servers. Spark ships with a set of built-in libraries for analyzing large datasets. If your requirements exceed what the built-in libraries offer, you can write your own library or explore the many external libraries published by open-source communities.
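The idea of splitting a dataset into partitions, processing them in parallel, and merging the partial results can be sketched in plain Python. This is only a single-machine illustration under stated assumptions, not Spark's API: the function names (`split_into_partitions`, `count_words`, `merge_counts`, `parallel_word_count`) and the sample lines are hypothetical, and threads stand in for the worker nodes of a cluster.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def split_into_partitions(records, num_partitions):
    """Deal the records round-robin into num_partitions lists."""
    return [records[i::num_partitions] for i in range(num_partitions)]


def count_words(partition):
    """'Map' step: count the words inside one partition."""
    counts = Counter()
    for line in partition:
        counts.update(line.lower().split())
    return counts


def merge_counts(left, right):
    """'Reduce' step: combine two partial word counts."""
    return left + right


def parallel_word_count(records, num_partitions=4):
    """Count words per partition in parallel, then merge the partial results."""
    partitions = split_into_partitions(records, num_partitions)
    # Threads play the role of cluster workers in this sketch.
    with ThreadPoolExecutor(max_workers=num_partitions) as pool:
        partials = list(pool.map(count_words, partitions))
    return reduce(merge_counts, partials, Counter())


# Hypothetical sample data, invented for this illustration.
lines = [
    "spark processes large datasets",
    "spark runs on a laptop or a cluster",
    "large datasets are split into partitions",
]
word_counts = parallel_word_count(lines, num_partitions=2)
```

Because each partition is counted independently, the same logic works whether the partitions sit on one laptop or on many machines; an engine such as Spark takes care of distributing the partitions and collecting the results.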

Why use Spark when we have Hadoop? Spark excels as a unified platform that processes very large volumes of data at high speed and covers a wide range of data processing requirements. It is an in-memory processing framework and is often described as the successor to Apache Hadoop. Let us briefly discuss the advantages of Spark over Hadoop.

The Hadoop ecosystem relied on a separate framework for each kind of data processing. As a developer, you could use the MapReduce framework to analyze your data in a programming language of your choice, such as Java, C++, or Python. However, a data warehouse engineer who was a SQL expert had to learn one of these languages to use MapReduce. To overcome this problem, a new framework called "Hive", which runs on top of Hadoop, was introduced. ETL processing posed a similar problem, so "Pig" was introduced. Likewise, "Giraph" and "Mahout" were introduced for graph processing and Machine Learning respectively. Moreover, Hadoop was used only for batch processing and offered no way to process data in real time, so a framework called "Storm" was used alongside Hadoop to work with streaming data.

With so many frameworks, organizations had a tough time maintaining them all and tracking the issues pertaining to each. Fortunately, this changed with the advent of Spark. As mentioned, Spark is a unifying platform that provides all of these capabilities in one package with four major components (Spark SQL, Spark Streaming, MLlib, and GraphX).

Now, what does in-memory processing actually mean? Aren't all applications processed in memory? Yes, every application works on data in memory and writes the results back to disk when processing is done. The difference is that Spark can process data in memory and then either retain it in memory for later steps or write it to disk.
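The benefit of retaining data in memory can be illustrated with a small plain-Python sketch. It is not Spark code: `CachedDataset`, `load_ratings`, and the sample ratings are hypothetical names and data invented for this illustration. The class reads a file from disk once and serves every later pass from memory, which is roughly the behavior that Spark exposes through `cache()` and `persist()` on its datasets.

```python
import json
import os
import tempfile


def load_ratings(path):
    """Read rating records from a JSON-lines file on disk."""
    with open(path, encoding="utf-8") as handle:
        return [json.loads(line) for line in handle]


class CachedDataset:
    """Hypothetical helper: load from disk once, then serve from memory."""

    def __init__(self, path):
        self.path = path
        self.disk_reads = 0      # how many times the file was actually read
        self._records = None     # in-memory copy, filled on first access

    def records(self):
        if self._records is None:
            self._records = load_ratings(self.path)
            self.disk_reads += 1
        return self._records

    def unpersist(self):
        """Drop the in-memory copy; the next access reads from disk again."""
        self._records = None


# Hypothetical sample data, invented for this illustration.
ratings = [
    {"user": "u1", "item": "i1", "rating": 4.0},
    {"user": "u1", "item": "i2", "rating": 2.0},
    {"user": "u2", "item": "i1", "rating": 5.0},
]

with tempfile.TemporaryDirectory() as workdir:
    path = os.path.join(workdir, "ratings.jsonl")
    with open(path, "w", encoding="utf-8") as handle:
        for row in ratings:
            handle.write(json.dumps(row) + "\n")

    dataset = CachedDataset(path)
    # Several passes over the same data trigger only one disk read.
    mean_rating = sum(r["rating"] for r in dataset.records()) / len(dataset.records())
    users = {r["user"] for r in dataset.records()}
    disk_reads = dataset.disk_reads
```

Iterative workloads such as training a recommender model pass over the same ratings many times, so avoiding a disk read on every pass is where an in-memory engine gains most of its speed.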


Posted by: Ingvar16, 16-12-2021, 21:39


