|
 |
|
 |
|
|
 |
|  |
|
Название: Apache Iceberg: The Definitive Guide: Data Lakehouse Functionality, Performance, and Scalability on the Data Lake Автор: Tomer Shiran, Jason Hughes, Alex Merced Издательство: O’Reilly Media, Inc. Год: 2024 Страниц: 479 Язык: английский Формат: pdf, epub (true) Размер: 14.0 MB
Traditional data architecture patterns are severely limited. To use these patterns, you have to ETL data into each tool—a cost-prohibitive process for making warehouse features available to all of your data. The lack of flexibility with these patterns requires you to lock into a set of priority tools and formats, which creates data silos and data drift. This practical book shows you a better way. Apache Iceberg provides the capabilities, performance, scalability, and savings that fulfill the promise of an open data lakehouse. By following the lessons in this book, you'll be able to achieve interactive, batch, Machine Learning, and streaming analytics with this high-performance open source format. Authors Tomer Shiran, Jason Hughes, and Alex Merced from Dremio show you how to get started with Iceberg. In these pages, you’ll learn what Apache Iceberg is, why it exists, how it works, and how to harness its power. Designed for data engineers, architects, scientists, and analysts working with large datasets across various use cases from BI dashboards to AI/ML, this book explores the core concepts, inner workings, and practical applications of Apache Iceberg. By the time you reach the end, you will have grasped the essentials and possess the practical knowledge to implement Apache Iceberg effectively in your data projects. Whether you are a newcomer or an experienced practitioner, Apache Iceberg: The Definitive Guide will be your trusted companion on this enlightening journey into Apache Iceberg. |
Разместил: Ingvar16 9-05-2024, 21:35 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
Название: Big Data Analytics: Theory, Techniques, Platforms, and Applications Автор: Umit Demirbaga, Gagangeet Singh Aujla, Anish Jindal Издательство: Springer Год: 2024 Страниц: 299 Язык: английский Формат: pdf (true), epub Размер: 39.8 MB
This book introduces readers to Big Data analytics. It covers the background to and the concepts of Big Data, Big Data analytics, and cloud computing, along with the process of setting up, configuring, and getting familiar with the Big Data analytics working environments in the first two chapters. The third chapter provides comprehensive information on Big Data processing systems - from installing these systems to implementing real-world data applications, along with the necessary codes. The next chapter dives into the details of Big Data storage technologies, including their types, essentiality, durability, and availability, and reveals their differences in their properties. The fifth and sixth chapters guide the reader through understanding, configuring, and performing the monitoring and debugging of Big Data systems and present the available commercial and open-source tools for this purpose. Chapter seven gives information about a trending Machine Learning, Bayesian network: a probabilistic graphical model, by presenting a real-world probabilistic application to understand causal, complex, and hidden relationships for diagnosis and forecasting in a scalable manner for Big Data. Special sections throughout the eighth chapter present different case studies and applications to help the readers to develop their Big Data analytics skills using various Big Data analytics frameworks. |
Разместил: Ingvar16 8-05-2024, 20:41 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
Название: IT Infrastructure: Security and Resilience Solutions Автор: Ralf Suss, Yannik Suss Издательство: Apress Год: 2024 Страниц: 352 Язык: английский Формат: pdf, epub (true) Размер: 31.3 MB
Embark on a comprehensive journey into the intricate world of IT infrastructure, with an in-depth look into the transformational role of secure, private data centers in today's digital era. This exploration uncovers the multi-faceted domains of IaaS, PaaS, and SaaS, examining the primary components of modern IT infrastructure—compute, storage, backup, and beyond. As technology continues to surge forward, cyber threats evolve in tandem, prompting a dire need for reinforced data center security and resilience. This book provides readers with a holistic, layered understanding of IT operations in our interconnected age. You will dive deep into the heart of technological advancements, appreciating the symbiotic relationship between evolving hardware capabilities and the progressive nature of cloud services. You will understand the intricacies of data center design, management, and the strategic role they play amid the growing reliance on both private and public clouds. Asindustries pivot towards a more digital-first approach, this book serves as a guiding star, illuminating the pathways, challenges, and opportunities of the vast IT infrastructure landscape. For IT professionals: from system administrators and network architects to IT managers and data center overseers, plus students and tech enthusiasts seeking deep insights into IT infrastructure. |
Разместил: Ingvar16 8-05-2024, 14:49 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
Название: AWS DevOps Engineer Professional Certification Guide: Hands-on guide to understand, analyze, and solve 150 scenario-based questions Автор: Sumit Kapoor Издательство: BPB Publications Год: 2024 Страниц: 608 Язык: английский Формат: epub Размер: 33.5 MB
Learn using Cloud data technologies for improving data analytics and decision-making capabilities for your organization. The AWS DevOps Engineer Professional Certification Guide is highly challenging and can significantly boost one's career. It features scenario-based questions with lengthy descriptions, making comprehension tough. This book focuses extensively on AWS Developer Tools, CloudFormation, Elastic Beanstalk, OpsWorks, and other crucial topics, representing the exam's domain. The readers can easily prepare for the AWS Certified DevOps Engineer - Professional exam with this guide drafted with a focus on managing infrastructure and applications on AWS. It covers secure version control with CodeCommit, automated code building with CodeBuild, and streamlined updates with CodeDeploy and CodePipeline. You will learn to create secure CI/CD pipelines and define AWS infrastructure and applications with CloudFormation. The readers will explore the management of multiple AWS accounts, security tools, and automation with OpsWorks and Elastic Beanstalk. You will also discover strategies for scalability, disaster recovery, monitoring with CloudWatch, and performance analysis with Kinesis Data Streams. Finally, you will learn to implement automated responses and security best practices with AWS Config and Inspector. Successfully passing this exam will help you gain advanced technical skills needed to become a DevOps subject matter expert and earn a good remuneration in the IT industry. |
Разместил: Ingvar16 7-05-2024, 19:59 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
Название: Open Data for Everybody: Using Open Data for Social Good Автор: Nathan Coyle Издательство: CRC Press Год: 2024 Страниц: 199 Язык: английский Формат: pdf (true) Размер: 10.4 MB
What if I told you something that could empower our third sector and activists to enhance their capacity? From gathering evidence for funding tenders to campaigning for crucial social issues and much more? It's called open data, yet many in social action remain unaware of it. Primarily shaped by corporate entities, open data seems tailored only for technologists, alienating the third sector. But in reality, it's a powerful tool for social change, bolstering civil society, and creating resilient communities. You probably don’t realize that you are using data right now. Take your phone, for example. Do you check the weather before you leave the house or even on the TV? Perhaps you’ll be looking to move home soon. If you have children, would you check out local schools or transport links? Maybe you’re considering going to university. How will you compare the cost of your course to a different one? If you’re lost, the first thing you’ll probably do is take out your phone and open your GPS map app, right? The possibilities are limitless, from tracking your fitness or weight control to planning which bus or train you’ll take the next morning. The answer to all those questions will most likely involve a website or app that uses data to operate and do its job. It is built on data—data that is changing in real time. Big Data—sometimes data can get really vast! It is essentially a huge digital repository of information, comprising extensive and intricate datasets that surpass the capacity of conventional data processing tools. It encompasses structured data, often found in databases, as well as unstructured data such as text, images, and real?time sensor readings collected from diverse sources. To extract valuable insights, detect patterns, and uncover trends within these massive datasets, organizations leverage advanced analytical techniques and cutting?edge technologies. |
Разместил: Ingvar16 7-05-2024, 15:55 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
Название: Excel BI and Dashboards in 7 Days: Build interactive dashboards for powerful data visualization and insights Автор: Jared Poli Издательство: BPB Publications Год: 2024 Страниц: 330 Язык: английский Формат: pdf, epub Размер: 19.2 MB
Using MS Excel for powerful exploration, manipulation, and data visualization. Everyone thinks of Excel differently, and its full potential is often untapped. Businesses tend to decide to invest heavily in proper BI tools, perhaps wrongfully assuming that Excel has a minimal role in that industry. Excel can be used effectively to collect, refresh, transform, and visualize your data in beautiful and eye-catching ways. This book covers building those skills and unlocking Excel and your potential in just seven days. The book explores the process of cleaning your data to ensure accuracy, using formulas to enhance and prepare the same for PivotTables. It will also help you understand how to use data visualization to create clear charts to communicate insights effectively and construct interactive dashboards for user exploration, including elements like slicers and timelines. The book also dives into discovering design principles for easy-to-understand dashboards, while gaining knowledge on maintaining and updating them for ongoing usability. Understanding the full power behind Excel will allow you to improve your spreadsheet game and prove that you can do it all with one industry standard tool and this book. This book is for everyone who wants to be a powerful user of Excel, finance teams, sales and marketing teams, MIS Analysts, BI aspirants, and all those who work with Excel sheets daily and want to refine that skill set into something more practical. |
Разместил: Ingvar16 6-05-2024, 20:37 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
Название: Digital and Technological Solutions: Exploring the foundations of digitization Автор: Faheem Syeed Masoodi, Zubair Sayeed Masoodi, Khalid Bashir Dar Издательство: BPB Publications Год: 2024 Страниц: 411 Язык: английский Формат: pdf, epub Размер: 10.1 MB
Understanding the basics of digital systems and technology is important in today’s rapidly evolving world. This book Digital and Technologicals Solutions: Exploring the foundations of digitization covers the essential concepts that form the backbone of digital systems. This book teaches digital systems, exploring number systems, logic gates, and computer architecture. It covers hardware, software (system and application), and operating systems. Network fundamentals like LANs, WANs, routers, and the internet are addressed. Information systems used in organizations, including e-commerce and digital marketing, are explained. The book examines digital payments (UPI, e-wallets) and cybersecurity measures. Finally, emerging technologies like cloud computing, big data, IoT, VR, blockchain, robotics, AI, and 3D printing are introduced. The Chapter 1 provides an introduction to computer systems and their workings, starting with an overview of the generations of computers. It explores the basic components of a computer system and discusses computer system architecture. The chapter also delves into software, including its definition and types such as system software and application software. It further covers operating systems, their functions, and different types including batch, multi-user, and real-time systems. Popular operating systems like MS DOS, Windows, macOS, Linux, Android, and iOS are highlighted. Additionally, the chapter introduces algorithms and flowcharts as essential tools in computer programming. |
Разместил: Ingvar16 6-05-2024, 19:51 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
Название: Mastering Marketing Data Science: A Comprehensive Guide for Today's Marketers Автор: Iain Brown Издательство: Wiley Год: 2024 Страниц: 432 Язык: английский Формат: epub (true) Размер: 13.0 MB
Unlock the Power of data: Transform Your Marketing Strategies with Data Science. In the digital age, understanding the symbiosis between marketing and data science is not just an advantage; it's a necessity. In Mastering Marketing Data Science: A Comprehensive Guide for Today's Marketers, Dr. Iain Brown, a leading expert in data science and marketing analytics, offers a comprehensive journey through the cutting-edge methodologies and applications that are defining the future of marketing. This book bridges the gap between theoretical data science concepts and their practical applications in marketing, providing readers with the tools and insights needed to elevate their strategies in a data-driven world. Whether you're a master's student, a marketing professional, or a data scientist keen on applying your skills in a marketing context, this guide will empower you with a deep understanding of marketing data science principles and the competence to apply these principles effectively. Marketing Data Science equips organizations with the power to make data-driven decisions, optimize marketing expenditures, elevate customer experiences, and secure a competitive edge. By harnessing advanced techniques, such as Machine Learning (see Chapter 5), natural language processing (NLP) (see Chapter 6), and Big Data analytics (see Chapter 11), marketing data scientists can discover latent opportunities, foresee customer behavior, and devise personalized marketing strategies that resonate with target audiences. Engage with real-world examples, hands-on exercises in both Python & SAS, and actionable insights to apply in your marketing campaigns. |
Разместил: Ingvar16 3-05-2024, 21:31 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
 Название: SQL. Pocket guide, 4-е издание Автор: Элис Жао Издательство: Спринт Бук Год: 2024 Формат: pdf Страниц: 320 Для сайта: Mirknig.su Размер: 12,7 Мб Язык: русский
Если вы аналитик или инженер по обработке данных и используете SQL, популярный карманный справочник станет для вас идеальным помощником. Найдите множество примеров, раскрывающих все сложности языка, а также ключевые аспекты SQL при его использовании в Microsoft SQL Server, MySQL, Oracle Database, PostgreSQL и SQLite. В обновленном издании Элис Жао описывает, как в этих СУБД используется SQL для формирования запросов и внесения изменений в базу. Получите подробную информацию о типах данных и их преобразованиях, синтаксисе регулярных выражений, оконных функциях, операторах PIVOT и UNPIVOT и многом другом. |
Разместил: relizer 2-05-2024, 23:09 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
 |
|
 |
|
|
 |
|  |
|
Название: Deploying Juniper Data Centers with EVPN VXLAN (Final) Автор: Aninda Chatterjee Издательство: Addison-Wesley Professional/Pearson Education Год: 2024 Страниц: 689 Язык: английский Формат: pdf (true) Размер: 19.6 MB
Learn to deploy Juniper Data Centers with EVPN VXLAN and master the only intent-based multivendor solution for deploying and monitoring EVPN-based VXLAN fabrics! Deploying Juniper Data Centers with EVPN VXLAN is designed for engineers and architects designing, deploying, and/or maintaining small to large data centers. This book will increase productivity and streamline processing and communication by helping you understand BGP EVPNbased VXLAN, data center design and deployment using Junos, and interconnecting multiple data centers for various deployment applications. Aninda Chatterjees straightforward prose and industry experience also gives you the foundational knowledge necessary for Juniper Data Center certification from the JNCIA-DC to the JNCIE-DC. The books structure is unique in its chapter-by-chapter approach with one-pager quick reference guides at the end of the book. The author also puts theory to practice using a combination of packet captures and packet walks. Junos OS is a network operating system with a modular software architecture, developed on top of FreeBSD, which is an open-source operating system. Services run as daemons in their own protected memory space. |
Разместил: Ingvar16 2-05-2024, 06:32 | Комментарии: 0 | Подробнее
| | | |
 |
|  |
br>
|