If you are ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step …
If you've successfully used Apache Spark to solve medium sized-problems, but still struggle to realize the "Spark promise" of unparalleled …
This book is about how to integrate full-stack open source big data architecture and how to choose the correct technology—Scala/Spark, …
Data is getting bigger, arriving faster, and coming in varied formats — and it all needs to be processed at …
A comprehensive end-to-end guide that gives hands-on practice in big data and Artificial Intelligence About This Book Learn to build …
Ready to simplify the process of building data lakehouses and data pipelines at scale? In this practical guide, learn how …
Harness the power of Scala to program Spark and analyze tonnes of data in the blink of an eye!About This …
Data engineers proficient in Databricks are in high demand. As organizations gather more data than ever before, skilled data engineers …
Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, author Adi Polak introduces …
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of this open-source …
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source …
Авторство: компания Databricks KnowledgebaseBest PracticesAvoid GroupByKeyDon't copy all elements of a large RDD to the driverGracefully Dealing with Bad Input …
Data in all domains is getting bigger. How can you work with it efficiently? This book introduces Apache Spark, the …
The Web is getting faster, and the data it delivers is getting bigger. How can you handle everything efficiently? This …
If you want to build an enterprise-quality application that uses natural language text but aren’t sure where to begin or …
Key Features Take your first steps in the world of data science by understanding the tools and techniques of data …
Machine Learning with Spark and Python Essential Techniques for Predictive Analytics, Second Edition simplifies ML for practical uses by focusing …
Access real-world documentation and examples for the Spark platform for building large-scale, enterprise-grade machine learning applications.The past decade has seen …
Big Data Analytics with Spark is a step-by-step guide for learning Spark, which is an open-source fast and general-purpose cluster …
Apache Spark's speed, ease of use, sophisticated analytics, and multilanguage support makes practical knowledge of this cluster-computing framework a required …
If you’re like most R users, you have deep knowledge and love for statistics. But as your organization continues to …
In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with …