Tag: PySpark

Pluralsight – Transform Data Using PySpark

Released 12/2024
MP4 | Video: h264, 1280×720 | Audio: AAC, 44.1 kHz, 2 Ch
Level: Intermediate | Genre: eLearning | Language: English + subtitles | Duration: 43m | Size: 109 MB
Master large-scale data manipulation and analysis with PySpark. This course covers essential techniques for handling data, creating efficient workflows, and using custom functions to streamline complex tasks.

Applied Data Science Using PySpark (2nd Edition): Learn the End-to-End Predictive Model-Building Cycle

English | 2024 | ASIN: B0DBBXKL4X | 447 Pages | PDF | 18 MB
This comprehensive guide, featuring hand-picked examples of daily use cases, will walk you through the end-to-end predictive model-building cycle using the latest techniques and industry tricks. In Chapters 1, 2, and 3, we will begin by setting up the environment and covering the basics of PySpark, focusing on data manipulation. Chapter 4 delves into the art of variable selection, demonstrating various techniques available in PySpark. In Chapters 5, 6, and 7, we explore machine learning algorithms, their implementations, and fine-tuning techniques. Chapters 8 and 9 will guide you through machine learning pipelines and various methods to operationalize and serve models using Docker/API. Chapter 10 will demonstrate how to unlock the power of predictive models to create a meaningful impact on your business. Chapter 11 introduces some of the most widely used and powerful modeling frameworks to unlock real value from data.

Data Analysis with Python and PySpark, Video Edition

Released 3/2022
MP4 | Video: h264, 1280×720 | Audio: AAC, 44.1 kHz, 2 Ch
Genre: eLearning | Language: English | Duration: 10h 31m | Size: 1.6 GB
Think big about your data! PySpark brings the powerful Spark big data processing engine to the Python ecosystem, letting you seamlessly scale up your data tasks and create lightning-fast pipelines.

Distributed Machine Learning with PySpark: Migrating Effortlessly from Pandas and Scikit-Learn

English | 2023 | ISBN: 1484297504 | 500 Pages | PDF EPUB (True) | 4 MB
Migrate from pandas and scikit-learn to PySpark to handle vast amounts of data and process it faster. This book shows you how to make the transition by adapting your existing skills and leveraging the similarities in syntax, functionality, and interoperability between these tools.
