Udemy – Spark and Python for Big Data with PySpark 2019-9

Description

Spark and Python for Big Data with PySpark is a training course on Udemy that teaches how to use Spark and Python for big data. The course also covers PySpark, Spark Streaming, and core machine learning topics. The ability to analyze large data sets is one of the most valuable skills in technology, and this course introduces one of the leading technologies in this field, Apache Spark.

Apache Spark is an open-source distributed computing framework that many large organizations, such as Google, Facebook, Amazon, and NASA, use to solve big data problems. It can run up to 100 times faster than Hadoop MapReduce. In this course you will become well acquainted with Spark and learn how to use Python for big data work.

What you will learn in this course

Use Spark and Python to analyze big data

Use Spark 2.0 DataFrame syntax

Work on consulting project exercises

Use Spark with Random Forests for classification

Use Gradient Boosted Trees

Use MLlib to build machine learning models

Course specifications for Spark and Python for Big Data with PySpark

Language: English

Duration: 10 hours and 35 minutes

Number of lessons: 66

Level: Intermediate

Instructor: Jose Portilla

File format: mp4

Topics

66 lectures 10:35:05

Introduction to Course 4 lectures 30:12

Setting up Python with Spark 2 lectures 06:11

Local VirtualBox Set-up 3 lectures 31:09

AWS EC2 PySpark Set-up 4 lectures 38:58

Databricks Setup 1 lecture 11:41

AWS EMR Cluster Setup 1 lecture 17:16

Python Crash Course 7 lectures 58:50

Spark DataFrame Basics 7 lectures 01:04:52

Spark DataFrame Project Exercise 2 lectures 20:06

Introduction to Machine Learning with MLlib 2 lectures 19:25

Linear Regression 6 lectures 01:00:03

Logistic Regression 5 lectures 01:00:02

Decision Trees and Random Forests 5 lectures 52:26

K-means Clustering 5 lectures 41:20

Collaborative Filtering for Recommender Systems 2 lectures 18:39

Natural Language Processing 4 lectures 46:26

Spark Streaming with Python 5 lectures 57:18

Bonus 1 lecture 00:10

Prerequisites

General programming skills in any language (preferably Python)

20 GB of free space on your local computer (or alternatively a strong internet connection for AWS)

Installation

After extracting the files, watch them with your preferred video player.

Subtitles: English

Quality: 720p
