top of page

Related Courses

50 Hours of Big Data, PySpark, AWS, Scala, and Scraping & PDF Guides (614 Video

Price: 10.00 USD |  Size:17.4 GB | Duration :  50+  Hours | 614 Video Lessons 

BRAND: Expert TRAINING | ENGLISH | Bonus PDF Guides | INSTANT DOWNLOAD

 

HOW TO DOWNLOAD THIS COURSE?

You can Instant Download a PDF file After successful payment, This PDF File Contains Course Download links

You can Download This Course immediately from the click that Links

 

 

50 Hours of Big Data, PySpark, AWS, Scala, and Scraping & PDF Guides

 

 

Video description

Learn, build, and execute big data strategies with Scala and Spark, PySpark and AWS, data scraping and data mining with Python, and master MongoDB

 

About This Video

 

  • Data scraping and data mining for beginners to pro with Python
  • Clear unfolding of concepts with examples in Python, Scrapy, Scala, PySpark, and MongoDB
  • Master Big Data with PySpark and AWS

In Detail

Part 1 is designed to reflect the most in-demand Scala skills. It provides an in-depth understanding of core Scala concepts. We will wrap up with a discussion on Map Reduce and ETL pipelines using Spark from AWS S3 to AWS RDS (includes six mini-projects and one Scala Spark project).

Part 2 covers PySpark to perform data analysis. You will explore Spark RDDs, Dataframes, a bit of Spark SQL queries, transformations, and actions that can be performed on the data using Spark RDDs and dataframes, the ecosystem of Spark and Hadoop, and their underlying architecture. You will also learn how we can leverage AWS storage, databases, computations, and how Spark can communicate with different AWS services.

 

Part 3 is all about data scraping and data mining. You will cover important concepts such as Internet Browser execution and communication with the server, synchronous and asynchronous, parsing data in response from the server, tools for data scraping, Python requests module, and more.

 

In Part 4, you will be using MongoDB to develop an understanding of the NoSQL databases. You will explore the basic operations and explore the MongoDB query, project and update operators. We will wind up this section with two projects: Developing a CRUD-based application using Django and MongoDB and implementing an ETL pipeline using PySpark to dump the data in MongoDB.

By the end of this course, you will be able to relate the concepts and practical aspects of learned technologies with real-world problems.

 

Audience

This course is designed for absolute beginners who want to create intelligent solutions, study with actual data, and enjoy learning theory and then putting it into practice. Data scientists, machine learning experts, and drop shippers will all benefit from this training.

A basic understanding of programming, HTML tags, Python, SQL, and Node JS is required. However, no prior knowledge of data scraping, and Scala is needed.

 

Table of contents

Chapter 1 : Part 1 - Data Scraping and Data Mining for Beginners to Pro with Python

Why Data Scraping

Applications of Data Scraping

Introduction of Instructor

Introduction to Course, Scraping, Tools

Projects Overview for Data Scraping and Data Mining

Chapter 2 : Requests

Introduction to Python Requests

Hands-On with Requests

Extracting Quotes Manually

Quiz (Extracting Authors)

Solution (Extracting Authors)

Pagination

Quiz (Extracting Author and Quotes)

Solution 01 (Extracting Author and Quotes)

Solution 02 (Extracting Author and Quotes)

Ajax Requests

Ajax Requests for Cricinfo

Ajax Requests Pagination

Quiz (Extracting Top Stats from Cricinfo)

Solution 01 (Extracting Top Stats from Cricinfo)

Solution 02 (Extracting Top Stats from Cricinfo)

Chapter 3 : Beautiful Soup 4 (BS4)

Introduction to BS4

Quiz (Difference Between Requests and BS4)

Solution (Difference Between Requests and BS4)

Hands-On with BS4

Extracting Data from Tree

Extracting Quotes from the Website

Quiz (Extracting Author Names)

Solution (Extracting Author Names)

Attributes of Tags in BS4

Multi-Valued Attributes of Tags in BS4

Scraping Movie Names from IMDB

Quiz (Getting the Ratings, Year, and Name of the Movie)

Solution 01 (Getting the Ratings, Year, and Name of the Movie)

Solution 02 (Getting the Ratings, Year, and Name of the Movie)

Scraping Time, Genre, and Releasing Date from IMDB 01

Scraping Time, Genre, and Releasing Date from IMDB 02

Combining Two Requests Data for IMDB

Movies Recommender System (Creating Movie URL)

Movies Recommender System (Creating Director URL)

Movies Recommender System Using BS4 (Getting Top Four Movies)

Movies Recommender System Using BS4 (Merge All Requests Together)

Chapter 4 : CSS Selectors

Introduction to CSS Selectors

CSS Selectors Hands-On (Tags)

Quiz (Tags)

Solution (Tags)

CSS Selectors Hands-On (Descendants, ID, Class)

Quiz (Descendants)

Solution (Descendants)

Quiz (ID)

Solution (ID)

Solution (Class) Part 1

Solution (Class) Part 2

CSS Selectors Hands-On (Nested Tags, ID Tags, Class Tags)

Quiz (Class with Tag)

Solution (Class with Tag)

CSS Selectors Hands-On (Comma Separator, Universal Selectors

Quiz (Combining Two Selectors)

Solution (Combining Two Selectors)

CSS Selectors Hands-On (Sibling Notations and Direct Child)

Quiz (Adjacent Sibling)

Solution (Adjacent Sibling)

Quiz (General Sibling)

Solution (General Sibling)

CSS Selectors Hands-On (Child Selectors)

Quiz (First Child)

Solution (First Child)

Quiz (Only Child)

Solution (Only Child)

Quiz (Last Child)

Solution (Last Child)

CSS Selectors Hands-On (Negations, Attributes)

Quiz (Negation)

Solution (Negation)

CSS Selectors Hands-On (Attributes, Attributes Values)

Quiz (Attributes Values)

Solution (Attributes Values)

CSS Selectors Hands-On (Attributes Wild Cards Values)

Quiz (Attributes Wild Card)

Solution (Attributes Wild Card)

Chapter 5 : Scrapy

Introduction to Scrapy

Comparison of Scrapy and Requests

Scrapy at a Glance Documentation

Getting Started with Scrapy

Running Documentation Spider 1

Running Documentation Spider 2

Writing Spider from the Scratch

Understanding the Response (URL, Status)

Understanding the Response (Headers)

Understanding the Response (Values in Headers)

Understanding the Response (Body)

Understanding the Response (Request)

Understanding the Response (Meta)

50 Hours of Big Data, PySpark, AWS, Scala, and Scraping & PDF Guides (614 Video

SKU: RS2422 120
$10.00Price
  • You can Instant Download a PDF file 

    After successful payment, 

    This PDF File Contains Course Download links

    You can Download This Course immediately from the click that Links

bottom of page