Big Data and Hadoop for Beginners - with Hands-on!

Andalib Ansari, Big Data Consultant

Play Speed
  • 0.5x
  • 1x (Normal)
  • 1.25x
  • 1.5x
  • 2x
27 Videos (2h 35m)
    • Course Overview

      2:21
    • Introduction to Big Data

      9:23
    • Big Data Job Roles

      6:30
    • Big Data Salaries

      2:55
    • Technology Trends in the Market

      6:30
    • Advice for Big Data Beginners

      2:44
    • Introduction to Hadoop

      8:23
    • Hadoop Ecosystem

      5:01
    • Hadoop 1.x vs Hadoop 2.x

      14:13
    • ETL vs ELT

      3:19
    • Hadoop Vendors

      4:20
    • Managing HDFS from Command Line

      9:09
    • Introduction to Hive

      2:41
    • Hive Architecture

      2:28
    • File Formats in Hive

      4:40
    • SQL vs HQL

      3:46
    • UDF & UDAF in Hive

      2:57
    • Hive Demo

      18:50
    • Introduction to Pig

      2:57
    • Pig Architecture

      1:39
    • Pig Data Model

      2:17
    • How Pig Latin Works

      2:57
    • SQL vs Pig

      5:32
    • UDF in Pig

      3:25
    • Pig Demo

      12:49
    • Designing Data Pipeline using Pig and Hive

      7:59
    • Data Lake

      5:24

About This Class

965b4ac8

The main objective of this course is to help you understand Complex Architectures of Hadoop and its components, guide you in the right direction to start with, and quickly start working with Hadoop and its components.

It covers everything what you need as a Big Data Beginner. Learn about Big Data market, different job roles, technology trends, history of Hadoop, HDFS, Hadoop Ecosystem, Hive and Pig. In this course, we will see how as a beginner one should start with Hadoop. This course comes with a lot of hands-on examples which will help you learn Hadoop quickly.

The course have 6 sections, and focuses on the following topics:

Big Data at a Glance: Learn about Big Data and different job roles required in Big Data market. Know big data salary trends around the globe. Learn about hottest technologies and their trends in the market.

Getting Started with Hadoop: Understand Hadoop and its complex architecture. Learn Hadoop Ecosystem with simple examples. Know different versions of Hadoop (Hadoop 1.x vs Hadoop 2.x), different Hadoop Vendors in the market and Hadoop on Cloud. Understand how Hadoop uses ELT approach. Learn installing Hadoop on your machine. We will see running HDFS commands from command line to manage HDFS.

Getting Started with Hive: Understand what kind of problem Hive solves in Big Data. Learn its architectural design and working mechanism. Know data models in Hive, different file formats supported by Hive, Hive queries etc. We will see running queries in Hive.

Getting Started with Pig: Understand how Pig solves problems in Big Data. Learn its architectural design and working mechanism. Understand how Pig Latin works in Pig. You will understand the differences between SQL and Pig Latin. Demos on running different queries in Pig.

Use Cases: Real life applications of Hadoop is really important to better understand Hadoop and its components, hence we will be learning by designing a sample Data Pipeline in Hadoop to process big data. Also, understand how companies are adopting modern data architecture i.e. Data Lake in their data infrastructure.

Practice: Practice with huge Data Sets. Learn Design and Optimization Techniques by designing Data Models, Data Pipelines by using real life applications' data sets.

25

Students

--

Projects

0

Reviews (0)

Andalib Ansari

Big Data Consultant

Andalib Ansari is a Big Data consultant based out of Mumbai. He helps companies and people solve business problems using Big Data technologies. Also, one of his passion, to guide and train people on different Big Data tools and technologies.

He is having a very decent exposure of Big Data tools and technologies, and have worked with various clients, top level Mobile Network Operators (MNO), from Latin America and the US to solve different business problems for different use-cases, and designed optimized Data Pipelines using Big Data technologies on the cloud.