All Courses
×
Microsoft

Big Data and Data Science Master's Course

4,633 Ratings

Our Big Data and Data Science Master’s course lets you gain proficiency in big data and data science. You will work on real-world projects in Hadoop Dev, admin, test, and analysis, apache spark, scala, AWS, tableau, artificial intelligence, deep learning, python for data science, R, splunk developer and admin, NoSQL databases, and more. In this program, we will cover 20 courses and 56 industry-based projects.

Watch

Course Preview

Key Highlights

322 Hrs Instructor Led Training
381 Hrs Self-paced Videos
528 Hrs Project & Exercises
Certification
Job Assistance
Flexible Schedule
Lifetime Free Upgrade
Mentor Support
Trustpilot
sitejabber
Mouthshut

Overview

List of Courses Included:

Online Instructor-led Courses:

  • Big Data Hadoop and Spark
  • Apache Spark and Scala
  • Python for Data Science
  • Tableau Desktop 10
  • Splunk Developer and Admin
  • Artificial Intelligence and Deep Learning Course with TensorFlow
  • MongoDB
  • AWS
  • Microsoft Azure Training

Self-paced Courses:

  • Data Science with R
  • Apache HBase
  • Apache Cassandra
  • Couchbase
  • Machine Learning
  • Solr
  • Linux
  • Java
  • Apache Kafka
  • SQL
  • Introduction to Hadoop
  • Detailed MapReduce and HDFS
  • Hive, Pig, Sqoop, Flume and Apache HBase
  • Real-time analytics with Spark
  • Prediction and analysis through clustering
  • Deploying recommender system
  • Linear and logistic regression
  • Designing and Developing NoSQL applications
  • Mastering Artificial Intelligence Algorithms and their practical use cases

There are no prerequisites for taking this training program.

  • Global big data market to reach $122 billion in revenue by 2025 – Frost & Sullivan
  • The US alone would face a shortage of 1.4–1.9 million big data analysts in the next two years – McKinsey

This Intellipaat training program has been created keeping in mind the needs of the industry. You will gain mastery in the complete aspects of data science and Hadoop ecosystem to take on various roles and responsibilities in the big data and data science domains at top-notch salaries.

View More

Talk To Us

We are happy to help you 24/7

Career Transition

57% Average Salary Hike

$1,28,000 Highest Salary

12000+ Career Transitions

300+ Hiring Partners

Career Transition Handbook

*Past record is no guarantee of future job prospects

Meet Your Mentors

What roles do a big data and data science expert play?

Big Data Specialist

Builds and maintains a customized pluggable service-based framework to import, transform, cleanse, and validate data.

Big Data Engineer

Designs, builds, and tests scalable and robust components of the data platform and also offer technical solutions to the respective problems.

Big Data Analyst

Creates data pipelines and comes up with solutions to resolve extremely complex issues.

Business Analyst

Pulls necessary data to perform necessary tasks, including business analysis, and developing reports, metrics, and dashboards for performance monitoring.

Data Engineer

Works with complex and large data sets, and creates analysis pipelines regularly to gain insights into this data.

Data Scientist

Applies various deep learning and machine learning techniques, such as random forests, decision trees, CNNs, and more.

View More

Skills to Master

Big Data Hadoop

Apache Spark and Scala

Data Science with R

Python for Data Science

Tableau Desktop 10

Splunk Developer and Admin

Artificial Intelligence and Deep Learning with TensorFlow

MongoDB

AWS

Azure

View More

Tools to Master

hadoop mapreduce hive apache-pig sqoop Oozie spark pyspark SparkSQL scala R SQL python numpy Scipy jupyter tableau splunk SAS tensorflow mongodb AWS azure Apache_HBase cassandra couchbase solr linux java kafka
View More

Course Fees

Self Paced Training

  • 381 Hrs e-learning videos
  • Flexible Schedule
  • Lifetime Free Upgrade

$1,755

Corporate Training

  • Customized Learning
  • Enterprise Grade Learning Management System (LMS)
  • 24x7 Support
  • Enterprise Grade Reporting

Contact Us

Curriculum

Live Course Self-Paced

Big Data Hadoop and Spark

60 Hours 33 Module

Preview

Module 01 – Hadoop Installation and Setup
Module 02 – Introduction to Big Data Hadoop and Understanding HDFS and MapReduce
Module 03 – Deep Dive in MapReduce
Module 04 – Introduction to Hive
Module 05 – Advanced Hive and Impala
Module 06 – Introduction to Pig
Module 07 – Flume, Sqoop and HBase
Module 08 – Writing Spark Applications Using Scala
Module 09 – Use Case Bobsrockets Package
Module 10 – Introduction to Spark
Module 11 – Spark Basics
Module 12 – Working with RDDs in Spark
Module 13 – Aggregating Data with Pair RDDs
Module 14 – Writing and Deploying Spark Applications
Module 15 – Project Solution Discussion and Cloudera Certification Tips and Tricks
Module 16 – Parallel Processing
Module 17 – Spark RDD Persistence
Module 18 – Spark MLlib
Module 19 – Integrating Apache Flume and Apache Kafka
Module 20 – Spark Streaming
Module 21 – Improving Spark Performance
Module 22 – Spark SQL and Data Frames
Module 23 – Scheduling/Partitioning

The following topics will be available only in self-paced mode:

Module 24 – Hadoop Administration – Multi-node Cluster Setup Using Amazon EC2
Module 25 – Hadoop Administration – Cluster Configuration
Module 26 – Hadoop Administration – Maintenance, Monitoring and Troubleshooting
Module 27 – ETL Connectivity with Hadoop Ecosystem (Self-Paced)
Module 28 – Hadoop Application Testing
Module 29 – Roles and Responsibilities of Hadoop Testing Professional
Module 30 – Framework Called MRUnit for Testing of MapReduce Programs
Module 31 – Unit Testing
Module 32 – Test Execution
Module 33 – Test Plan Strategy and Writing Test Cases for Testing Hadoop Application

Download Brochure

Tools covered

mapreduce hive apache-pig sqoop Oozie spark pyspark SparkSQL

Scala Course Content

Module 01 – Introduction to Scala
Module 02 – Pattern Matching
Module 03 – Executing the Scala Code
Module 04 – Classes Concept in Scala
Module 05 – Case Classes and Pattern Matching
Module 06 – Concepts of Traits with Example
Module 07 – Scala–Java Interoperability
Module 08 – Scala Collections
Module 09 – Mutable Collections Vs. Immutable Collections
Module 10 – Use Case Bobsrockets Package

Spark Course Content

Module 11 – Introduction to Spark
Module 12 – Spark Basics
Module 13 – Working with RDDs in Spark
Module 14 – Aggregating Data with Pair RDDs
Module 15 – Writing and Deploying Spark Applications
Module 16 – Parallel Processing
Module 17 – Spark RDD Persistence
Module 18 – Spark MLlib
Module 19 – Integrating Apache Flume and Apache Kafka
Module 20 – Spark Streaming
Module 21 – Improving Spark Performance
Module 22 – Spark SQL and Data Frames
Module 23 – Scheduling/Partitioning

Download Brochure

Tools covered

spark scala

Module 01 – Introduction to Data Science using Python
Module 02 – Python basic constructs
Module 03 – Maths for DS-Statistics and Probability
Module 04 – OOPs in Python (Self-paced)
Module 05 – NumPy for mathematical computing
Module 06 – SciPy for scientific computing
Module 07 – Data manipulation
Module 08 – Data visualization with Matplotlib
Module 09 – Machine Learning using Python
Module 10 – Supervised learning
Module 11 – Unsupervised Learning
Module 12 – Python integration with Spark (Self-paced)
Module 13 – Dimensionality Reduction
Module 14 – Time Series Forecasting

Download Brochure

Tools covered

python jupyter numpy Scipy

Module 01 – Introduction to Data Visualization and The Power of Tableau
Module 02 – Architecture of Tableau
Module 03 – Charts and Graphs
Module 04 – Working with Metadata and Data Blending
Module 05 – Advanced Data Manipulations
Module 06 – Working with Filters
Module 07 – Organizing Data and Visual Analytics
Module 08 – Working with Mapping
Module 09 – Working with Calculations and Expressions
Module 10 – Working with Parameters
Module 11 – Dashboards and Stories
Module 12 – Tableau Prep
Module 13 – Integration of Tableau with R

Download Brochure

Tools covered

tableau

Module 1 – Splunk Development Concepts
Module 2 – Basic Searching
Module 3 – Using Fields in Searches
Module 4 – Saving and Scheduling Searches
Module 5 – Creating Alerts
Module 6 – Scheduled Reports
Module 7 – Tags and Event Types
Module 8 – Creating and Using Macros
Module 9 – Workflow
Module 10 – Splunk Search Commands
Module 11 – Transforming Commands
Module 12 – Reporting Commands
Module 13 – Mapping and Single Value Commands
Module 14 – Splunk Reports and Visualizations
Module 15 – Analyzing, Calculating and Formatting Results
Module 16 – Correlating Events
Module 17 – Enriching Data with Lookups
Module 18 – Creating Reports and Dashboards
Module 19 – Getting Started with Parsing
Module 20 – Using Pivot
Module 21 – Common Information Model (CIM) Add-On

Splunk Administration Topics

Module 22 – Overview of Splunk
Module 23 – Splunk Installation
Module 24 – Splunk Installation in Linux
Module 25 – Distributed Management Console
Module 26 – Introduction to Splunk App
Module 27 – Splunk Indexes and Users
Module 28 – Splunk Configuration Files
Module 29 – Splunk Deployment Management
Module 30 – Splunk Indexes
Module 31 – User Roles and Authentication
Module 32 – Splunk Administration Environment
Module 33 – Basic Production Environment
Module 34 – Splunk Search Engine
Module 35 – Various Splunk Input Methods
Module 36 – Splunk User and Index Management
Module 37 – Machine Data Parsing
Module 38 – Search Scaling and Monitoring
Module 39 – Splunk Cluster Implementation

Download Brochure

Tools covered

splunk

Module 01 – Introduction to Deep Learning and Neural Networks
Module 02 – Multi-layered Neural Networks
Module 03 – Artificial Neural Networks and Various Methods
Module 04 – Deep Learning Libraries
Module 05 – Keras API
Module 06 – TFLearn API for TensorFlow
Module 07 – Dnns (deep neural networks)
Module 08 – Cnns (convolutional neural networks)
Module 09 – Rnns (recurrent neural networks)
Module 10 – Gpu in deep learning
Module 11 – Autoencoders and restricted boltzmann machine (rbm)
Module 12 – Deep learning applications
Module 13 – Chatbots

Download Brochure

Tools covered

tensorflow

Module 01 – Introduction to NoSQL and MongoDB
Module 02 – MongoDB Installation
Module 03 – Importance of NoSQL
Module 04 – CRUD Operations
Module 05 – Data Modeling and Schema Design
Module 06 – Data Management and Administration
Module 07 – Data Indexing and Aggregation
Module 08 – MongoDB Security
Module 09 – Working with Unstructured Data

Download Brochure

Tools covered

mongodb

Module 01 – Introduction to Microsoft Azure
Module 02 – Introduction to ARM & Azure Storage
Module 03 – Introduction to Azure storage
Module 04 – Azure Virtual Machines
Module 05 – Azure App and Container services
Module 06 – Azure Networking – I
Module 07 – Azure Networking – II
Module 08 – Authentication and Authorization in Azure using RBAC
Module 09 – Microsoft Azure Active Directory
Module 10 – Azure Monitoring

Download Brochure

Tools covered

azure

Module 01 – Introduction to Cloud Computing and AWS
Module 02 – Elastic Compute and Storage Volumes
Module 03 – Load Balancing, Autoscaling and DNS
Module 04 – Virtual Private Cloud
Module 05 – Storage – Simple Storage Service (S3)
Module 06 – Databases and In-Memory DataStores
Module 07 – Management and Application Services
Module 08 – Access Management and Monitoring Services
Module 09 – Automation and Configuration Management
Module 10 – AWS Migration

Self Paced

Module 11 – Architecting AWS – whitepaper
Module 12 – DevOps on AWS
Module 13 – Amazon FSx and Global Accelerator
Module 14 – AWS Architect Interview Questions

Download Brochure

Tools covered

AWS

Module 01 – Introduction to Data Science with R
Module 02 – Data Exploration
Module 03 – Data Manipulation
Module 04 – Data Visualization
Module 05 – Introduction to Statistics
Module 06 – Machine Learning
Module 07 – Logistic Regression
Module 08 – Decision Trees and Random Forest
Module 09 – Unsupervised Learning
Module 10 – Association Rule Mining and Recommendation Engines

Self-paced Course Content

Module 11 – Introduction to Artificial Intelligence
Module 12 – Time Series Analysis
Module 13 – Support Vector Machine (SVM)
Module 14 – Naïve Bayes
Module 15 – Text Mining

Download Brochure

Tools covered

R

Module 01 – HBase Overview
Module 02 – Architecture of NoSQL
Module 03 – HBase Data Modeling
Module 04 – HBase Cluster Components
Module 05 – HBase API and Advanced Operations
Module 06 – Integration of Hive with HBase
Module 07 – File Loading with Both Load Utilities

Download Brochure

Tools covered

Apache_HBase

Module 01 – Advantages and Usage of Cassandra
Module 02 – CAP Theorem and No SQL DataBase
Module 03 – Cassandra fundamentals, data model, Installation and setup
Module 04 – Cassandra Configuration
Module 05 – Summarization, node tool commands, cluster, Indexes, Cassandra and MapReduce, Installing Ops-center
Module 06 – Multi Cluster setup
Module 07 – Thrift/Avro/Json/Hector Client
Module 08 – Datastax installation part,· Secondary index
Module 09 – Advance Modelling
Module 10 – Deploying the IDE for Cassandra applications
Module 11 – Cassandra Administration
Module 12 – Cassandra API and Summarization and Thrift

Download Brochure

Tools covered

cassandra

Module 01 – Introduction to Couchbase
Module 02 – Single-node Implementation
Module 03 – Couchbase Web Console
Module 04 – Couchbase Multi-node Cluster
Module 05 – Couchbase Command-line Interface

Download Brochure

Tools covered

couchbase

Module 01 – Introduction to Machine Learning
Module 02 – Supervised Learning and Linear Regression
Module 03 – Classification and Logistic Regression
Module 04 – Decision Tree and Random Forest
Module 05 – Naïve Bayes and Support Vector Machine (self-paced)
Module 06 – Unsupervised Learning
Module 07 – Natural Language Processing and Text Mining (self-paced)
Module 08 – Introduction to Deep Learning
Module 09 – Time Series Analysis (self-paced)

Download Brochure

Tools covered

python jupyter

Module 01 – Fundamentals of Search Engine and Apache Lucene
Module 02 – Analyzers in Lucene
Module 03 – Exploring Apache Lucene
Module 04 – Apache Lucene Demonstration
Module 05 – Apache Lucene advanced
Module 06 – Advance topics of Apache Lucene (practical)
Module 07 – Apache Solr
Module 08 – Apache Solr Indexing
Module 09 – Solr Indexing continued
Module 10 – Apache Solr Searching
Module 11 – Deep dive into Apache Solr
Module 12 – Apache Solr continued
Module 13 – Extended Features
Module 14 – Multicore
Module 15 – Administration & SolrCloud

Download Brochure

Tools covered

solr

Module 01 – Introduction to Linux
Module 02 – File Management
Module 03 – Files and Processes
Module 04 – Introduction to Shell Scripting
Module 05 – Conditional, Looping statements and Functions
Module 06 – Text Processing
Module 07 – Scheduling Tasks
Module 08 – Advanced Shell Scripting
Module 09 – Database Connectivity
Module 10 – Linux Networking

Download Brochure

Tools covered

linux

Module 01 – Core Java Concepts
Module 02 – Writing Java Programs using Java Principles
Module 03 – Language Conceptuals
Module 04 – Operating with Java Statements
Module 05 – Concept of Objects and Classes
Module 06 – Introduction to Core Classes
Module 07 – Inheritance in Java
Module 08 – Exception Handling in Detail
Module 09 – Getting started with Interfaces and Abstract Classes
Module 10 – Overview of Nested Classes
Module 11 – Getting started with Java Threads
Module 12 – Overview of Java Collections
Module 13 – Understanding JDBC
Module 14 – Java Generics
Module 15 – Input/Output in Java
Module 16 – Getting started with Java Annotations
Module 17 – Reflection and its Usage

Download Brochure

Tools covered

java

Module 01 – What is Kafka – An Introduction
Module 02 – Multi-Broker Kafka Implementation
Module 03 – Multi Node Cluster Setup
Module 04 – Integrate Flume with Kafka
Module 05 – Kafka API
Module 06 – Producers and Consumers

Download Brochure

Tools covered

kafka

Module 01 – Introduction to SQL
Module 02 – Database Normalization and Entity Relationship Model
Module 03 – SQL Operators
Module 04 – Working with SQL: Join, Tables, and Variables
Module 05 – Deep Dive into SQL Functions
Module 06 – Working with Subqueries
Module 07 – SQL Views, Functions, and Stored Procedures
Module 08 – Deep Dive into User-defined Functions
Module 09 – SQL Optimization and Performance
Module 10 – Advanced Topics
Module 11 – Managing Database Concurrency
Module 12 – Programming Databases Using Transact-SQL
Module 13 – Microsoft Courses: Study Material

Download Brochure

Tools covered

SQL
View More

Project Work

Projects will be a part of your Big Data and Data Science Master’s program to consolidate your learning. It will ensure that you have real-world experience in Big Data and Data Science.

Career Services

Career Services
guaranteed
Assured Interviews
job_portal
Exclusive access to Intellipaat Job portal
Mock Interview Preparation
1 on 1 Career Mentoring Sessions
resume
Career Oriented Sessions
linkedin
Resume & LinkedIn Profile Building
View More

Certification

Microsoft-cert Click to Zoom

The Big Data and Data Science Master’s training content is in line with respective certification exams. In this course, there will be quizzes that will reflect the type of questions asked in the respective exams. Moreover, once you successfully execute the projects, you will receive certifications from Intellipaat, Microsoft.

Moreover, this is a comprehensive course that is designed to clear multiple certifications, namely:

  • CCA Spark and Hadoop Developer (CCA175)
  • Splunk Certified Power User Certification
  • Splunk Certified Admin Certification
  • Tableau Desktop Qualified Associate Exam
  • C100DEV: MongoDB Certified Developer Associate Exam
  • Apache Cassandra DataStax Certification
  • Linux Foundation Linux Certification
  • Java SE Programmer Certification
  • AWS Certified Solutions Architect exam

Reviews & Testimonials

( 4,633 )

Land Your Dream Job Like Our Alumni

Frequently Asked Questions

What is Intellipaat’s Master's course and how is it different from individual courses?

Intellipaat’s Master’s course is a structured learning path especially designed by industry experts and ensures that you transform into a big data and data science expert. Individual courses at Intellipaat focus on one or two specializations. However, if you have to master big data and data science, then this program is for you.

3 technical 1:1 sessions per month will be allowed.

Intellipaat offers query resolution, and you can raise a ticket with the dedicated support team at any time. You can avail yourself of email support for all your queries. We can also arrange one-on-one sessions with our support team If your query does not get resolved through email. However, 1:1 session support is given for 6 months from the start date of your course.

Intellipaat provides placement assistance to all learners who have completed the training and moved to the placement pool after clearing the PRT (Placement Readiness Test). More than 500+ top MNCs and startups hire Intellipaat learners. Our alumni work with Google, Microsoft, Amazon, Sony, Ericsson, TCS, Mu Sigma, etc.

No, our job assistance is aimed at helping you land your dream job. It offers a potential opportunity for you to explore various competitive openings in the corporate world and find a well-paid job, matching your profile. The final hiring decision will always be based on your performance in the interview and the requirements of the recruiter.

View More