Big Data characteristics, understanding Hadoop distributed computing, the Bayesian Law, deploying Storm for real-time analytics, Apache Storm features, comparing Storm with Hadoop, Storm execution and learning about Tuple, Spout and Bolt
Installing Apache Storm and various types of run modes of Storm
Understanding Apache Storm and the data model
Installation of Apache Kafka and its configuration
Understanding advanced Storm topics like Spouts, Bolts, Stream Groupings and Topology and its life cycle and learning about guaranteed message processing
Various grouping types in Storm, reliable and unreliable messages, Bolt structure and life cycle, understanding Trident topology for failure handling, process and call log analysis topology for analyzing call logs for calls made from one number to another
Understanding of Trident spouts and its different types, various Trident spout interface and components, familiarizing with Trident filter, aggregator and functions and a practical and hands-on use case on solving call log problem using Storm Trident
Various components, classes and interfaces in Storm like Base Rich Bolt Class, i RichBolt Interface, i RichSpout Interface and Base Rich Spout Class and various methodologies of working with them
Understanding Cassandra, its core concepts, its strengths and deployment
Twitter Boot Stripping, detailed understanding of Boot Stripping, concepts of Storm, Storm development environment
Project 1: Call Log Analysis Using Trident
Topics: In this project, you will be working on call logs to decipher the data and gather valuable insights using Apache Storm Trident. You will extensively work with data about calls made from one number to another. The aim of this project is to resolve the call log issues with Trident stream processing and low latency distributed querying. You will gain hands-on experience in working with Spouts and Bolts, along with various Trident functions, filters, aggregation, joins and grouping.
Project 2: Twitter Data Analysis Using Trident
Topics: This is a project that involves working with Twitter data and processing it to extract patterns out of it. The Apache Storm Trident is the perfect framework for the real-time analysis of tweets. While working with Trident, you will be able to simplify the task of live Twitter feed analysis. In this project, you will gain real-world experience of working with Spouts, Bolts and Trident filters, joins, aggregation, functions and grouping.
Project 3: The US Presidential Election Results Analysis Using Trident DRPC Query
Topics: This is a project that lets you work on the US presidential election results and predict who is leading and trailing on a real-time basis. For this, you exclusively work with Trident distributed remote procedure call server. After the completion of the project, you will learn how to access data residing in a remote computer or network and deploy it for real-time processing, analysis and prediction.
This course is designed for clearing the Intellipaat Storm Certification exam. The entire training course content is designed by industry professionals in order to help you get the best jobs in top MNCs. As part of this training, you will be working on real-time projects and assignments that have immense implications in the real-world industry scenarios, thus helping you fast-track your career effortlessly.
At the end of this training program, there will be quizzes that perfectly reflect the type of questions asked in the respective certification exams and help you score better marks.
The certification will be awarded upon the completion of the project work (after expert review) and upon scoring at least 60% marks in the quiz. Intellipaat certification is well recognized in top 80+ MNCs like Ericsson, Cisco, Cognizant, Sony, Mu Sigma, Saint-Gobain, Standard Chartered, TCS, Genpact, Hexaware, etc.
A Senior Software Architect at NextGen Healthcare who has previously worked with IBM Corporation, Suresh Paritala has worked on Big Data, Data Science, Advanced Analytics, Internet of Things and Azure, along with AI domains like Machine Learning and Deep Learning. He has successfully implemented high-impact projects in major corporations around the world.
An experienced Blockchain Professional who has been bringing integrated Blockchain, particularly Hyperledger and Ethereum, and Big Data solutions to the cloud, David Callaghan has previously worked on Hadoop, AWS Cloud, Big Data and Pentaho projects that have had major impact on revenues of marquee brands around the world.