Courses ×

Back

Corporate Training Explore Courses

Data Science Training Course in San Diego

5 ( 591 ) Ratings

Intellipaat’s Data Science course in San Diego is curated by industry experts who will teach the concepts of R programming, Machine Learning algorithms, data acquisition, and data modeling, along with their practical applications in the domains of e-commerce, finance, and banking. By the end of our Data Science training, you will gain ample knowledge to become a certified Data Scientist.

In collaboration with img
MS Excel Self-paced Course

Key Highlights

42 Hrs Instructor Led Training
28 Hrs Self-paced Videos
56 Hrs Project & Exercises
Certification and Job Assistance
Flexible Schedule
Lifetime Free Upgrade
24 x 7 Lifetime Support & Access
Mentor Support
Intellipaat-Key-Features
Intellipaat-Key-Features
Intellipaat-Key-Features

Data Scientist Course in San DiegoOverview

This Data Science training in San Diego covers all the must-have modules of the domain, such as Machine Learning algorithms, Data Science life cycle, data mining, and data manipulation, and it makes the learners work on real-world datasets in projects.

What are the key topics covered in this Data Science certification in San Diego?

  • Introduction to Big Data Hadoop
  • Data exploratory analysis
  • Need for data manipulation
  • Different types of data visualization
  • Categories of statistics
  • Understanding Machine Learning
  • Logistic regression concepts
  • Information Architects
  • Business Analysts
  • Data Science job seekers and aspirants

There are no specific prerequisites. However, a little enthusiasm in learning mathematics and statistics can help you understand the concepts easily.

  • The average salary of a Data Scientist in San Diego is around US$113,000 per annum – Glassdoor
  • 204 Data Scientist job vacancies are available in San Diego – LinkedIn
  • Data Scientists earn more than the average pay of other IT employees – Indeed
View More

Talk To Us

We are happy to help you 24/7

With data collection, ’the sooner the better’ is the best answer - The CEO of Yahoo
Everything is going to be connected with data and mediated by software - The CEO of Microsoft
The world is now awash in data, and we can see consumers in a cleaner way - The Co-founder of PayPal

Career Transition

Career Transition Handbook

Skills Covered

  • R Programming
  • Exploratory Data Analysis
  • Data Manipulation
  • Data Visualization
  • Statistics 
  • Machine Learning Algorithms
View More

Course Fees

Self Paced Training

  • 28 Hrs e-learning videos
  • Lifetime Free Upgrade
  • 24 x 7 Lifetime Support & Access

$264

Online Classroom Preferred

  • Everything in self-paced, plus
  • 42 Hrs of Instructor-led Training
  • 1:1 Doubt Resolution Sessions
  • Attend as many batches for Lifetime
  • Flexible Schedule
07 Aug

SAT - SUN

08:00 PM TO 11:00 PM IST (GMT +5:30)

10 Aug

TUE - FRI

07:00 AM TO 09:00 AM IST (GMT +5:30)

14 Aug

SAT - SUN

08:00 PM TO 11:00 PM IST (GMT +5:30)

22 Aug

SAT - SUN

08:00 PM TO 11:00 PM IST (GMT +5:30)

$ 499 $399 10% OFF Expires in

Corporate Training

  • Customized Learning
  • Enterprise grade learning management system (LMS)
  • 24x7 Support
  • Enterprise grade reporting

Contact Us

Data Science Course Content

Live Course

Module 01 - Introduction to Data Science with R

Preview

1.1 What is Data Science?
1.2 Significance of Data Science in today’s data-driven world, its applications of, , lifecycle, and its components
1.3 Introduction to R programming and RStudio

Hands-on Exercise:

1. Installation of RStudio
2. Implementing simple mathematical operations and logic using R operators, loops, if statements, and switch cases

Module 02 - Data Exploration

Preview

2.1 Introduction to data exploration
2.2 Importing and exporting data to/from external sources
2.3 What are data exploratory analysis and data importing?
2.4 DataFrames, working with them, accessing individual elements, vectors, factors, operators, in-built functions, conditional and looping statements, user-defined functions, and data types

Hands-on Exercise:

1. Accessing individual elements of customer churn data
2. Modifying and extracting results from the dataset using user-defined functions in R

3.1 Need for data manipulation
3.2 Introduction to the dplyr package
3.3 Selecting one or more columns with select(), filtering records on the basis of a condition with filter(), adding new columns with mutate(), sampling, and counting
3.4 Combining different functions with the pipe operator and implementing SQL-like operations with sqldf

Hands-on Exercise:

1. Implementing dplyr
2. Performing various operations for manipulating data and storing it

4.1 Introduction to visualization
4.2 Different types of graphs, the grammar of graphics, the ggplot2 package, categorical distribution with geom_bar(), numerical distribution with geom_hist(), building frequency polygons with geom_freqpoly(), and making a scatterplot with geom_pont()
4.3 Multivariate analysis with geom_boxplot
4.4 Univariate analysis with a barplot, a histogram and a density plot, and multivariate distribution
4.5 Creating barplots for categorical variables using geom_bar(), and adding themes with the theme() layer
4.6 Visualization with plotly, frequency plots with geom_freqpoly(), multivariate distribution with scatter plots and smooth lines, continuous distribution vs categorical distribution with box-plots, and sub grouping plots
4.7 Working with co-ordinates and themes to make graphs more presentable, understanding plotly and various plots, and visualization with ggvis
4.8 Geographic visualization with ggmap() and building web applications with shinyR

Hands-on Exercise:

1. Creating data visualization to understand the customer churn ratio using ggplot2 charts
2. Using plotly for importing and analyzing data
3. Visualizing tenure, monthly charges, total charges, and other individual columns using a scatter plot

5.1 Why do we need statistics?
5.2 Categories of statistics, statistical terminology, types of data, measures of central tendency, and measures of spread
5.3 Correlation and covariance, standardization and normalization, probability and the types, hypothesis testing, chi-square testing, ANOVA, normal distribution, and binary distribution

Hands-on Exercise:

1. Building a statistical analysis model that uses quantification, representations, and experimental data
2. Reviewing, analyzing, and drawing conclusions from the data

6.1 Introduction to Machine Learning
6.2 Introduction to linear regression, predictive modeling, simple linear regression vs multiple linear regression, concepts, formulas, assumptions, and residuals in Linear Regression, and building a simple linear model
6.3 Predicting results and finding the p-value and an introduction to logistic regression
6.4 Comparing linear regression with logistics regression and bivariate logistic regression with multivariate logistic regression
6.5 Confusion matrix the accuracy of a model, understanding the fit of the model, threshold evaluation with ROCR, and using qqnorm() and qqline()
6.6 Understanding the summary results with null hypothesis, F-statistic, and
building linear models with multiple independent variables

Hands-on Exercise:

1. Modeling the relationship within data using linear predictor functions
2. Implementing linear and logistics regression in R by building a model with ‘tenure’ as the dependent variable

7.1 Introduction to logistic regression
7.2 Logistic regression concepts, linear vs logistic regression, and math behind logistic regression
7.3 Detailed formulas, logit function and odds, bivariate logistic regression, and Poisson regression
7.4 Building a simple binomial model and predicting the result, making a confusion matrix for evaluating the accuracy, true positive rate, false positive rate, and threshold evaluation with ROCR
7.5 Finding out the right threshold by building the ROC plot, cross validation, multivariate logistic regression, and building logistic models with multiple independent variables
7.6 Real-life applications of logistic regression

Hands-on Exercise:

1. Implementing predictive analytics by describing data
2. Explaining the relationship between one dependent binary variable and one or more binary variables
3. Using glm() to build a model, with ‘Churn’ as the dependent variable

8.1 What is classification? Different classification techniques
8.2 Introduction to decision trees
8.3 Algorithm for decision tree induction and building a decision tree in R
8.4 Confusion matrix and regression trees vs classification trees
8.5 Introduction to bagging
8.6 Random forest and implementing it in R
8.7 What is Naive Bayes? Computing probabilities
8.8 Understanding the concepts of Impurity function, Entropy, Gini index, and Information gain for the right split of node
8.9 Overfitting, pruning, pre-pruning, post-pruning, and cost-complexity pruning, pruning a decision tree and predicting values, finding out the right number of trees, and evaluating performance metrics

Hands-on Exercise:

1. Implementing random forest for both regression and classification problems
2. Building a tree, pruning it using ‘churn’ as the dependent variable, and building a random forest with the right number of trees
3. Using ROCR for performance metrics

9.1 What is Clustering? Its use cases
9.2 what is k-means clustering? What is canopy clustering?
9.3 What is hierarchical clustering?
9.4 Introduction to unsupervised learning
9.5 Feature extraction, clustering algorithms, and the k-means clustering algorithm
9.6 Theoretical aspects of k-means, k-means process flow, k-means in R, implementing k-means, and finding out the right number of clusters using a scree plot
9.7 Dendograms, understanding hierarchical clustering, and implementing it in R
9.8 Explanation of Principal Component Analysis (PCA) in detail and implementing PCA in R

Hands-on Exercise:

1. Deploying unsupervised learning with R to achieve clustering and dimensionality reduction
2. K-means clustering for visualizing and interpreting results for the customer churn data

10.1 Introduction to association rule mining and MBA
10.2 Measures of association rule mining: Support, confidence, lift, and apriori algorithm, and implementing them in R
10.3 Introduction to recommendation engines
10.4 User-based collaborative filtering and item-based collaborative filtering, and implementing a recommendation engine in R
10.5 Recommendation engine use cases

Hands-on Exercise:

1. Deploying association analysis as a rule-based Machine Learning method
2. Identifying strong rules discovered in databases with measures based on interesting discoveries

Self-paced Course Content

11.1 Introducing Artificial Intelligence and Deep Learning
11.2 What is an artificial neural network? TensorFlow: The computational framework for building AI models
11.3 Fundamentals of building ANN using TensorFlow and working with TensorFlow in R

12.1 What is a time series? The techniques, applications, and components of time series
12.2 Moving average, smoothing techniques, and exponential smoothing
12.3 Univariate time series models and multivariate time series analysis
12.4 ARIMA model
12.5 Time series in R, sentiment analysis in R (Twitter sentiment analysis), and text analysis

Hands-on Exercise:

1. Analyzing time series data
2. Analyzing the sequence of measurements that follow a non-random order to identify the nature of phenomenon and forecast the future values in the series

13.1 Introduction to Support Vector Machine (SVM)
13.2 Data classification using SVM
13.3 SVM algorithms using separable and inseparable cases
13.4 Linear SVM for identifying margin hyperplane

14.1 What is the Bayes theorem?
14.2 What is Naïve Bayes Classifier?
14.3 Classification Workflow
14.4 How Naive Bayes classifier works and classifier building in Scikit-Learn
14.5 Building a probabilistic classification model using Naïve Bayes and the zero probability problem

15.1 Introduction to the concepts of text mining
15.2 Text mining use cases and understanding and manipulating the text with ‘tm’ and ‘stringR’
15.3 Text mining algorithms and the quantification of the text
15.4 TF-IDF and after TF-IDF

View More

Free Career Counselling

We are happy to help you 24/7

Data Science Projects Covered

Peer Learning

Via Intellipaat PeerChat, you can interact with your peers across all classes and batches and even our alumni. Collaborate on projects, share job referrals & interview experiences, compete with the best, make new friends – the possibilities are endless and our community has something for everyone!

Data Science Certification in San Diego

What should I do to unlock Intellipaat’s certificate?

You can unlock your certificate in three simple steps:

  1. Complete the Data Science course in San Diego along with the given assignments
  2. Work on several industry-based projects and execute the same successfully
  3. Pass the certification exam

The Data Science certification you receive from Intellipaat is valid for your entire lifetime and is recognized by top organizations across the world.

On successfully completing the Data Science course and passing the Data Science training certification exam, you will receive Intellipaat’s Data Science certificate via our Learning Management System. You can download or share your certificate from this through either email or LinkedIn.

Yes. The Data Science certification issued by Intellipaat is industry-recognized. Besides, due to our affiliation with IBM, you will also receive a Data Science course completion certificate from IBM.

View More

Data Scientist Course in San Diego Reviews

5 ( 10,847 )

Our Alumni Work At

Master Client Desktop Master Client Mobile

Data Science Training in San Diego FAQs

Why should I enroll in this Data Scientist course in San Diego from Intellipaat?

Intellipaat offers an exclusive Data Science course in San Diego for professionals who want to expand their knowledge base and start a career in this field. There are many reasons for choosing Intellipaat:

  • A personal mentor to track your progress
  • Immersive online instructor-led sessions conducted by SMEs
  • Extensive LMS, allowing you to view recorded sessions within 3 hours
  • Real-time exercises, assignments, and projects
  • 24/7 learning support
  • Large community of like-minded learners
  • Industry-recognized Intellipaat badge
  • Personalized job support

Intellipaat offers one of the leading Data Science courses in San Diego that will help you to learn about practical implementations on real-time live projects. The courses like Artificial Intelligence, Data Analytics, Machine Learning, R Certification, Python for Data Science, Python, Business Analytics and others are comprehensive and tailor-made to suit you as per the industry’s needs.

Learn Data Science from the free blogs available. The top resources are Interview Questions, Tutorial and Data Science basics.

At Intellipaat, you can enroll in either the instructor-led online training or self-paced training. Apart from this, Intellipaat also offers corporate training for organizations to upskill their workforce. All trainers at Intellipaat have 12+ years of relevant industry experience, and they have been actively working as consultants in the same domain, which has made them subject matter experts. Go through the sample videos to check the quality of our trainers.

Intellipaat is offering the 24/7 query resolution, and you can raise a ticket with the dedicated support team at anytime. You can avail of the email support for all your queries. If your query does not get resolved through email, we can also arrange one-on-one sessions with our trainers.

You would be glad to know that you can contact Intellipaat support even after the completion of the training. We also do not put a limit on the number of tickets you can raise for query resolution and doubt clearance.

Intellipaat is offering you the most updated, relevant, and high-value real-world projects as part of the training program. This way, you can implement the learning that you have acquired in real-world industry setup. All training comes with multiple projects that thoroughly test your skills, learning, and practical knowledge, making you completely industry-ready.

You will work on highly exciting projects in the domains of high technology, ecommerce, marketing, sales, networking, banking, insurance, etc. After completing the projects successfully, your skills will be equal to 6 months of rigorous industry experience.

Intellipaat actively provides placement assistance to all learners who have successfully completed the training. For this, we are exclusively tied-up with over 80 top MNCs from around the world. This way, you can be placed in outstanding organizations such as Sony, Ericsson, TCS, Mu Sigma, Standard Chartered, Cognizant, and Cisco, among other equally great enterprises. We also help you with the job interview and résumé preparation as well.

You can definitely make the switch from self-paced training to online instructor-led training by simply paying the extra amount. You can join the very next batch, which will be duly notified to you.

Once you complete Intellipaat’s training program, working on real-world projects, quizzes, and assignments and scoring at least 60 percent marks in the qualifying exam, you will be awarded Intellipaat’s course completion certificate. This certificate is very well recognized in Intellipaat-affiliated organizations, including over 80 top MNCs from around the world and some of the Fortune 500companies.

Apparently, no. Our job assistance program is aimed at helping you land in your dream job. It offers a potential opportunity for you to explore various competitive openings in the corporate world and find a well-paid job, matching your profile. The final decision on hiring will always be based on your performance in the interview and the requirements of the recruiter.

View More
Select Currency