Data Science |
Introduction to Data and Statistics |
|
What is Analytics |
|
00:00:00 |
|
CRISP-DM Methodology |
|
00:00:00 |
|
Introduction to supervised and Unsupervised learning |
|
00:00:00 |
All about Data |
|
Basic Statistics – Mean, Medium, Mode, Variance, Standard Deviation (SD) |
|
00:00:00 |
|
Outliers in Data |
|
00:00:00 |
|
Data Sourcing |
|
00:00:00 |
|
Sampling Techniques |
|
00:00:00 |
|
Data Cleanup and Transformation |
|
00:00:00 |
R and R Navigation |
|
Introduction to R / R Studio |
|
00:00:00 |
|
Navigation in R |
|
00:00:00 |
|
Types of variables |
|
00:00:00 |
Data operations in R |
|
Arithmetic operations |
|
00:00:00 |
|
String operations |
|
00:00:00 |
|
Logical |
|
00:00:00 |
|
Date |
|
00:00:00 |
File operations in R |
|
Reading a CSV file |
|
00:00:00 |
|
Reading metadata of a file |
|
00:00:00 |
|
Statistics from a file |
|
00:00:00 |
Data exploration using R |
|
Functions – Mean, Median, Mode, Max, Min, Var, SD |
|
00:00:00 |
|
Functions – Table and Tapply |
|
00:00:00 |
|
Summarization using dply package |
|
00:00:00 |
Visualization concepts |
|
Visualization Basics |
|
00:00:00 |
|
Basic plots in R – Scatter, Bar, Pie, Boxplots, Line, Histogram |
|
00:00:00 |
Advance plotting with GGPLOT2 |
|
Bar |
|
00:00:00 |
|
Scatter plot |
|
00:00:00 |
|
Boxplot with facet_grid |
|
00:00:00 |
|
Raster plot |
|
00:00:00 |
|
Geo Map |
|
00:00:00 |
|
Word Cloud |
|
00:00:00 |
Regression Technique |
|
Introduction to Supervised Learning |
|
00:00:00 |
|
Linear Regression – Single Variable R2 |
|
00:00:00 |
|
Linear Regression – Multiple Variable |
|
00:00:00 |
|
Dealing with categorical variable |
|
00:00:00 |
|
Understanding Residual plot |
|
00:00:00 |
|
Multi-collinearity Identification |
|
00:00:00 |
Decision Tree |
|
Introduction to Decision Trees |
|
00:00:00 |
|
Truth Table / Type 1 and 2 Errors |
|
00:00:00 |
|
Train, Test and Validate |
|
00:00:00 |
|
Model Fit Evaluation |
|
00:00:00 |
|
Tree Tuning and Pruning |
|
00:00:00 |
|
Model Comparisons |
|
00:00:00 |
Ensemble Method-Random Forest |
|
Random Forest – Bagging and Ensemble |
|
00:00:00 |
|
Classification with Random Forest |
|
00:00:00 |
|
Variable Importance Plot |
|
00:00:00 |
|
Mtry and number of trees |
|
00:00:00 |
|
Bias and Variance TradeOff |
|
00:00:00 |
Packages |
|
Package keyword |
|
00:00:00 |
|
Default and protected access modifier |
|
00:00:00 |
|
CLASSPATH and PATH environment variables |
|
00:00:00 |
|
Pre-defined system packages |
|
00:00:00 |
XGBOOST Technique |
|
Classification with XGBoost |
|
00:00:00 |
|
Model Comparisons |
|
00:00:00 |
Dimensionality reduction |
|
Introduction to Principal Component Analysis |
|
00:00:00 |
Machine Learning |
|
Introduction to Neural Network |
|
00:00:00 |
|
Introduction to Support Vector Machine |
|
00:00:00 |
Clustering |
|
Introduction to Clustering |
|
00:00:00 |
|
Kmeans Clustering |
|
00:00:00 |
|
Deciding Cluster Size – Screeplot |
|
00:00:00 |
|
Hierarchical Cluster |
|
00:00:00 |
Market Basket Analysis |
|
Introduction to Association Rules |
|
00:00:00 |
|
Apriori Algorithm |
|
00:00:00 |
|
Support, Confidence and Life |
|
00:00:00 |
Text Mining |
|
Text Mining – Creation of Document Term Matrix |
|
00:00:00 |
|
Creation of a Word Cloud |
|
00:00:00 |
|
Sentiment Analysis |
|
00:00:00 |
|
Text Clustering |
|
00:00:00 |
|
N-Gram Analysis |
|
00:00:00 |
|
Reading Facebook / Twitter |
|
00:00:00 |
Time series Analysis |
|
Time Series Components |
|
00:00:00 |
|
Creating a TS Object |
|
00:00:00 |
|
ARIMA Model for forecasting |
|
00:00:00 |