Overview
Audience & Prerequisites
Course Outline
Schedule & Fees
Certification
Python for Data Science Training
ITS Python for Data Science training helps you learn the top programming language for the domain of Data Science. You will master the technique of how Python is deployed for Data Science, work with Pandas library for Data Science, do data munging and data cleaning, advanced numeric analysis and more through realworld handson projects and case studies.
Objectives
 Introduction to Python for Data Science
 OOP concepts, expressions and functions
 What is SQLite in Python, operations and classes
 Creating Pig and Hive UDF in Python
 Deploying Python for MapReduce programming
 Realworld Python for Data Science projects
Intended Audience

 Python’s design and libraries provide 10 times productivity compared to C, C++ or Java
 A Senior Python Developer in the United States can earn $102,000 – indeed.com
Python is one of the best programming languages that is used for the domain of Data Science. ITS is offering the definitive Python for Data Science training course for learning Python coding, running it on various systems like Windows, Linux and Mac thus making it one of the highly versatile languages for the domain of Data Analytics. Upon the completion of the training, you will be able to get the best jobs in the Data Science domain for top salaries.
Prerequisite:
You don’t need any specific knowledge to learn Python. Though, a basic knowledge of programming can help
Course Outline Duration: 4 Days
Introduction to Data Science
What is Data Science, what does a data scientist do, various examples of Data Science in the industries and how Python is deployed for Data Science applications, various steps in Data Science process like data wrangling, data exploration and selecting the model.
Introduction to Python
Introduction to Python programming language, important Python features, how is Python different from other programming languages, Python installation, Anaconda Python distribution for Windows, Linux and Mac, how to run a sample Python script, Python IDE working mechanism, running some Python basic commands, Python variables, data types and keywords.
Handson Exercise – Installing Python Anaconda for the Windows, Linux and Mac
Python basic constructs
Introduction to a basic construct in Python, understanding indentation like tabs and spaces, code comments like Pound # character, names and variables, Python builtin data types like containers (list, set, tuple and dict), numeric (float, complex, int), text sequence (string), constants (true, false, ellipsis) and others (classes, instances, modules, exceptions and more), basic operators in Python like logical, bitwise, assignment, comparison and more, slicing and the slice operator, loop and control statements like break, if, for, continue, else, range() and more.
Handson Exercise – Write your first Python program, write a Python function (with and without parameters), use Lambda expression, write a class, create a member function and a variable, create an object and write a for loop to print all odd numbers
OOPs in Python
Understanding the OOP paradigm like encapsulation, inheritance, polymorphism and abstraction, what are access modifiers, instances, class members, classes and objects, function parameter and return type functions, Lambda expressions.
Handson Exercise – Writing a Python program and incorporating the OOP concepts
NumPy for mathematical computing
Introduction to mathematical computing in Python, what are arrays and matrices, array indexing, array math, Inspecting a numpy array, Numpy array manipulation,
Handson Exercise – How to import NumPy module, creating array using NDarray, calculating standard deviation on array of numbers and calculating correlation between two variables.
SciPy for scientific computing
Introduction to SciPy, building on top of NumPy, what are the characteristics of SciPy, various subpackages for SciPy like Signal, Integrate, Fftpack, Cluster, Optimize, Stats and more, Bayes Theorem with SciPy.
Handson Exercise: Importing of SciPy, applying the Bayes theorem on the given dataset.
Data manipulation
What is a data Manipulation. using Pandas library for data manipulation, NumPy dependency of Pandas library, Series object in pandas, Dataframe in Pandas, loading and handling data with Pandas, how to merge data objects, concatenation and various types of joins on data objects, exploring dataset, Cleaning dataset, Manipulating dataset, Visualizing dataset
Handson Exercise – Doing data manipulation with Pandas by handling tabular datasets that includes variable types like float, integer, double and others.
Data visualization with Matplotlib
Introduction to Visualization, Introduction to Matplotlib, Using Matplotlib for plotting graphs and charts like Scatter, Bar, Pie, Line, Histogram and more, Matplotlib API, Subplots and Pandas builtin data visualization.
Handson Exercise – Deploying Matplotlib for creating pie, scatter, line and histogram.
Machine Learning using Python
Revision of topics in Python (Pandas, Matplotlib, NumPy, scikitLearn), Introduction to machine learning, need of Machine learning, types of machine learning, workflow of Machine Learning, Uses Cases in Machine Learning, its various arlogrithms, What is supervised learning, What is Unsupervised Learning,
Handson Exercise – Demo on ML algorithms
Supervised learning
What is supervised learning, What is linear regression, Step by step calculation of Linear Regression, Linear regression in Python, Logistic Regression, What is classification, Decision Tree, Confusion Matrix, Random Forest, Naïve Bayes classifier (Self paced), Support Vector Machine(self paced), xgboost(self paced)
Handson Exercise – Using Python library ScikitLearn for coming up with Random Forest algorithm to implement supervised learning.
Unsupervised Learning
Introduction to unsupervised learning, use cases of unsupervised learning, What is clustering, Types of clustering(selfpaced)Exclusive clustering, Overlapping Clustering, Hierarchical Clustering(selfpaced), What is Kmeans clustering, understanding the Kmeans clustering algorithm, Step by step calculation of kmeans algorithm, Demo on kmeans using Scikit , Association Rule Mining(selfpaced), Market Basket Analysis(selfpaced), Measures in association rule mining(selfpaced)support, confidence, lift, Apriori Algorithm, Demo on Apriori
Handson Exercise – Setting up the Jupyter notebook environment, loading of a dataset in Jupyter, algorithms in ScikitLearn package for performing Machine Learning techniques and training a model to search a grid.
Python integration with Spark(selfpaced)
Introduction to pyspark, who uses PySpark, need of spark with python, basics of pysark, Pyspark in industry, pySpark installation, pySpark fundamentals, advantage over mapreduce, pySpark Usecases, and pySpark demo.
Handson Exercise: Demonstrating Loops and Conditional Statements, Tuple – related operations, properties, list, etc., list – operations, related properties, set – properties, associated operations, dictionary – operations, related properties.
Please write to us at [email protected] & contact us at +919015100600 for the course price & certification cost, schedule & location
For more info kindly Contact Us.