Data Analysis with Python

Course outline and certification by IBM

Enquire Now

Online Training

Certification offered by


IBM offers a wide range of technology and consulting services; a broad portfolio of middleware for collaboration, predictive analytics, software development and systems management; and the world’s most advanced servers and supercomputers.

Key Points About This Course

Duration: 4 Days
Time: 9.00am-5.00pm
Public Class Fee:RM 4,500.00
Virtual Class Fee: RM 3,825.00
HRDF Claimable

Course Overview

Learn how to analyze data using Python. This course will take you from the basics of Python to exploring many different types of data. You will learn how to prepare data for analysis, perform simple statistical analysis, create meaningful data visualizations, predict future trends from data, and more!

Topics covered:

1) Importing Datasets
2) Cleaning the Data
3) Data frame manipulation
4) Summarizing the Data
5) Building machine learning Regression models
6) Building data pipelines

Data Analysis with Python will be delivered through lecture, lab, and assignments. It includes following parts:

Data Analysis libraries: will learn to use Pandas, Numpy and Scipy libraries to work with a sample dataset. We will introduce you to pandas, an open-source library, and we will use it to load, manipulate, analyze, and visualize cool datasets. Then we will introduce you to another open-source library, scikit-learn, and we will use some of its machine learning algorithms to build smart models and make cool predictions.

Skills You Will Gain

  • Predictive Modelling
  • Python Programming
  • Data Analysis
  • Data Visualization (DataViz)
  • Model Selection

This course is part of multiple programs

This course can be applied to multiple Specializations or Professional Certificates programs. Completing this course will count towards your learning in any of the following programs:

  • IBM Data Science Professional Certificate
  • Applied Data Science Specialization

Course Content

Part 1: Importing Datasets

  • The Problem
  • Understanding the Data
  • Python Packages for Data Science
  • Importing and Exporting Data in Python
  • Getting Started Analyzing Data in Python

Part 2: Data Wrangling

  • Pre-processing Data in Python
  • Dealing with Missing Values in Python
  • Data Formatting in Python
  • Data Normalization in Python
  • Binning in Python
  • Turning categorical variables into quantitative variables in Python

Part 3: Exploratory Data Analysis

  • Exploratory Data Analysis
  • Descriptive Statistics
  • GroupBy in Python
  • Correlation
  • Correlation – Statistics
  • Analysis of Variance ANOVA

Part 4: Model Development

  • Model Development
  • Linear Regression and Multiple Linear Regression
  • Model Evaluation using Visualization
  • Polynomial Regression and Pipelines
  • Measures for In-Sample Evaluation
  • Prediction and Decision Making

Part 5: Model Evaluation

  • Model Evaluation and Refinement
  • Overfitting, Underfitting and Model Selection
  • Ridge Regression
  • Grid Search

Training Schedule

18 – 21 Jan 2021
22 – 25 Mar 2021
17 – 20 May 2021
26 – 29 Jul 2021
20 – 23 Sep 2021
22 – 25 Nov 2021

  • Public Class Training

  • Date Format: DD slash MM slash YYYY
  • Date Format: DD slash MM slash YYYY
  • Your Information

  • Participant List

  • NameContact NumberDesignation 
  • Examination (Optional)

Enquiry Form

Call Us : 03-21165778