Data Analysis in Python
 
 

Course Code: PYPDA

Duration: 5 days

Our training courses can also be delivered at a location of your choice...

Data Analysis in Python Course Overview

This course aims to equip delegates with a substantial knowledge of Python libraries (NumPy, Pandas, Matplotlib and others) and data analysis techniques, enabling them to engineer enterprise-level solutions in a data-driven environment.

Content in General:

The Pandas library, with its data preparation and analysis features, will be our ultimate focus. After familiarizing ourselves with its two data structures, the Series and the DataFrame, we will use the latter to read, manipulate and generally process tabular data sourced from Excel, CSV and other file formats. Before that, however, we will thoroughly familiarize ourselves with the NumPy library, not only because it is the foundation of Pandas but also because it offers powerful tools for numerical calculations and forms the basis of practically all of Python's data science libraries. We will explore its vectorized functions and basic linear algebra features, and use its random module to demonstrate sampling from different distributions.
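As a rough illustration of the ideas in the paragraph above, the following minimal sketch samples two distributions with NumPy, applies a vectorized calculation, and wraps the results in a Pandas DataFrame. The seed, sample sizes, distribution parameters and column names are arbitrary choices made for this example; they are not taken from the course material.

```python
import numpy as np
import pandas as pd

# Seeded generator so the sketch is reproducible (the seed is arbitrary).
rng = np.random.default_rng(seed=42)

# Sample 1,000 values from each of two different distributions.
normal_sample = rng.normal(loc=50.0, scale=10.0, size=1_000)
uniform_sample = rng.uniform(low=0.0, high=100.0, size=1_000)

# Vectorized arithmetic: one expression acts on every element, no Python loop.
z_scores = (normal_sample - normal_sample.mean()) / normal_sample.std()

# A DataFrame is a table of labelled columns; selecting one column gives a Series.
df = pd.DataFrame({"normal": normal_sample, "uniform": uniform_sample})
normal_column = df["normal"]

print(type(normal_column).__name__)  # Series
print(df.shape)                      # (1000, 2)
```

The standardized values in `z_scores` have a mean of approximately 0 and a standard deviation of approximately 1, which is a quick way to check that the vectorized expression did what was intended.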

Statistics, at least in its descriptive form, must be an integral part of any meaningful data analysis course. We will therefore learn the various classifications of data and the summary statistics applicable to each. We will also discuss, and explore by example, the strengths and weaknesses of the various statistical summaries.
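One commonly cited weakness of the mean, for instance, is its sensitivity to extreme values, whereas the median is more robust. The sketch below shows this with Python's standard `statistics` module; the salary figures are invented purely for illustration.

```python
import statistics

# Hypothetical salaries (in thousands); the last value is an extreme outlier.
salaries = [28, 30, 31, 33, 35, 36, 250]

mean_salary = statistics.mean(salaries)
median_salary = statistics.median(salaries)

print(f"mean   = {mean_salary:.1f}")  # mean   = 63.3
print(f"median = {median_salary}")    # median = 33

# Dropping the outlier moves the mean a long way but barely moves the median.
trimmed = salaries[:-1]
print(f"mean without outlier   = {statistics.mean(trimmed):.1f}")  # 32.2
print(f"median without outlier = {statistics.median(trimmed)}")    # 32.0
```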

Visualisation is another vital component of data analysis. To paraphrase, a graph is worth a thousand words. In this course we will learn to choose the most appropriate visualisation for a given data set and use Matplotlib (and Seaborn) to produce bar charts, pie charts, histograms, box plots, scatter plots and line graphs.
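To make the idea of matching the plot to the data concrete, here is a minimal Matplotlib sketch that draws a histogram for numerical data and a bar chart for categorical counts. The data are synthetic, and the variable names, titles and the choice of the non-interactive `Agg` backend are assumptions made so that the example runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; not needed inside a notebook
import matplotlib.pyplot as plt
import numpy as np

rng = np.random.default_rng(seed=0)
heights = rng.normal(loc=170, scale=8, size=200)  # synthetic numerical data
counts = {"A": 12, "B": 7, "C": 15}               # synthetic categorical counts

fig, (ax_hist, ax_bar) = plt.subplots(1, 2, figsize=(8, 3))

# Numerical data: a histogram shows the shape of the distribution.
ax_hist.hist(heights, bins=15)
ax_hist.set_title("Histogram: numerical data")
ax_hist.set_xlabel("height (cm)")

# Categorical data: a bar chart compares the count in each category.
ax_bar.bar(list(counts.keys()), list(counts.values()))
ax_bar.set_title("Bar chart: categorical data")

fig.tight_layout()
```

In a Jupyter notebook the figure would normally render inline; elsewhere, `fig.savefig(...)` with a file name of your choosing writes it to disk.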

Finally, for our programming environment, we will use Jupyter Notebook (or Jupyter Lab, according to preference) on the Anaconda platform. This is a widely used interactive environment in Python's data science ecosystem.

Approach:

We believe in learning-by-doing, so we have taken an integrated, problem-solving approach to delivering our training. The course is broken into sessions, each centred on a few related core concepts and skills. The relevant background is discussed at the beginning of the session, in a just-in-time approach. This is followed by illustrative examples, which include the introduction of library features, syntax and semantics. For the remainder, which makes up most of the session, the delegates are expected to solve relevant problems of graduated difficulty. Example solutions will be available for the delegates to take away at the end of the course.

This approach is effective as it integrates the learning of statistical theory, library features and Python language syntax, increasing retention by providing meaningful context for each. Immediate practice also helps delegates cement their understanding of concepts on which we build gradually.

The delegate will learn and acquire skills as follows:

Data Analysis Python

  • NumPy
    • Create and manipulate NumPy arrays and matrices
    • Generate random numbers from various distributions
    • Use NumPy vectorized functions
    • Read array data from various common file formats
  • Pandas
    • Understand the composition, relation and main features of Pandas Series and DataFrame structures
    • Read data from CSV, JSON, the web and relational databases into DataFrames and Series
    • Data Cleaning and Preparations
    • Data Wrangling: Join, Combine and Reshape
    • Data Aggregation and Group operations
    • Read CSV, Excel and other format data into Pandas DataFrame objects
    • Clean, group, manipulate and summarise tabular data using Pandas data processing features
  • Visualisation with Matplotlib (and Seaborn)
    • Plot
      • Bar, Column and Pie charts
      • box-plots
      • histograms
      • scatterplots and line-plots
  • Other
    • Use Jupyter Notebook and Jupyter Lab with the Anaconda distribution
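
Several of the Pandas skills listed above (reading, cleaning, grouping and joining) can be sketched in a few lines. The CSV text, column names and price table below are hypothetical stand-ins invented for this example, and filling a missing value with zero is only one of several possible cleaning policies.

```python
import io
import pandas as pd

# Hypothetical CSV content standing in for a file on disk.
csv_text = """region,product,units
North,apples,10
North,pears,
South,apples,7
South,pears,5
"""
sales = pd.read_csv(io.StringIO(csv_text))

# Cleaning: treat the missing unit count as zero (an assumption for this sketch).
sales["units"] = sales["units"].fillna(0).astype(int)

# Aggregation: total units sold per region.
totals = sales.groupby("region")["units"].sum()

# Wrangling: join a second (hypothetical) table of prices on the shared key.
prices = pd.DataFrame({"product": ["apples", "pears"], "price": [0.5, 0.8]})
merged = sales.merge(prices, on="product", how="left")
merged["revenue"] = merged["units"] * merged["price"]

print(int(totals["North"]))  # 10
print(int(totals["South"]))  # 12
```

With a real file, `pd.read_csv` would be given a path instead of the in-memory `io.StringIO` buffer used here.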

Statistics

  • Distinguish between different data types
  • Summarize Categorical and Numerical Data
  • Calculate basic descriptive statistical measures such as
    • Measures of Central Tendency:
      • Mean
      • Median
      • Mode
    • Measures of Dispersion:
      • Variance
      • Standard deviation
      • Quantiles
  • Understand the advantages and disadvantages of the various summary statistics
  • Decide on the best visual representation of any presented data
  • Understand Bivariate data and perform Correlation and basic Linear Regression
  • Produce various visual representations (or plots) of data
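
For the bivariate items in the list above, the following minimal sketch computes a Pearson correlation coefficient and fits a least-squares line with NumPy. The data points and variable names are made up for illustration, and `np.polyfit` is just one of several ways to fit a straight line.

```python
import numpy as np

# Hypothetical bivariate data: hours studied vs. exam score.
hours = np.array([1, 2, 3, 4, 5], dtype=float)
score = np.array([52, 55, 61, 64, 68], dtype=float)

# Pearson correlation coefficient (off-diagonal entry of the 2x2 matrix).
r = np.corrcoef(hours, score)[0, 1]

# Simple linear regression: least-squares fit of score = slope * hours + intercept.
slope, intercept = np.polyfit(hours, score, deg=1)

print(f"r         = {r:.3f}")          # r         = 0.994
print(f"slope     = {slope:.2f}")      # slope     = 4.10
print(f"intercept = {intercept:.2f}")  # intercept = 47.70
```

A value of r close to 1 indicates a strong positive linear relationship in this sample; it does not by itself show that one variable causes the other.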

Who will the Course Benefit?

This course will benefit anyone who requires a solid practical foundation in Data Analysis, including descriptive statistics and visualisation in Python.

Course Objectives

This course aims to provide the delegate with the knowledge to be able to:

  • Determine the type of data at hand and decide on the most appropriate analysis and visualisation
  • Perform numerical calculations using the Python NumPy library
  • Use Pandas to read, explore, manipulate and process tabular data from various sources, including Excel, CSV and JSON files and relational databases
  • Visualise and generally explore data using Matplotlib and Seaborn
  • Carry out descriptive statistical summaries on data in Python
  • Interpret graphs and statistical results correctly

Requirements

This course has two requirements, programming experience and mathematical knowledge.

To fulfil the programming pre-requisites, our Python Programming 1 course, or its equivalent, is required. Exceptions could be made for delegates with extensive experience in a different programming language that includes object-oriented concepts.

To fulfil the Mathematics pre-requisites, A-Level is ideal, but GCSE level is required at a minimum. Delegates will be expected to understand simple formulas, percentages and proportions, and to interpret simple graphs.

 
 

RRP: £2,995.00 exc. VAT 


Upcoming Scheduled Events (next 12 months)

Virtual and London schedule. We are running a full Virtual schedule.

Start dates:

  • 11 May 2020
  • 06 Jul 2020
  • 07 Sep 2020
  • 02 Nov 2020

Classroom Events: You can attend in our traditional classroom setting at one of our Training Centres.
In the current climate, attendance at a Training Centre will be reviewed at the time of the request.

Closed Courses: These closed company events can be offered in your own setting or using our Live Virtual Classroom solution.
Delegates can attend from your business location, their home or a location of their choice. A great way to provide training to a group of delegates in a Team environment, even when they are in diverse locations, nationally or even internationally.
 
 
+44 (0)20 7600 6116
Enquiries@StayAhead.com
Copyright © 2020 StayAhead Training Ltd