Statistics

Course Code: STATS

Duration: 4 days

 
 
 
 

Statistics Course Overview

This Statistics course aims to address the growing need for a practical and systematic introduction to statistics at work. It is designed to give delegates a strong foundation in classical statistics, both descriptive and inferential, that will allow them not only to accurately interpret statistical results but also equip them with the requisite knowledge to design and carry out powerful statistical tests. The foundations will be laid to enable delegates to characterize the data they have, decide on the best summary and visual representations and, most appropriate tests to carry out. These are vital skills in the age of data.

Content in General

With the statistical content we cover in this course, delegates will acquire a general framework that will enable them to decide-on and perform a relatively wide range of common statistical procedures and tests on data. We will start by identifying different categories of data and the various descriptive statistical measures we can perform on them. This will be followed by inferential statistics which will include population and sampling distributions, confidence intervals and hypothesis testing. We will also cover concepts of probability, including distributions, on a just-in-time basis as required by our principal focus, statistical and inference.

Approach:

We believe in learning-by-doing, so we have taken a problem-solving approach to delivering our training. The course is broken into sessions, each centred on a few related core concepts and skills. The relevant theory is discussed at the beginning of the session, in a just-in-time approach. This is followed by an illustrative example. For the second half, which will generally be most of the session, the delegates are expected to solve relevant problems of graduated difficulty. Immediate practice helps delegates consolidate their understanding of concepts on which we build gradually. Example solutions will be available for the delegates to take away at the end of the course. We use the statistical language R and Python (with various relevant packages). So, delegates are expected to be proficient in one of the programming languages.

Course Objectives

This course aims to provide the delegate with the knowledge to be able to:

  • Calculate basic descriptive statistical measures such as
    • Mean, Median, Mode
    • Variance, Standard Deviation and Quartiles
  • Understand the framework of classical statistical inference
  • Understand and use discrete and continuous probability distributions including
    • Binomial, Bernoulli, Poisson, Normal, t-Distribution, Chi-Square, F-Distribution
  • Understand and/or calculate
    • Sampling bias and strategies
    • Population and sample distributions
    • Point estimates and confidence intervals
  • Design, choose and perform a hypothesis test
  • Understand and perform basic linear regression
  • Produce various visual representation (or plots) of data

Who will the Course Benefit?

This course will benefit anyone who works or intends to work with data. This course also offers a solid statistical foundation to professional or aspiring Data Scientists.

Skills Gained

The delegate will learn and acquire skills as follows:

  • Calculate measure on central tendency
  • Calculate variance and range of data
  • Summarize data
  • Produce graphs (boxplots, barcharts, scatter plots etc.)
  • Simulate sampling distributions
  • Calculate statistical estimates
  • Calculate confidence intervals
  • Formulate and perform statistical tests
  • Calculate correlation
  • Perform linear regression

Statistics Training Course

Course Introduction

  • Administration and Course Materials
  • Course Structure and Agenda
  • Delegate and Trainer Introductions

Session 1: INTRODUCTION

  • What is Statistics and Why?
  • Statistical traditions: Classical and Bayes
  • Data types and classification

Session 2: SUMMARY STATISTICS FOR CATEGORICAL DATA

  • Absolute Frequency (Count)
  • Relative Frequency
  • Grouped Frequency
  • Cumulative Frequency

Session 3: SUMMARY STATISTICS FOR NUMERICAL DATA; LOCATION

  • Measures of central tendency
    • Mean
    • Median
    • Mode

Session 4: MEASURES OF DISPERSION AND GRAPHICAL REPRESENTATION

  • Measures of dispersion
    • Range (Full, Interquartile etc.)
    • Variance and Standard Deviation
  • Graphical representation
    • Univariate
      • Discrete data, Frequency plots, Histograms
      • Continuous data, dotplot, stem-and-leaf plot, histograms, boxplots
    • Bivariate
      • scatter plot
      • line plot

Statistics Training Course

Session 5: DISCRETE NUMERICAL VARIABLE

  • Example of coin toss
    • Is my coin biased?
    • How confident am I? (Binomial distribution)
  • Discrete probability distributions (PMF)
    • Bernoulli
    • Binomial
    • Poisson

Session 6: CONTINUOUS NUMERICAL VARIABLE: POPULATION MEAN

  • Terminology: Population vs Sample
  • Sample distribution
  • Mean value theorem and Normal distribution
  • Point estimate of mean
  • Standard error and confidence interval

Session 7: SAMPLING CONSIDERATIONS

  • Sampling bias
  • Sampling strategies
  • Sampling statistics distribution

Statistics Training Course

Session 8: CONTINUOUS PROBABILITY DISTRIBUTIONS

  • Normal and Student-T distributions
  • Chi-Squared Distribution
  • F-Distribution

Session 9: HYPOTHESIS TESTING FOR NUMERICAL DATA

  • Single population
    • Test against a hypothesized mean (Z / T - Test)
    • Type-I and Type-II errors
    • Test against a hypothesized variance (Chi-Squared Test)
  • Two populations
    • Test a difference in mean
    • Power calculations
    • Paired data
  • Comparison of variances
  • ANOVA

Statistics Training Course

Session 10: HYPOTHESIS TESTING FOR CATEGORICAL DATA

  • Test for a proportion
  • Difference of proportions
  • Goodness of Fit
  • Test for independence

Session 11: REGRESSION; BIVARIATE DATA

  • Correlation
  • Linear regression
Notes:
  • Course technical content is subject to change without notice.
  • Course content is structured as sessions, this does not strictly map to course timings. Concepts, content and practicals often span sessions.

Requirements

Mathematics: Ideally A-Levels but at a minimum GCSE level is required, Delegates will be expected to understand simple formulae, percentages, proportions and limits, and interpret simple formulae and graphs. They will also be expected to perform basic algebraic manipulations, add and subtract fractions.

Programming: Delegates must also have an ability to program in Python or R up to and including defining and using functions. Our Introduction to Programming (in Python) course or our R Programming course will cover all programming requirements for this course.

Pre-Requisite Courses

Public Scheduled Events

Classroom & Live Virtual Instructor-Led Training

Duration: 4 days

Price: £2,195.00 exc. VAT 

Please contact us to review your requirements and schedule a date.

 

Live Virtual Classroom

 
Join live instructor-led classroom training from the comfort of your home or office.
All the convenience and benefits of the classroom experience without the hassle and costs of travel and accommodation.
 
 



Our Customers Include

 
EDF
Amazon
American Express
Aviva
QA
BAE
University of Cambridge
Barnardo's
Scottish Government
Bauer
Bloomberg
BP
HSBC
DVLA
GlaxoSmithKline
Government Campus
Capita
Tui
NHS
Ordnance Survey
Ministry of Defence
Zurich Insurance Group
trainline
Vodafone
 
 



Our Course Curriculum

 
 
 
+44 (0)20 7600 6116
Enquiries@StayAhead.com
Copyright © 2024 StayAhead Training Ltd
Cookies   /   Privacy Policy