Skip to content

Tung M Phung's Blog

  • Home
  • About me

Tag: statistics

Exploratory Data Analysis, Machine Learning - Data MiningLeave a comment

On ensuring fairness: Statistical parity vs Causal graphs

July 31, 2021July 31, 2021 Tung.M.Phung

We tackle the problem of ensuring fairness in machine learning, from using the traditional statistical parity to exploiting a causal network. Continue reading On ensuring fairness: Statistical parity vs Causal graphs

Feature Engineering, Machine Learning - Data MiningLeave a comment

Principal Component Analysis fully explained

March 13, 2020December 30, 2020 Tung.M.Phung

This article attempts to make PCA crystal clear to anyone who wishes to understand it thoroughly, step-by-step, in both high and low-level concepts. Continue reading Principal Component Analysis fully explained

Machine Learning - Data Mining, StatisticsLeave a comment

Unpaired Two-sample T-test (Independent T-test)

January 21, 2020July 31, 2020 Tung.M.Phung

What is an Unpaired 2-sample T-test? Let’s analyze this definition from scratch. Continue reading Unpaired Two-sample T-test (Independent T-test)

Machine Learning - Data Mining, StatisticsLeave a comment

Paired Two-sample T-test (Dependent T-test)

January 18, 2020July 31, 2020 Tung.M.Phung

What is a Paired 2-sample T-test? Let’s analyze this definition from scratch. Continue reading Paired Two-sample T-test (Dependent T-test)

Machine Learning - Data Mining, Statistics4 Comments

Z-score on a sample set

October 13, 2019July 31, 2020 Tung.M.Phung

We take a set of samples from a given Normal distribution. How extreme is this set? Continue reading Z-score on a sample set

Machine Learning - Data Mining, Statistics2 Comments

Z-score, Z-statistic, Z-test, Z-distribution

October 12, 2019July 31, 2020 Tung.M.Phung

Z-score (together with Z-test, Z-distribution, Z-statistic, etc.) is a very frequently used term from statistics being applied in Machine Learning. Continue reading Z-score, Z-statistic, Z-test, Z-distribution