Skip to content

Tung M Phung's Blog

  • Home
  • About me

Category: Preparatory Phase

Machine Learning - Data Mining, Preparatory PhaseLeave a comment

Imbalanced Learning: sampling techniques

May 24, 2020April 10, 2022 Tung.M.Phung

Various types of sampling techniques for imbalanced datasets are discussed in depth with examples and analysis. Get yourself familiar with over-/under-sampling, SMOTE, ADA-SYN, sampling with cleaning, boosting, clustering, and more. Continue reading Imbalanced Learning: sampling techniques

Machine Learning - Data Mining, Preparatory PhaseLeave a comment

Control Variable explained

February 17, 2020July 31, 2020 Tung.M.Phung

In statistics and data mining, we often encounter the word ‘control’, mostly from terms like control variables and control groups. In fact, a control variable has slightly different meanings in different fields Continue reading Control Variable explained

Machine Learning - Data Mining, Preparatory PhaseLeave a comment

Splitting data into a Training set and a Validation set

November 23, 2019July 31, 2020 Tung.M.Phung

In a machine learning project, after crawling or collecting data, we have to split it into at least 2 parts: training and validation data. Continue reading Splitting data into a Training set and a Validation set