Classification Models, Deep Learning, Exploratory Data Analysis, Feature Engineering, Machine Learning - Data Mining1 Comment

Case study: Machine Learning and Deep Learning for Knowledge Tracing in Programming Education

May 8, 2022May 14, 2022 Tung.M.Phung

Applying Machine Learning and Deep Learning to solve the Knowledge Tracing problem in the context of Programming classrooms. Continue reading Case study: Machine Learning and Deep Learning for Knowledge Tracing in Programming Education

Deep Learning, Feature Engineering, Machine Learning - Data MiningLeave a comment

Transforming everything to vectors with Deep Learning: from Word2Vec, Node2Vec, to Code2Vec and Data2Vec

April 17, 2022March 26, 2023 Tung.M.Phung

Let us discuss the state-of-the-art methods for transforming every kind of input data into fixed-length vectors of continuous values, including Word2Vec, Doc2Vec, Image2Vec, Node2Vec, Edge2Vec, Code2Vec, and Data2Vec. Continue reading Transforming everything to vectors with Deep Learning: from Word2Vec, Node2Vec, to Code2Vec and Data2Vec

Exploratory Data Analysis, Feature Engineering, Machine Learning - Data Mining2 Comments

A survey of correlation analysis methods

May 24, 2021May 24, 2021 Tung.M.Phung

A summary of popular methods to analyze the dependency between variables. Continue reading A survey of correlation analysis methods

Feature Engineering, Machine Learning - Data MiningLeave a comment

Principal Component Analysis fully explained

March 13, 2020December 30, 2020 Tung.M.Phung

This article attempts to make PCA crystal clear to anyone who wishes to understand it thoroughly, step-by-step, in both high and low-level concepts. Continue reading Principal Component Analysis fully explained

Feature Engineering, Machine Learning - Data MiningLeave a comment

When to add a dummy variable?

December 19, 2019July 31, 2020 Tung.M.Phung

A dummy variable is a variable (or feature, predictor, column) whose values can be either 0 or 1. Continue reading When to add a dummy variable?

Feature Engineering, Machine Learning - Data Mining3 Comments

How to convert Categorical Variables to Numerical Variables

December 18, 2019July 31, 2020 Tung.M.Phung

In Machine Learning, while some predictive models allow categorical variables in the data, most require all predictor variables to be continuous Continue reading How to convert Categorical Variables to Numerical Variables

Feature Engineering, Machine Learning - Data MiningLeave a comment

When to do feature centering, scaling and normalization?

December 5, 2019July 31, 2020 Tung.M.Phung

Many people have a tendency to always do feature centering, scaling or normalizing right before applying predictive models to the data… Continue reading When to do feature centering, scaling and normalization?

Feature Engineering, Machine Learning - Data MiningLeave a comment

Feature selection with sklearn

September 4, 2019July 31, 2020 Tung.M.Phung

Feature selection is hard but very important. Continue reading Feature selection with sklearn

Feature Engineering, Machine Learning - Data MiningLeave a comment

How to deal with missing values (NaNs)

August 31, 2019July 31, 2020 Tung.M.Phung

This blog post attempts to address why NaNs are bad and how we can fix them. Continue reading How to deal with missing values (NaNs)