Yongchao Huang

Understanding the MC estimator

A note on the Monte Carlo method

Posted on September 10, 2020

Motivation: Recently I revisited the classic Monte Carlo fundamentals (random number generators, importance and stratified sampling, QMC, etc.) and found inconsistent, sometimes confusing, explanations of the MC estimator, so I decided to write down my own understanding. [Read More]

Tags: math

Data splitting, CV, and resampling

The logically justified approach to data processing

Posted on August 19, 2020

Motivation: Two common mistakes that many machine learning theorists and practitioners make are (1) imputing missing values and standardizing features before splitting the data, and (2) resampling imbalanced data before cross-validation. Here I want to show, with statistical evidence, why the order of these steps matters. [Read More]
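To make the first point concrete, here is a minimal sketch of the "split first, then preprocess" order, using only the Python standard library. It is an illustration under stated assumptions rather than the code from the post: the function name `split_then_standardize`, the toy data, mean imputation, and the 25% test fraction are all hypothetical choices made for the example.

```python
import random
import statistics


def split_then_standardize(values, test_fraction=0.25, seed=0):
    """Split first, then impute and standardize with training statistics only.

    Hypothetical helper for illustration; `None` marks a missing value.
    """
    rng = random.Random(seed)
    shuffled = values[:]
    rng.shuffle(shuffled)

    # 1. Split BEFORE any statistic is computed.
    n_test = max(1, int(len(shuffled) * test_fraction))
    test, train = shuffled[:n_test], shuffled[n_test:]

    # 2. Learn the imputation value from the training part only.
    fill = statistics.fmean(v for v in train if v is not None)
    train_filled = [fill if v is None else v for v in train]
    test_filled = [fill if v is None else v for v in test]

    # 3. Learn the standardization parameters from the training part only.
    mu = statistics.fmean(train_filled)
    sigma = statistics.pstdev(train_filled) or 1.0  # guard against zero spread

    # 4. Apply the training parameters to both parts; the test part
    #    never contributes to fill, mu, or sigma.
    train_z = [(v - mu) / sigma for v in train_filled]
    test_z = [(v - mu) / sigma for v in test_filled]
    return train_z, test_z, fill, mu, sigma


# Toy data (assumed for the example).
data = [1.0, 2.0, None, 4.0, 5.0, 6.0, None, 8.0]
train_z, test_z, fill, mu, sigma = split_then_standardize(data)
```

The same ordering applies to the second point: under this sketch's logic, any resampling of an imbalanced data set would be done inside each cross-validation fold, on the training part of that fold only, so that the held-out part stays untouched.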

Tags: ML