Manqing's Website
Docs
Manqing's Website
Blog
Docs
About
LinkedIn
Search
Search
Cancel
Loading search index…
No recent searches
No results for "
Query here
"
Title here
Date here
Summary here
Overview
Machine Learning
Basic Concepts
Data
Model
Model Overview
Linear and Logistic Regression
Bayes' Theorem and Naive Bayes
k-Means Clustering
Principal Component Analysis (PCA)
Support Vector Machines
Tree-based Methods
Ensemble Learning
Training
Evaluation
Home
Docs
Machine Learning
Data
Data
Key topics:
Data leakage: identify common sources and prevention.
Splits: IID vs. temporal splits; stratification.
Feature engineering: categorical handling, normalization, encoding.
Imbalance: resampling, class weights, thresholding.
Prev
Basic Concepts
Next
Model Overview