Use as a feature dimension reduction tool
- after performing SVD, you get the most significant singular values and vectors:
- If you use a technique to automate feature generation (e.g. tf-idf vectors), you should use this to reduce the dimensionality of those generated features
Use as a denoiser:
- Itβs a technique to denoise target variables
- https://www.kaggle.com/code/ambrosm/scp-quickstart?scriptVersionId=144293041&cellId=8
- kinda like log transforming y in linear regression to make the target variable have constant variance
- except this is for matrices of values
- Since SVD is applied to a matrix, only use this when you are predicting on multiple target variables
- PLEASE REMEMBER TO INVERSE TRANSFORM YHAT WHEN INFERENCING
e.g.