Normalize data before or after split of training and testing data?

Question

1 Answer

Anurag · Answer 1 · 2019-07-29T10:47:24+0000

Split the data into training and test sets first, then normalize.

Test data points stand in for real-world data the model has not seen. Feature normalization (standardization) of the explanatory (or predictor) variables centers and scales the data by subtracting the mean and dividing by the standard deviation (the square root of the variance). If you compute the mean and standard deviation over the whole dataset, information from the test set leaks into the training explanatory variables.

So perform feature normalization over the training data only. Then normalize the test instances as well, but using the mean and standard deviation computed from the training explanatory variables. That way the test set stays unseen, and we can evaluate whether the model generalizes well to new data points.
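Here is a minimal sketch of that order of operations, using only the Python standard library. The toy data, the 80/20 split, and the helper names `fit_scaler` and `transform` are assumptions made up for illustration, not part of any library.

```python
import random
import statistics


def fit_scaler(rows):
    """Compute per-feature mean and standard deviation from the given rows."""
    columns = list(zip(*rows))
    means = [statistics.fmean(col) for col in columns]
    # Population standard deviation; use 1.0 for constant features
    # so that we never divide by zero.
    stds = [statistics.pstdev(col) or 1.0 for col in columns]
    return means, stds


def transform(rows, means, stds):
    """Standardize rows with statistics that were computed elsewhere."""
    return [[(x - m) / s for x, m, s in zip(row, means, stds)] for row in rows]


# Hypothetical toy dataset: 100 samples, 2 features on different scales.
rng = random.Random(0)
data = [[rng.gauss(50, 10), rng.gauss(0.5, 0.1)] for _ in range(100)]

# 1. Split first.
rng.shuffle(data)
split = int(0.8 * len(data))
train, test = data[:split], data[split:]

# 2. Fit the normalization statistics on the training set only.
means, stds = fit_scaler(train)

# 3. Apply the same training statistics to both sets.
train_scaled = transform(train, means, stds)
test_scaled = transform(test, means, stds)
```

After this, each training feature has mean 0 and standard deviation 1, while the test features will be close to that but generally not exact, which is expected. If you use scikit-learn, the same pattern is to call `fit` on a `StandardScaler` with the training data and then `transform` on both sets.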

Hope this answer helps you!
