Normal equation and Numpy 'least-squares', 'solve' methods difference in regression?

Question

asked Jul 23, 2019 in Machine Learning by ParasSharma1 (19k points)

I am doing linear regression with multiple variables/features. I compute the thetas (coefficients) in three ways: the normal equation (which uses an explicit matrix inverse), NumPy's least-squares tool numpy.linalg.lstsq, and np.linalg.solve. My data has n = 143 features and m = 13000 training examples.

For the normal equation method with regularization I use this formula (written here as it is implemented in the code below):

theta = (X^T X + lambda * I)^-1 X^T y

Regularization is used to avoid the potential problem of matrix non-invertibility (the XtX matrix may be singular).

Data preparation code:

import pandas as pd
import numpy as np
path = 'DB2.csv'
data = pd.read_csv(path, header=None, delimiter=";")
data.insert(0, 'Ones', 1)
cols = data.shape[1]
X = data.iloc[:,0:cols-1]
y = data.iloc[:,cols-1:cols]
IdentitySize = X.shape[1]
IdentityMatrix = np.eye(IdentitySize)

For the least-squares method I use NumPy's numpy.linalg.lstsq. Here is the Python code:

lamb = 1
th = np.linalg.lstsq(X.T.dot(X) + lamb * IdentityMatrix, X.T.dot(y))[0]

I also used NumPy's np.linalg.solve:

lamb = 1
XtX_lamb = X.T.dot(X) + lamb * IdentityMatrix
XtY = X.T.dot(y)
x = np.linalg.solve(XtX_lamb, XtY)

For the normal equation I use:

lamb = 1
xTx = X.T.dot(X) + lamb * IdentityMatrix
XtX = np.linalg.inv(xTx)
XtX_xT = XtX.dot(X.T)
theta = XtX_xT.dot(y)

The normal equation, least-squares, and np.linalg.solve approaches give somewhat different results. Why do these three approaches give noticeably different results, and which one is more efficient and more accurate?

1 Answer

Anurag · Answer 1 · 2019-07-23T11:17:24+0000

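You don't need the matrix inverse to solve a linear system. Computing it explicitly is slower than solving the system directly and introduces avoidable rounding error. All three approaches target the same system, so the differences you see come from floating-point error, which grows with the condition number of the matrix.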

Mathematically, the solution is written as:

x = A^-1 * b

but in code you should solve the system directly instead:

x = np.linalg.solve(A, b)
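
To illustrate the point, here is a minimal sketch comparing the two on a deliberately ill-conditioned system. The 10x10 Hilbert matrix and the right-hand side are assumptions chosen for illustration and have nothing to do with the question's data; the exact numbers printed will vary by platform.

```python
import numpy as np

# Assumption: a 10x10 Hilbert matrix stands in for an ill-conditioned A.
n = 10
idx = np.arange(1, n + 1)
A = 1.0 / (idx[:, None] + idx[None, :] - 1)

# Build b from a known solution so the system is consistent.
x_true = np.ones(n)
b = A @ x_true

# Explicit inverse followed by a matrix-vector product.
x_inv = np.linalg.inv(A) @ b

# Direct solve (LAPACK LU factorization with partial pivoting).
x_solve = np.linalg.solve(A, b)

# Residual norms ||A x - b|| measure how well each x satisfies the system.
res_inv = np.linalg.norm(A @ x_inv - b)
res_solve = np.linalg.norm(A @ x_solve - b)

print("condition number:", np.linalg.cond(A))
print("residual via inv  :", res_inv)
print("residual via solve:", res_solve)
```

On systems like this the residual from solve is typically the smaller of the two, although that is not guaranteed for every matrix.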

In your case, you want something like:

XtX_lamb = X.T.dot(X) + lamb * IdentityMatrix
XtY = X.T.dot(y)
x = np.linalg.solve(XtX_lamb, XtY)
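
As a sanity check, the three approaches from the question can be run side by side on synthetic data. The sketch below is only an illustration: the random data, the sizes, and the helper names (ridge_inv, ridge_solve, ridge_lstsq) are hypothetical and not from the question. It also shows one possible way to apply lstsq to the data matrix itself, by stacking sqrt(lamb) * I under X, rather than applying it to the normal equations.

```python
import numpy as np

def ridge_inv(X, y, lamb):
    """Regularized normal equation with an explicit inverse (comparison only)."""
    A = X.T @ X + lamb * np.eye(X.shape[1])
    return np.linalg.inv(A) @ X.T @ y

def ridge_solve(X, y, lamb):
    """Same linear system, solved directly without forming the inverse."""
    A = X.T @ X + lamb * np.eye(X.shape[1])
    return np.linalg.solve(A, X.T @ y)

def ridge_lstsq(X, y, lamb):
    """Least squares on the augmented system [X; sqrt(lamb) I] theta = [y; 0]."""
    n = X.shape[1]
    X_aug = np.vstack([X, np.sqrt(lamb) * np.eye(n)])
    y_aug = np.concatenate([y, np.zeros(n)])
    return np.linalg.lstsq(X_aug, y_aug, rcond=None)[0]

# Assumption: small, well-conditioned synthetic data with a column of ones.
rng = np.random.default_rng(0)
m, n = 500, 10
X = np.hstack([np.ones((m, 1)), rng.normal(size=(m, n))])
y = X @ rng.normal(size=n + 1) + 0.1 * rng.normal(size=m)
lamb = 1.0

theta_inv = ridge_inv(X, y, lamb)
theta_solve = ridge_solve(X, y, lamb)
theta_lstsq = ridge_lstsq(X, y, lamb)

A = X.T @ X + lamb * np.eye(X.shape[1])
print("condition number:", np.linalg.cond(A))
print("max |inv - solve|  :", np.max(np.abs(theta_inv - theta_solve)))
print("max |lstsq - solve|:", np.max(np.abs(theta_lstsq - theta_solve)))
```

On well-conditioned data like this the three results should agree closely. If they disagree noticeably on real data, checking np.linalg.cond on the regularized matrix is a reasonable first step, and rescaling the features may help.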


Hope this answer helps you!
