We use logistic regression to predict the value of the categorical outcome, But I believe linear regression is used to predict the output value given the input values.

Please explain the difference between these two methodologies.

+2 votes

We use logistic regression to predict the value of the categorical outcome, But I believe linear regression is used to predict the output value given the input values.

Please explain the difference between these two methodologies.

+2 votes

@varsha , the basic difference between logistic regression and linear regression are as follows:

1. Linear regression is a regression model which means it will give continuous output. Logistic regression is a binary classification algorithm which means it will give discrete output.

2. In Linear Regression, residuals are assumed to be normally distributed whereas, Logistic Regression, residuals need to be independent but not normally distributed.

3.Linear Regression is used when the response variable in continuous, Logistic Regression is used when the response variable is categorical.

4. Linear Regression gives an degree 1 equation: y=mx+c , Logistic Regression gives an equation of the form: Y=e^x + e^-x.

5. In Linear Regression the coefficient interpretation is straightforward whereas, in Logistic Regression it depends on log ,inverse-log, binomial, etc.

0 votes

Linear regression is useful for finding the relationship between two continuous variables. One is a predictor or independent variable and the other is a response or dependent variable.

The main task of linear regression is to obtain a line that best fits the data. The best fit line is the one for which total prediction error (all data points) is as small as possible. Error is the distance between the point to the regression line. You can implement linear regression in scikit-learn from sklearn.linear_model class.

**Code:**

from sklearn.linear_model import LinearRegression

cls = LinearRegression()

cls.fit(x_train,y_train)

cls.predict(X_test)

**Logistic regression:**

Logistic Regression is used when the dependent variable(target) is categorical.

For example,

To predict whether an email is spam (1) or not (0)

Whether the tumor is malignant (1) or not (0)

**Code**:

from sklearn.linear_model import LogisticRegression

clf = LogisticRegression()

clf.fit(X_train, y_test)

clf.predict()

**Difference between error minimization techniques:**

Linear regression is usually solved by minimizing the least-squares error of the model to the data, therefore large errors are penalized quadratically.

Logistic regression is just the opposite. Using the logistic loss function causes large errors to be penalized to an asymptotically constant.

Learn more about Logistic Regression by watching this video tutorial:

You can go through this video to learn about Multiple Linear Regression: