ANOVA in python using pandas dataframe with statsmodels or scipy?

Question

1 Answer

vinita · Answer 1 · 2021-01-06T10:07:48+0000

I have set up a direct observation to test them, found that their opinions can vary, below is an illustration of ANOVA on a pandas dataframe resembling R's results:

import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols
# R code on an R sample dataset
#> anova(with(ChickWeight, lm(weight ~ Time + Diet)))
#Analysis of Variance Table
#
#Response: weight
# Df Sum Sq Mean Sq F value Pr(>F)
#Time 1 2042344 2042344 1576.460 < 2.2e-16 ***
#Diet 3 129876 43292 33.417 < 2.2e-16 ***
#Residuals 573 742336 1296
#write.csv(file='ChickWeight.csv', x=ChickWeight, row.names=F)
cw = pd.read_csv('ChickWeight.csv')
cw_lm=ols('weight ~ Time + C(Diet)', data=cw).fit() #Specify C for Categorical
print(sm.stats.anova_lm(cw_lm, typ=2))
# sum_sq df F PR(>F)
#C(Diet) 129876.056995 3 33.416570 6.473189e-20
#Time 2016357.148493 1 1556.400956 1.803038e-165
#Residual 742336.119560 573 NaN NaN

Kick-start your career in Python with the perfect Python online course now!

ANOVA in python using pandas dataframe with statsmodels or scipy?

ANOVA in python using pandas dataframe with statsmodels or scipy?

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Related questions

Browse Categories

Popular Courses

Top Tutorials

Top Articles

Top Interview Questions