Pandas dataframe group by order

Question

1 Answer

Shlok Pandey · Answer 1 · 2019-07-31T13:14:15+0000

You can use the below line of code:

df1.groupby((df1['Name'] != df1['Name'].shift()).cumsum()).first()

It gives the output as:

Name City
Name
1 Alice Seattle
2 Bob Seattle
3 Mallory Portland
4 Bob Portland
5 Mallory Seattle
6 Alice Seattle

And if you want the 'Name' column, then do this:

df1.groupby((df1['Name'] != df1['Name'].shift()).cumsum())['Name'].first().values

Which will give you the output:

['Alice' 'Bob' 'Mallory' 'Bob' 'Mallory' 'Alice']