0 votes
2 views
in Data Science by (17.6k points)

I'm working with a table of reviews containing the columns review/userId, product/productId, and review/score (table screenshot omitted).

I would like to pivot it using:

user_product_rating = df.pivot_table(index='review/userId', columns='product/productId', values='review/score')

The problem is that there are 80k records in the original df, and both Google Colab and my computer run out of RAM. Is there an efficient way to achieve the same result?

Edit: The data I'm using is Cell_Phones_&_Accessories.txt.gz. I can't time the pivot; it always crashes.

1 Answer

0 votes
by (41.4k points)

Here, you can do these two things:

1. The 'review/score' column is not a numeric type, so convert it to a numeric dtype first; that alone reduces the memory the pivot needs (see the sketch after the code below).

 

2. You can use groupby instead, since your pivot table only runs a single aggregation (the mean score per user and product):

 

user_product_rating = df.groupby(['review/userId', 'product/productId'])['review/score'].mean()
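
Putting both suggestions together, here is a minimal sketch. It assumes df already holds the three columns named in the question (parsing Cell_Phones_&_Accessories.txt.gz into a DataFrame is not shown); the small sample frame and its values are only stand-ins for illustration:

import pandas as pd

# Stand-in frame; in practice df comes from parsing the review file.
df = pd.DataFrame({
    "review/userId": ["u1", "u1", "u2"],
    "product/productId": ["p1", "p2", "p1"],
    "review/score": ["5.0", "3.0", "4.0"],   # scores often load as strings
})

# 1. Convert the score column to a compact numeric dtype.
df["review/score"] = pd.to_numeric(df["review/score"], errors="coerce", downcast="float")

# 2. Average the score per (user, product) pair; this yields a MultiIndex
#    Series instead of the dense user x product matrix pivot_table builds.
user_product_rating = df.groupby(["review/userId", "product/productId"])["review/score"].mean()

# Only materialize the dense matrix if you really need it:
# rating_matrix = user_product_rating.unstack()

If you do need the wide user-by-product matrix later, call .unstack() on a filtered subset of users or products so the dense result stays small.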
