Remember

Register

All Courses Ask a Question

Questions
Unanswered
Ask a Question
Blog
Tutorials
Interview Questions

Back

Login

Explore Courses Blog Tutorials Interview Questions

community
Data Science
Scikit-learn train_test_split with indices

Scikit-learn train_test_split with indices

Scikit-learn train_test_split with indices

0 votes

2 views

asked Aug 17, 2019 in Data Science by sourav (17.6k points)

How do I get the original indices of the data when using train_test_split()?

What I have is the following

from sklearn.cross_validation import train_test_split
import numpy as np
data = np.reshape(np.randn(20),(10,2)) # 10 training examples
labels = np.random.randint(2, size=10) # 10 labels
x1, x2, y1, y2 = train_test_split(data, labels, size=0.2)

But this does not give the indices of the original data. One workaround is to add the indices to data (e.g. data = [(i, d) for i, d in enumerate(data)]) and then pass them inside train_test_split and then expand again. Are there any cleaner solutions?

scikit-learn
python
scipy
classification

Please log in to add a comment.

Please log in to answer this question.

1 Answer

0 votes

answered Aug 17, 2019 by Shlok Pandey (41.4k points)

You can use pandas dataframes or series:

from sklearn.model_selection import train_test_splitimport numpy as npn_samples, n_features, n_classes = 10, 2, 2data = np.random.randn(n_samples, n_features) # 10 training exampleslabels = np.random.randint(n_classes, size=n_samples) # 10 labelsindices = np.arange(n_samples)x1, x2, y1, y2, idx1, idx2 = train_test_split( data, labels, indices, test_size=0.2)

Please log in to add a comment.

Related questions

0 votes

1 answer

Scikit-learn, get accuracy scores for each class

asked Jul 26, 2019 in Machine Learning by ParasSharma1 (19k points)

machine-learning
artificial-intelligence
python
deep-learning
scikit-learn

0 votes

2 answers

How to convert a column or row matrix to a diagonal matrix in Python?

asked Oct 3, 2019 in Python by Tech4ever (20.3k points)

python
numpy
matrix
scipy

0 votes

1 answer

Unable to obtain accuracy score for my linear

asked Dec 6, 2020 in Python by laddulakshana (16.4k points)

machine-learning
python
scikit-learn

0 votes

2 answers

Why is the Cross Entropy method preferred over Mean Squared Error? In what cases does this doesn't hold up?

asked Jun 28, 2019 in Machine Learning by Sammy (47.6k points)

python
machine-learning
data-science
scikit-learn
string

+5 votes

7 answers

"How to fix: 'only integers, slices (`:`), ellipsis (`…`), numpy.newaxis (`None`) and integer or boolean arrays are valid indices'?

asked Jul 27, 2019 in Data Science by sourav (17.6k points)

python
machine-learning
data-science
linear-regression
prediction

1.2k questions

2.7k answers

501 comments

693 users

All categories
Python (132)
Java (165)
SQL (251)
Linux (14)
Big Data Hadoop & Spark (67)
Data Science (75)
R Programming (49)
C Programming (7)
DevOps and Agile (162)
AI and Deep Learning (32)
Machine Learning (9)
AWS (54)
Azure (26)
GCP (4)
RPA (2)
Selenium (12)
Blockchain (1)
Salesforce (24)
Others (12)
BI (30)
Web Technology (57)
Digital Marketing (3)
Technology Trends (6)

Browse Categories

Master Program
Big Data
Data Science
Business Intelligence
Salesforce
Cloud Computing Courses
Digital Marketing
Database
Programming
Testing
Project Management
Web Development Courses

© COPYRIGHT 2011-2024 INTELLIPAAT.COM. ALL RIGHTS RESERVED.

...