How to merge multiple feature vectors in DataFrame?

asked Jul 23, 2019 in Machine Learning by ParasSharma1 (19k points)

Using Spark ML transformers I arrived at a DataFrame where each row looks like this:

Row(object_id, text_features, color_features, type_features)

where text_features is a sparse vector of term weights, color_features is a small 20-element one-hot-encoded dense vector of colors, and type_features is also a one-hot-encoded dense vector of types.

What would be a good approach, using Spark's facilities, to merge these features into one single, large array, so that I can measure things like the cosine distance between any two objects?

machine-learning
data-science
deep-learning

1 Answer

answered Jul 23, 2019 by Anurag (33.1k points)

Use VectorAssembler, a Spark ML transformer that concatenates several numeric or vector columns into a single vector column.

For example:

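import org.apache.spark.ml.feature.VectorAssembler
import org.apache.spark.sql.DataFrame

// Placeholder: replace ??? with your own DataFrame (??? throws NotImplementedError if evaluated).
val df: DataFrame = ???

// Concatenate the three feature columns, in this order, into one vector column.
val assembler = new VectorAssembler()
  .setInputCols(Array("text_features", "color_features", "type_features"))
  .setOutputCol("features")

// Adds a "features" column holding the merged vector for each row.
val transformed = assembler.transform(df)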
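To see the merge on concrete data, here is a minimal sketch that can be pasted into spark-shell or run as a Scala script with Spark on the classpath. The local SparkSession, the two toy rows, the shortened vector sizes (5, 2 and 3 instead of the real ones) and the names toyDf, toyAssembler and merged are assumptions made for illustration; only the column names come from the question.

```scala
import org.apache.spark.ml.feature.VectorAssembler
import org.apache.spark.ml.linalg.{Vector, Vectors}
import org.apache.spark.sql.SparkSession

// Assumption: a local session for experimenting; in spark-shell `spark` already exists.
val spark = SparkSession.builder()
  .master("local[*]")
  .appName("vector-assembler-sketch")
  .getOrCreate()

// Toy data: a sparse text vector plus two small one-hot dense vectors per row.
val toyDf = spark.createDataFrame(Seq(
  (1L, Vectors.sparse(5, Seq((0, 0.5), (3, 1.2))), Vectors.dense(1.0, 0.0), Vectors.dense(0.0, 1.0, 0.0)),
  (2L, Vectors.sparse(5, Seq((1, 0.7))), Vectors.dense(0.0, 1.0), Vectors.dense(1.0, 0.0, 0.0))
)).toDF("object_id", "text_features", "color_features", "type_features")

val toyAssembler = new VectorAssembler()
  .setInputCols(Array("text_features", "color_features", "type_features"))
  .setOutputCol("features")

val assembled = toyAssembler.transform(toyDf)

// Pull out the merged vector for object 1; its length is 5 + 2 + 3 = 10.
val merged = assembled
  .filter("object_id = 1")
  .select("features")
  .head()
  .getAs[Vector](0)

println(s"size = ${merged.size}, values = ${merged.toArray.mkString(", ")}")
```

The input columns are concatenated in the order passed to setInputCols, so the text weights come first, followed by the color and type encodings.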

For more details on VectorAssembler, see the Spark Tutorial.
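Since the goal in the question is to measure cosine distance between two objects, the following is a rough sketch of how that could be computed on two merged vectors. The helpers sparseDot and cosineDistance are hypothetical names written for this example, not Spark API, and returning 1.0 for an all-zero vector is an assumption. The code only needs Spark's local linear algebra classes, not a running session.

```scala
import org.apache.spark.ml.linalg.{Vector, Vectors}

// Hypothetical helper (not part of Spark): dot product that visits only the
// non-zero entries of `a`, so sparse text features stay cheap.
def sparseDot(a: Vector, b: Vector): Double = {
  val sa = a.toSparse
  sa.indices.zip(sa.values).map { case (i, v) => v * b(i) }.sum
}

// Hypothetical helper: cosine distance = 1 - cosine similarity.
// Assumption: an all-zero vector is treated as maximally distant (1.0).
def cosineDistance(a: Vector, b: Vector): Double = {
  require(a.size == b.size, "vectors must have the same length")
  val denom = Vectors.norm(a, 2.0) * Vectors.norm(b, 2.0)
  if (denom == 0.0) 1.0 else 1.0 - sparseDot(a, b) / denom
}

// Two small vectors standing in for rows of the merged "features" column.
val x = Vectors.dense(1.0, 0.0, 1.0, 0.0)
val y = Vectors.sparse(4, Seq((0, 1.0), (1, 1.0)))

// The vectors share one of their two non-zero positions, so the distance is about 0.5.
val d = cosineDistance(x, y)
println(f"cosine distance = $d%.4f")
```

On a DataFrame, the same function could be wrapped in a UDF and applied to pairs of rows, for example after a self-join on the features column; that step is left out of this sketch.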

Hope this answer helps you!

© COPYRIGHT 2011-2024 INTELLIPAAT.COM. ALL RIGHTS RESERVED.

...