in Machine Learning by (19k points)
Is it possible to use Apache Mahout without any dependency on Hadoop?

I would like to use the Mahout algorithms on a single computer by including only the Mahout library in my Java project. I don't want to use Hadoop at all, since I will be running on a single node anyway.

Is that possible?

1 Answer

by (33.1k points)

You should simply make each dimension (or attribute, or column) have zero mean and unit variance.
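As a rough illustration of the zero-mean, unit-variance idea, here is a minimal Java sketch. The class name `Standardizer` and its `standardize` method are hypothetical helpers written for this example, not part of Mahout or any SVM library, and the sketch assumes a dense, non-empty `double[][]` with one row per sample. It divides by the population standard deviation; dividing by the sample standard deviation would be an equally reasonable choice.

```java
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of any library.
public class Standardizer {

    /** Returns a copy of data in which every column has zero mean and unit variance. */
    public static double[][] standardize(double[][] data) {
        int rows = data.length;
        int cols = data[0].length;
        double[][] out = new double[rows][cols];

        for (int j = 0; j < cols; j++) {
            // Mean of column j.
            double sum = 0.0;
            for (int i = 0; i < rows; i++) {
                sum += data[i][j];
            }
            double mean = sum / rows;

            // Population standard deviation of column j.
            double squared = 0.0;
            for (int i = 0; i < rows; i++) {
                double diff = data[i][j] - mean;
                squared += diff * diff;
            }
            double std = Math.sqrt(squared / rows);

            // A constant column has std == 0; leave it centred rather than divide by zero.
            double divisor = (std == 0.0) ? 1.0 : std;

            for (int i = 0; i < rows; i++) {
                out[i][j] = (data[i][j] - mean) / divisor;
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Two attributes on very different scales.
        double[][] data = {{1.0, 1000.0}, {2.0, 2000.0}, {3.0, 3000.0}};
        for (double[] row : standardize(data)) {
            System.out.println(Arrays.toString(row));
        }
    }
}
```

After the transform, both columns in the toy data end up on the same scale even though the raw values differ by a factor of 1000.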

Check this PDF, which explains how to bring each dimension of the SVM input to the same magnitude.

The main advantage of scaling is to avoid attributes in larger numeric ranges dominating those in smaller numeric ranges. Another advantage is to avoid numerical difficulties during the calculation. 

Because kernel values usually depend on the inner products of feature vectors (e.g. the linear kernel and the polynomial kernel), large attribute values might cause numerical problems. We recommend linearly scaling each attribute to the range [-1,+1] or [0,1].
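The linear scaling described above could be sketched in Java roughly as follows. `MinMaxScaler` is a hypothetical class written for this example, not an API from Mahout or an SVM library. It assumes the per-attribute minimum and maximum are learned once from the training data and then reused for any later vectors, so that training and test data are scaled the same way.

```java
import java.util.Arrays;

// Hypothetical helper for illustration only; not part of any library.
public class MinMaxScaler {
    private final double[] min;
    private final double[] max;
    private final double lower;
    private final double upper;

    /** Learns per-attribute minimum and maximum from the training rows. */
    public MinMaxScaler(double[][] train, double lower, double upper) {
        int cols = train[0].length;
        this.min = new double[cols];
        this.max = new double[cols];
        this.lower = lower;
        this.upper = upper;
        Arrays.fill(min, Double.POSITIVE_INFINITY);
        Arrays.fill(max, Double.NEGATIVE_INFINITY);
        for (double[] row : train) {
            for (int j = 0; j < cols; j++) {
                min[j] = Math.min(min[j], row[j]);
                max[j] = Math.max(max[j], row[j]);
            }
        }
    }

    /** Linearly maps each attribute of x from [min, max] to [lower, upper]. */
    public double[] transform(double[] x) {
        double[] out = new double[x.length];
        for (int j = 0; j < x.length; j++) {
            double range = max[j] - min[j];
            // A constant attribute carries no information; map it to the lower bound.
            out[j] = (range == 0.0)
                    ? lower
                    : lower + (upper - lower) * (x[j] - min[j]) / range;
        }
        return out;
    }

    public static void main(String[] args) {
        double[][] train = {{0.0, 10.0}, {5.0, 20.0}, {10.0, 30.0}};
        MinMaxScaler scaler = new MinMaxScaler(train, -1.0, 1.0);
        for (double[] row : train) {
            System.out.println(Arrays.toString(scaler.transform(row)));
        }
    }
}
```

Passing `0.0` and `1.0` as the bounds gives the [0,1] variant. Note that a new vector whose values fall outside the training range will map outside the target interval; this sketch does not clip such values.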

Hope this answer helps you!

