Why is weight vector orthogonal to decision plane in neural networks

Question

1 Answer

Anurag · Answer 1 · 2019-07-02T06:22:57+0000

You need to understand that training a neural network as a perceptron requires a weight update on each iteration, which improves accuracy. The weights are just the coefficients that represent a separating plane. For the moment, forget about neurons and just consider the geometric definition of a plane in N dimensions:

w1*x1 + w2*x2 + ... + wN*xN - w0 = 0

You can also think of this as being a dot product:

w*x - w0 = 0

where w is weight and x is input vector.

This equation is used for all points on the plane. Recall that we can multiply the above equation by a constant and it still holds so we can define constants such that the vector w has unit length. Next, draw a line (a plane in 2D) somewhere near the origin. w0 is simply the perpendicular distance from the origin to the plane and w is the unit vector that points from the origin along that perpendicular. If you now draw a vector from the origin to any point on the plane, the dot product of that vector with the unit vector w will always be equal to w0 so the equation above holds, right? This is simply the geometric definition of a plane: a unit vector defining the perpendicular to the plane (w) and the distance (w0) from the origin to the plane.

I am getting a little beyond your actual question, we don't really care about points on the plane. We really want to know which side of the plane a point falls on. While w*x - w0 is exactly zero on the plane, it will have positive values for points on one side of the plane and negative values for points on the other side. That's where the neuron's activation function comes in but that's beyond your actual question.

Why is weight vector orthogonal to decision plane in neural networks

1 Answer

Related questions

Browse Categories