The huge gradients coming from a big multiplication target will probably push the net almost immediately into some horrifying state where all of its hidden nodes have zero gradient. We can use two approaches:
1) Divide by a constant. We simply divide the targets by a constant before training and multiply the predictions back after (see the first sketch below this list).
2) Use log-normalization. Taking logs turns multiplication into addition (see the second sketch below this list):
m = x*y => ln(m) = ln(x) + ln(y).
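
A minimal sketch of approach 1, assuming a PyTorch setup; the constant SCALE, the toy data range, and the tiny MLP are all illustrative choices, not the original setup:

```python
import torch
import torch.nn as nn

SCALE = 10_000.0  # assumed constant covering the largest target product

# toy data: learn m = x * y for x, y in [0, 100], so m can reach 10_000
inputs = torch.rand(1024, 2) * 100
targets = (inputs[:, 0] * inputs[:, 1]).unsqueeze(1)

net = nn.Sequential(nn.Linear(2, 64), nn.ReLU(), nn.Linear(64, 1))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

for _ in range(2000):
    opt.zero_grad()
    # divide everything by the constant before learning ...
    loss = nn.functional.mse_loss(net(inputs), targets / SCALE)
    loss.backward()
    opt.step()

# ... and multiply the prediction back after
with torch.no_grad():
    print(net(torch.tensor([[30.0, 70.0]])) * SCALE)  # should be near 2100
```

Keeping the targets in a small range keeps the loss and its gradients small, so the hidden units are far less likely to be knocked into the zero-gradient regime.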
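A minimal sketch of approach 2 (log-normalization), under the same illustrative assumptions: the net sees ln(x) and ln(y) and is trained to predict ln(m) = ln(x) + ln(y), which is then exponentiated to recover the product:

```python
import torch
import torch.nn as nn

# toy data: strictly positive inputs so the log is defined
inputs = torch.rand(1024, 2) * 100 + 1e-3
targets = (inputs[:, 0] * inputs[:, 1]).unsqueeze(1)

log_inputs = inputs.log()
log_targets = targets.log()  # ln(m) = ln(x) + ln(y), a much easier function to fit

net = nn.Sequential(nn.Linear(2, 64), nn.ReLU(), nn.Linear(64, 1))
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

for _ in range(2000):
    opt.zero_grad()
    loss = nn.functional.mse_loss(net(log_inputs), log_targets)
    loss.backward()
    opt.step()

with torch.no_grad():
    test = torch.tensor([[30.0, 70.0]])
    print(net(test.log()).exp())  # should be near 30 * 70 = 2100
```

Note that this trick only applies directly to positive inputs; zeros or negative factors need separate handling (e.g. tracking the sign outside the net).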