Difference between Dense and Activation layer in Keras

Question

2 Answers

Shaleen2110 · Answer 1 · 2019-07-31T09:46:54+0000

The best practice is to avoid using the softmax function in the hidden layers of a neural net. Softmax turns a layer's outputs into a probability for each label, with every value in the range (0, 1), so it is generally used only at the last layer of the network.

Moreover, if you use Dense(activation=softmax), Keras internally creates a dense layer first, applies softmax on top of it, and returns the result directly. You won't be able to retrieve the raw outputs of the last layer; you will get their probabilities instead.
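To make the point about probabilities concrete, here is a minimal plain-Python sketch of what softmax does to a layer's raw outputs. It does not use Keras; the `softmax` helper and the example `logits` values are assumptions made up for illustration.

```python
import math

def softmax(logits):
    """Map a list of raw scores to probabilities that sum to 1."""
    shift = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical raw outputs of a dense layer (illustrative values only).
logits = [2.0, 1.0, 0.1]
probs = softmax(logits)

# Each value lies strictly between 0 and 1, and together they sum to 1.
print(probs, sum(probs))

# Adding a constant to every raw output gives the same probabilities,
# so the raw outputs cannot be recovered from the probabilities alone.
shifted_probs = softmax([z + 5.0 for z in logits])
print(shifted_probs)
```

Because different raw outputs can map to the same probabilities, a model that only returns the softmax result does not let you read back the original values.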

Hope this helps. For more details, the Neural Network Tutorial is a useful resource on this topic.

JaneShaw · Answer 2 · 2019-07-31T11:10:24+0000

Using Dense(activation=softmax) is computationally equivalent to first adding a Dense layer and then adding Activation(softmax). However, the second approach has one advantage: you can retrieve the outputs of the last layer (before activation) from a model defined this way. With the first approach, those pre-activation values are not exposed as a separate layer output.
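The two approaches can be compared with a rough sketch like the one below. It assumes TensorFlow's bundled Keras is installed; the layer sizes, the layer names "logits" and "probs", and the random input are arbitrary choices for illustration, not part of the original answers.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

# Approach 1: softmax fused into the Dense layer.
fused = keras.Sequential([
    keras.Input(shape=(4,)),
    layers.Dense(3, activation="softmax"),
])

# Approach 2: a linear Dense layer followed by a separate Activation layer.
inputs = keras.Input(shape=(4,))
logits = layers.Dense(3, name="logits")(inputs)
probs = layers.Activation("softmax", name="probs")(logits)
split = keras.Model(inputs, probs)

# Copy the weights across so both models compute the same function.
split.get_layer("logits").set_weights(fused.layers[-1].get_weights())

# A second model that shares the same layers but stops before the activation.
logit_model = keras.Model(inputs, logits)

x = np.random.rand(2, 4).astype("float32")
fused_out = fused.predict(x, verbose=0)        # probabilities
split_out = split.predict(x, verbose=0)        # same probabilities
raw_logits = logit_model.predict(x, verbose=0) # pre-activation outputs

print(np.allclose(fused_out, split_out, atol=1e-6))
print(raw_logits)
```

With matching weights, both models should return the same probabilities, while only the second definition gives a direct handle on the pre-activation tensor.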

For more details on dense activation, refer to the Machine Learning Courses by Intellipaat.
