0 votes
2 views
in AI and Deep Learning by (50.2k points)

Recently, I stumbled across this article, and I was wondering what the difference between the results you would get from a recurrent neural net, like the ones described above, and a simple Markov chain would be.

I don't understand the linear algebra happening under the hood in an RNN, but it seems that you are just designing a super convoluted way of making a statistical model for what the next letter is going to be based on the previous letters, something that is done very simply in a Markov Chain.

Why are RNNs interesting? Is it just because they are a more generalizable solution, or is there something happening that I am missing?

1 Answer

0 votes
by (108k points)

A Markov chain relies on the Markov property: it is "memoryless". The probability of the next symbol is computed from only the k previous symbols. In practice, k is limited to small values (say 3-5), because the transition table grows exponentially with k. As a result, sentences generated by an order-k Markov model tend to be very incoherent.
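To make this concrete, here is a minimal sketch (in plain Python, names of my own choosing) of an order-k character-level Markov model. Note that the number of possible contexts is |alphabet|^k, which is why k must stay small:

```python
import random
from collections import defaultdict, Counter

def build_model(text, k):
    # Transition counts: each k-character context maps to a count of
    # the characters observed immediately after it. The number of
    # possible contexts grows as |alphabet|**k.
    model = defaultdict(Counter)
    for i in range(len(text) - k):
        context = text[i:i + k]
        model[context][text[i + k]] += 1
    return model

def generate(model, seed, length):
    # Sample the next character given only the last k characters;
    # anything earlier in the output has no influence (memorylessness).
    k = len(seed)
    out = seed
    for _ in range(length):
        choices = model.get(out[-k:])
        if not choices:
            break
        chars, counts = zip(*choices.items())
        out += random.choices(chars, weights=counts)[0]
    return out
```

For example, trained on "abcabcabd" with k=2, the context "ab" is followed by 'c' twice and 'd' once, so generation after "ab" picks 'c' with probability 2/3 regardless of anything seen before those two characters.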

On the other hand, RNNs (e.g. with LSTM units) are not restricted by the Markov property. Their rich internal state enables them to keep track of long-distance dependencies.
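The contrast can be seen in the RNN recurrence itself. The sketch below (a bare vanilla-RNN step in NumPy, with made-up sizes and random weights for illustration) shows that each new hidden state mixes the current input with the entire previous state, so information from arbitrarily far back can, in principle, persist:

```python
import numpy as np

def rnn_step(x, h, Wxh, Whh, b):
    # One recurrence step: the new hidden state depends on the current
    # input AND the previous hidden state, so there is no fixed-k window.
    return np.tanh(Wxh @ x + Whh @ h + b)

rng = np.random.default_rng(0)
vocab, hidden = 4, 8
Wxh = rng.normal(scale=0.1, size=(hidden, vocab))  # input-to-hidden weights
Whh = rng.normal(scale=0.1, size=(hidden, hidden)) # hidden-to-hidden weights
b = np.zeros(hidden)

# Feed a sequence of one-hot symbols through the recurrence.
h = np.zeros(hidden)
for idx in [0, 1, 2, 3]:
    x = np.eye(vocab)[idx]
    h = rnn_step(x, h, Wxh, Whh, b)
```

Unlike the Markov model's lookup table, the state h is a fixed-size real-valued vector, so its capacity does not blow up with context length; LSTM units add gating on top of this recurrence to make long-distance dependencies easier to learn in practice.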

The foremost advantages of a recurrent neural network (RNN) over Markov chains and hidden Markov models are the greater representational power of neural networks and their ability to perform effective smoothing by taking syntactic and semantic features into account.
