- I think encoder and decoder don’t have to be the same type of neural network
- Question answering. I applied this approach to the Conversational Question Answering (CoQA) dataset. Perplexity was 1.6. Not too bad for a very simple model.
I’ve seen Encoder/Decoder approaches used in recommender system applications before.