Add RNN for VAD and speech/music classification

The network consists of two dense layers with a GRU layer between them.
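
For readers unfamiliar with this layout, here is a minimal sketch of a dense -> GRU -> dense forward pass. It is not the code from this commit: the layer sizes, the tanh/sigmoid activations, the random weights, the GRU gate convention, and every name (`make_params`, `gru_step`, `run`) are assumptions made for illustration. The two outputs stand in for a voice-activity score and a speech/music score.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, x):
    # Plain matrix-vector product on nested lists.
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def dense(W, b, x, act):
    # Fully connected layer: act(W x + b).
    return [act(s + bi) for s, bi in zip(matvec(W, x), b)]


def gru_step(p, x, h):
    # One GRU update. Convention assumed here: h' = z*h + (1-z)*candidate.
    z = [sigmoid(a + u + b) for a, u, b in
         zip(matvec(p["Wz"], x), matvec(p["Uz"], h), p["bz"])]
    r = [sigmoid(a + u + b) for a, u, b in
         zip(matvec(p["Wr"], x), matvec(p["Ur"], h), p["br"])]
    rh = [ri * hi for ri, hi in zip(r, h)]
    cand = [math.tanh(a + u + b) for a, u, b in
            zip(matvec(p["Wh"], x), matvec(p["Uh"], rh), p["bh"])]
    return [zi * hi + (1.0 - zi) * ci for zi, hi, ci in zip(z, h, cand)]


def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def make_params(n_in, n_dense, n_gru, n_out, seed=0):
    # Hypothetical random weights; a real model would load trained ones.
    rng = random.Random(seed)
    gru = {}
    for g in ("z", "r", "h"):
        gru["W" + g] = rand_matrix(rng, n_gru, n_dense)
        gru["U" + g] = rand_matrix(rng, n_gru, n_gru)
        gru["b" + g] = [0.0] * n_gru
    return {
        "W1": rand_matrix(rng, n_dense, n_in), "b1": [0.0] * n_dense,
        "gru": gru,
        "W2": rand_matrix(rng, n_out, n_gru), "b2": [0.0] * n_out,
    }


def run(params, frames):
    # Process feature frames in order; the GRU state carries over between frames.
    h = [0.0] * len(params["gru"]["bz"])
    outputs = []
    for x in frames:
        d = dense(params["W1"], params["b1"], x, math.tanh)
        h = gru_step(params["gru"], d, h)
        outputs.append(dense(params["W2"], params["b2"], h, sigmoid))
    return outputs
```

Calling `run(make_params(4, 8, 6, 2), frames)` on a list of 4-value feature frames returns one pair of scores in (0, 1) per frame. Because the GRU state persists across frames, each decision can depend on earlier audio as well as the current frame.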
11 files changed