How embedded memory in recurrent neural network architectures helps learning long-term temporal dependencies