Hello, I found a performance issue in  the definition of generate_minibatch, RNN_LSTM_TensorFlow/Many2OneMultiRNNLSTM.py, tf.cast and tf.round will created repeatedly during program execution, resulting in reduced efficiency. I think it should be created before the loop in generate_minibatch.