NVIDIA TensorRT-LLM Accelerates Encoder-Decoder Model Architectures
NVIDIA recently announced that NVIDIA TensorRT-LLM now accelerates encoder-decoder model architectures. TensorRT-LLM is an open-source library that optimizes inference for diverse model architectures, including the following:
- Decoder-only models, such as Llama 3.1
- Mixture-of-experts (MoE) models, such as Mixtral
- Selective state-space models (SSM), such as Mamba
- Multimodal models for vision-language and video-language applications
In-flight Batching for Encoder-Decoder Architectures
Encoder-decoder models have a different runtime pattern compared with decoder-only models. They use more than one engine (commonly two): the first engine is executed only once per request and has simpler input/output buffers, while the second engine runs auto-regressively and requires more complex handling logic for key-value (KV) cache management and batch management to deliver high throughput at low latency.
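The two-engine pattern above can be sketched in plain Python. The "engines" here are stand-in callables, not real TensorRT engines, and names such as `run_encoder` and `run_decoder_step` are illustrative only; the point is the control flow, where the encoder runs once and the decoder loops per token:

```python
# Minimal sketch of the encoder-decoder runtime pattern (illustrative only).

def run_encoder(input_tokens):
    """Engine 1: executed once per request, producing encoder hidden states."""
    return [[float(tok)] for tok in input_tokens]  # stand-in hidden states

def run_decoder_step(encoder_output, generated, kv_cache):
    """Engine 2: executed auto-regressively, one token per step, reusing the cache."""
    kv_cache.append(len(generated))                     # stand-in for appending K/V
    return len(generated) + len(encoder_output)         # dummy "next token"

def generate(input_tokens, max_new_tokens, eos_token):
    encoder_output = run_encoder(input_tokens)          # runs exactly once
    generated, kv_cache = [], []
    for _ in range(max_new_tokens):                     # runs once per output token
        tok = run_decoder_step(encoder_output, generated, kv_cache)
        generated.append(tok)
        if tok == eos_token:
            break
    return generated
```

Note that the per-step cost stays low only because the decoder reuses the KV cache instead of recomputing attention over all previous tokens.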
Key Extensions for In-Flight Batching (IFB) and KV Cache Management
- Dual paged KV cache management for both the decoder's self-attention cache and the decoder's cross-attention cache, which is computed from the encoder's output.
- Data passing from encoder to decoder, controlled at the LLM request level. When decoder requests are batched in flight, each request's encoder-stage output must be gathered and batched in flight as well.
- Decoupled batching strategy for the encoder and decoder. Because the encoder and decoder can have different sizes and compute properties, requests at each stage should be batched independently and asynchronously.
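The first extension, the dual KV cache, can be sketched as follows. This is a conceptual illustration, not the TensorRT-LLM implementation: the class and method names are hypothetical, and real paged caches store fixed-size blocks of K/V tensors rather than Python lists. The key distinction it shows is that the cross-attention cache is written once from the encoder output and then read-only, while the self-attention cache grows by one entry per generated token:

```python
# Hedged sketch of the dual KV cache for one request (names are illustrative).

class RequestKVCache:
    def __init__(self, encoder_output):
        # Cross-attention K/V: computed once from the encoder output,
        # then only read during decoding.
        self.cross_kv = list(encoder_output)
        # Self-attention K/V: appended to at every decoding step.
        self.self_kv = []

    def step(self, new_kv):
        """Record the K/V entry produced by one auto-regressive decoder step."""
        self.self_kv.append(new_kv)

cache = RequestKVCache(encoder_output=["h0", "h1", "h2"])
for t in range(4):                     # four decoding steps
    cache.step(f"kv{t}")
```

Because the two caches have different lifetimes and growth patterns, managing them in separate paged pools avoids fragmenting one pool with the other's allocations.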
Low-Rank Adaptation Support
Low-rank adaptation (LoRA) is a powerful parameter-efficient fine-tuning (PEFT) technique that enables the customization of LLMs while maintaining impressive performance and minimal resource usage. Instead of updating all model parameters during fine-tuning, LoRA adds small trainable rank decomposition matrices to the model, significantly reducing memory requirements and computational costs.
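The rank decomposition described above can be written out directly. In this NumPy sketch, the frozen weight `W` is augmented by two small trainable matrices `A` and `B` with rank `r` much smaller than the model dimensions, so the effective weight is `W + B @ A`; the matrix sizes are illustrative, and `B` is zero-initialized so the adapted model starts out identical to the base model, as in the original LoRA formulation:

```python
import numpy as np

d_in, d_out, r = 512, 512, 8               # illustrative sizes; r is the LoRA rank
rng = np.random.default_rng(0)

W = rng.standard_normal((d_out, d_in))     # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01  # trainable down-projection
B = np.zeros((d_out, r))                   # trainable up-projection (zero init)

x = rng.standard_normal(d_in)
y = W @ x + B @ (A @ x)                    # LoRA forward pass

# Trainable parameters shrink from d_out*d_in to r*(d_in + d_out).
full_params = d_out * d_in                 # 262,144
lora_params = r * (d_in + d_out)           # 8,192
```

With these sizes, fine-tuning touches roughly 3% of the parameters a full update would, which is where the memory and compute savings come from.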
Benefits of LoRA Support
- Efficient serving of multiple LoRA adapters within a single batch
- Reduced memory footprint through the dynamic loading of LoRA adapters
- Seamless integration with existing BART model deployments
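The first benefit, serving several adapters in one batch, can be sketched as follows. This is an assumption-laden illustration, not the TensorRT-LLM API: each request carries an adapter ID, the base weight is shared across the batch, and only the small low-rank matrices differ per request, which is what makes it cheap to load adapters dynamically and mix them in flight:

```python
import numpy as np

rng = np.random.default_rng(1)
d, r = 64, 4
W = rng.standard_normal((d, d))            # base weight shared by all requests

# Hypothetical adapter registry: each entry holds one (A, B) low-rank pair.
adapters = {
    "summarize": (rng.standard_normal((r, d)), rng.standard_normal((d, r))),
    "translate": (rng.standard_normal((r, d)), rng.standard_normal((d, r))),
}

def forward(x, adapter_id):
    """Apply the shared base weight plus the request's own LoRA update."""
    A, B = adapters[adapter_id]
    return W @ x + B @ (A @ x)

# One batch, two requests, two different adapters.
batch = [(rng.standard_normal(d), "summarize"),
         (rng.standard_normal(d), "translate")]
outputs = [forward(x, aid) for x, aid in batch]
```

Storing only the `(A, B)` pairs per adapter, rather than a full copy of `W`, is what keeps the memory footprint small as the number of served adapters grows.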
Summary
NVIDIA TensorRT-LLM continues to expand its capabilities for optimizing and efficiently running LLMs across different architectures. Upcoming enhancements to encoder-decoder models include FP8 quantization, enabling further improvements in latency and throughput. For production deployments, NVIDIA Triton Inference Server provides the ideal platform for serving these models.
FAQs
Q: What is NVIDIA TensorRT-LLM?
A: NVIDIA TensorRT-LLM is an open-source library that optimizes inference for diverse model architectures.
Q: What is the purpose of in-flight batching?
A: In-flight batching improves throughput by inserting new requests into a running batch as soon as earlier requests finish, instead of waiting for the entire batch to complete. For encoder-decoder models, this requires gathering and batching each request's encoder output in flight alongside the decoder's KV cache management.
Q: What is low-rank adaptation (LoRA) support?
A: LoRA support enables customization of LLMs while maintaining impressive performance and minimal resource usage.
Q: What are the benefits of LoRA support?
A: The benefits of LoRA support include efficient serving of multiple LoRA adapters, reduced memory footprint, and seamless integration with existing BART model deployments.

