TensorRT-LLM will be gaining a wrapper for OpenAI's Chat API and performance improvements for LLMs.

source https://www.windowscentral.com/software-apps/nvidia-adds-support-for-openais-chat-api-to-its-latest-gpus-heres-why-its-its-a-big-deal