Nvidia launches a set of microservices for optimized inferencing


This post is by Frederic Lardinois from TechCrunch


At its GTC conference, Nvidia today announced Nvidia NIM, a new software platform designed to streamline the deployment of custom and pre-trained AI models into production environments. NIM takes the software work Nvidia has done around inferencing and optimizing models and makes it easily accessible by combining a given model with an optimized inferencing engine […] © 2024 TechCrunch. All rights reserved. For personal use only.