Serve multiple models with Amazon SageMaker and Triton Inference Server
Amazon SageMaker is a fully managed service for data science and machine learning (ML) workflows. It helps data scientists and developers prepare, build, train, and deploy high-quality ML models quickly by bringing together a broad set of capabilities purpose-built for ML. In 2021, AWS announced the integration of NVIDIA Triton Inference Server in SageMaker. You …
Read more “Serve multiple models with Amazon SageMaker and Triton Inference Server”