Scaling machine learning inference with NVIDIA TensorRT and Google Dataflow
A collaboration between Google Cloud and NVIDIA has enabled Apache Beam users to maximize the performance of ML models within their data processing pipelines, using NVIDIA TensorRT and NVIDIA GPUs alongside the new Apache Beam TensorRTEngineHandler. The NVIDIA TensorRT SDK provides high-performance neural network inference that lets developers optimize and deploy trained ML models on NVIDIA …