Tools mentioned by Jia Li
Software and services Jia Li has mentioned across podcast appearances.
NVIDIA TensorRT
by NVIDIA
“LiveX AI uses NVIDIA Triton, TensorRT, NeMo, and Nemotron across the stack, with models deployed on Kubernetes at Google Cloud for auto-scaling capabilities.”
NVIDIA Nemotron
by NVIDIA
“LiveX AI uses NVIDIA Triton, TensorRT, NeMo, and Nemotron across the stack, with models deployed on Kubernetes at Google Cloud for auto-scaling capabilities.”
NVIDIA Triton
by NVIDIA
“LiveX AI uses NVIDIA Triton, TensorRT, NeMo, and Nemotron across the stack, with models deployed on Kubernetes at Google Cloud for auto-scaling capabilities.”
NVIDIA NIM
by NVIDIA
“LiveX AI achieves six times faster average token speed using NVIDIA NIM microservices compared to traditional inference frameworks.”
Kubernetes
“LiveX AI uses NVIDIA Triton, TensorRT, NeMo, and Nemotron across the stack, with models deployed on Kubernetes at Google Cloud for auto-scaling capabilities.”
Google Cloud
by Google
“LiveX AI uses NVIDIA Triton, TensorRT, NeMo, and Nemotron across the stack, with models deployed on Kubernetes at Google Cloud for auto-scaling capabilities.”
CUDA
by NVIDIA
“The team optimizes at the kernel level using CUDA to maximize computational efficiency.”
NVIDIA NeMo
by NVIDIA
“LiveX AI uses NVIDIA Triton, TensorRT, NeMo, and Nemotron across the stack, with models deployed on Kubernetes at Google Cloud for auto-scaling capabilities.”