# Nvidia <> Jan

26 Jan

## Products

- Jan Desktop
  - Professional-grade GPUs
- Jan Server Suite
  - Datacenter-grade GPUs
  - H Series: Tensor Compute optimizations
- Nvidia Containers
  - https://docs.nvidia.com/nemo-framework/user-guide/latest/llama/deploy.html

## Strategy

- Demos
  - llama.cpp
- Production-grade
  - A100 or H100, DGX Cloud
  - TensorRT
    - Use it for speed, optimizations
    - https://developer.nvidia.com/blog/nvidia-tensorrt-llm-supercharges-large-language-model-inference-on-nvidia-h100-gpus/
- LLMOps Team
  - Build and Tune a RAG
  - Observability
  - Anything that removes 10-20% of spend

## TensorRT & Triton Roadmap

- Interwoven
- Performance Improvements:
  - https://nvidia.github.io/TensorRT-LLM/performance.html
- Triton will support multiple backends
  - TensorRT
  - vLLM
  - llama.cpp in PyTorch
- Model Deployment
  - https://docs.nvidia.com/nemo-framework/user-guide/latest/llama/deploy.html
- Support for top models
  - Mistral support

## Working with Nvidia

- Inception
  - Strategic Support
  - Should we upgrade with the H100?
- NVAIE = Enterprise Support
  - File tickets through NVAIE
  - Prod-grade support

## How do we sell Nvidia?

- If clients/partners need to buy Nvidia?
- GTM with Nvidia?
- Angle 1: TensorRT Adoption
  - Loop Ben in (Eng) for Professional, Enterprise-grade
  - Demo with TensorRT?
  - Release a blog (Nvidia Inception)
- NeMo
- Boutique SIs
  - Partner Kits (how to position Nvidia)
- Contact Nvidia Resellers?
- GTM with Nvidia
  - Cloud Validation
  - App Catalog
    - Inception
    - AWS, GCP, Azure = Clouds
    - Oracle Cloud
    - https://www.nvidia.com/en-us/gpu-accelerated-applications/?search=SDK
- Development Journeys
  - RTX Cards = Pending
- Workstream 1: Benchmark for TensorRT-LLM with Triton (Jan Server)
  - Alpha version: ASAP = help a lot
  - Blog Post, etc
- Workstream 2: Jan's GTM
  - Seldon Core? = MLOps
  - Enterprise Support, Self-serve
  - SDK with Public/Enterprise Offering
  - Self-serve will have different identity from
  - Certified with clouds
  - Docker containers certified for Infosec
  - Professional Services GTM
- Hiring

## Strategic Partners

- Investments
- June 2024 as a Raise window