NVIDIA Corporation
- 26.8k followers
- 2788 San Tomas Expressway, Santa Clara, CA, 95051
- https://nvidia.com
Pinned Loading
Repositories
- Model-Optimizer Public
A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.
NVIDIA/Model-Optimizer’s past year of commit activity - infra-controller Public
NVIDIA Infra Controller - Hardware Lifecycle Management and multitenant networking
NVIDIA/infra-controller’s past year of commit activity - TensorRT-LLM Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
NVIDIA/TensorRT-LLM’s past year of commit activity - SkillSpector Public
Security scanner for AI agent skills. Detect vulnerabilities, malicious patterns, and security risks.
NVIDIA/SkillSpector’s past year of commit activity - NeMo-Agent-Toolkit Public
The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.
NVIDIA/NeMo-Agent-Toolkit’s past year of commit activity - NeMo-Retriever Public
NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever Library uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.
NVIDIA/NeMo-Retriever’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…