✨ Zero-code distributed tracing and profiling, observability via eBPF 🚀
-
Updated
Jun 1, 2024 - Go
✨ Zero-code distributed tracing and profiling, observability via eBPF 🚀
A Portable Toolkit for deploying Edge AI and HPC (opencl, vulkan, simd, task scheduling)
A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine
An advanced guide to run Mac OS / OS X / macOS on QEMU/KVM with libvirtd/Virt-Manager. Includes various write-ups for deep customization.
cuVS - a library for vector search and clustering on the GPU
On-device AI across mobile, embedded and edge for PyTorch
Open3D: A Modern Library for 3D Data Processing
Serve, optimize and scale PyTorch models in production
Robotics with GPU computing
Tensors and Dynamic neural networks in Python with strong GPU acceleration
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A modern, high-performance C++17 graphics and compute library based on Vulkan
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
Stretching GPU performance for GEMMs and tensor contractions.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
An efficient C++17 GPU numerical computing library with Python-like syntax
Add a description, image, and links to the gpu topic page so that developers can more easily learn about it.
To associate your repository with the gpu topic, visit your repo's landing page and select "manage topics."