A high-throughput and memory-efficient inference and serving engine for LLMs
Updated May 13, 2026 - Python
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
An AI-powered custom node for ComfyUI designed to enhance workflow automation and provide intelligent assistance
A Next-Generation Training Engine Built for Ultra-Large MoE Models
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.
DeepSeek reverse-engineered API
[EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
⚡️ A robust, developer-friendly, and community-driven PHP client that provides a clean, extensible interface for integrating with the DeepSeek AI API.
A bot plugin for LLM chat with multi-platform model integration, extensibility, and various output formats
AI coding agent for your terminal.
A tinystruct-based chat module that integrates with @openai GPT-4 / 3.5-turbo / ChatGPT. @tinystruct
Model Context Protocol server for DeepSeek's advanced language models
DeepSeek V3 and R1 private API with deep thinking, search, and full requests; the PoW challenge is reverse-engineered.
An open-source reproduction of DeepSeek-R1.
🧘 A tool for querying operations resources in natural language through DingTalk, Feishu, and WeCom intelligent bots.
Go (Golang) client for the DeepSeek API. Deepseek Go supports DeepSeek-V3, DeepSeek-R1, and more.
DeepSeek API Platform integration for OpenClaw
Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.
AI-powered automation that extracts resume data, matches job listings, rewrites resumes to fit job descriptions, and manages storage using Google Drive and Sheets via n8n.