Members
-
awesome-deepseek-agent ★ PINNED
No description.
★ 3.9k 4d agoExplain → -
DeepSeek-V3
No description.
Python ★ 104k 9mo agoExplain → -
DeepSeek-R1
No description.
★ 92k 11mo agoExplain → -
awesome-deepseek-integration
Integrate the DeepSeek API into popular software
★ 38k 3mo agoExplain → -
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
Python ★ 24k 7mo agoExplain → -
DeepSeek-OCR
Contexts Optical Compression
Python ★ 23k 4mo agoExplain → -
Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
Python ★ 18k 1y agoExplain → -
FlashMLA
FlashMLA: Efficient Multi-head Latent Attention Kernels
C++ ★ 13k 1mo agoExplain → -
3FS
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
C++ ★ 10.0k 1mo agoExplain → -
DeepEP
DeepEP: an efficient expert-parallel communication library
Cuda ★ 9.7k 6d agoExplain → -
open-infra-index
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
★ 8.0k 1y agoExplain → -
DeepGEMM
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda ★ 7.4k 16d agoExplain → -
DeepSeek-LLM
DeepSeek LLM: Let there be answers
Makefile ★ 7.1k 2y agoExplain → -
DeepSeek-Coder-V2
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
★ 6.9k 7mo agoExplain → -
DeepSeek-VL2
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Python ★ 5.3k 1y agoExplain → -
DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
★ 5.0k 1y agoExplain → -
smallpond
A lightweight data processing framework built on DuckDB and 3FS.
Python ★ 5.0k 1y agoExplain → -
Engram
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
Python ★ 4.5k 5mo agoExplain → -
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Python ★ 4.1k 2y agoExplain → -
DeepSeek-Math
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Python ★ 3.3k 2y agoExplain → -
DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Python ★ 3.0k 1y agoExplain → -
DeepSeek-OCR-2
Visual Causal Flow
Python ★ 3.0k 4mo agoExplain → -
DualPipe
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
Python ★ 3.0k 5mo agoExplain → -
DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Python ★ 1.9k 2y agoExplain → -
DeepSeek-V3.2-Exp
No description.
Python ★ 1.6k 7mo agoExplain → -
TileKernels
A kernel library written in tilelang
Python ★ 1.6k 1mo agoExplain → -
DeepSeek-Math-V2
No description.
Python ★ 1.6k 6mo agoExplain → -
EPLB
Expert Parallelism Load Balancer
Python ★ 1.4k 1y agoExplain → -
DeepSeek-Prover-V2
No description.
★ 1.3k 11mo agoExplain → -
profile-data
Analyze computation-communication overlap in V3/R1.
★ 1.2k 1y agoExplain → -
awesome-deepseek-coder
A curated list of open-source projects related to DeepSeek Coder
★ 791 7mo agoExplain → -
ESFT
Expert Specialized Fine-Tuning
Python ★ 738 1y agoExplain → -
DeepSeek-Prover-V1.5
No description.
Python ★ 578 1y agoExplain → -
LPLB
An early research stage expert-parallel load balancer for MoE models based on linear programming.
Python ★ 504 7mo agoExplain →
No repos match these filters.