Supervised Reptile
The supervised-reptile repository
contains code associated with the paper
On First-Order Meta-Learning
Algorithms, which introduces Reptile, a
meta-learning ...
Enter
FlashMLA
FlashMLA is a high-performance decoding
kernel library designed especially for
Multi-Head Latent Attention (MLA)
workloads, targeting NVIDIA Hopper GPU
archite...
Enter
DeepSeek Coder
DeepSeek-Coder is a series of
code-specialized language models
designed to generate, complete, and
infill code (and mixed code + natural
language) with high fl...
Enter
DeepEP
DeepEP is a communication library
designed specifically to support
Mixture-of-Experts (MoE) and expert
parallelism (EP) deployments. Its core
role is to implem...
Enter
Open Infra Index
open-infra-index is a central
infrastructure index repository
maintained by DeepSeek AI that acts as a
catalog and hub for a collection of
production-tested ...
Enter
DeepSeek LLM
The DeepSeek-LLM repository hosts the
code, model files, evaluations, and
documentation for DeepSeeks LLM series
(notably the 67B Chat variant). Its
tagline i...
Enter
DeepSeek Coder V2
DeepSeek-Coder-V2 is the version-2
iteration of DeepSeeks code generation
models, refining the original
DeepSeek-Coder line with improved
architecture, traini...
Enter
DeepSeek VL2
DeepSeek-VL2 is DeepSeeks vision +
language multimodal modelessentially
the next-gen successor to their first
vision-language models. It combines
image and t...
Enter
DeepSeek V2
DeepSeek-V2 is the second major
iteration of DeepSeeks foundation
language model (LLM) series. This
version likely includes architectural
improvements, traini...
Enter