Skip to content
Change the repository type filter

All

    Repositories list

    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      7837.7k620Updated May 28, 2025May 28, 2025
    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Python
      MIT License
      6075.4k140Updated May 27, 2025May 27, 2025
    • ESFT

      Public
      Expert Specialized Fine-Tuning
      Python
      MIT License
      24961840Updated May 22, 2025May 22, 2025
    • 3FS

      Public
      A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
      C++
      MIT License
      8939k8620Updated May 21, 2025May 21, 2025
    • Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
      Creative Commons Zero v1.0 Universal
      2777.8k00Updated May 15, 2025May 15, 2025
    • Integrate the DeepSeek API into popular softwares
      Creative Commons Zero v1.0 Universal
      3.6k33k8444Updated May 13, 2025May 13, 2025
    • Other
      781.1k92Updated Apr 30, 2025Apr 30, 2025
    • FlashMLA

      Public
      FlashMLA: Efficient MLA decoding kernels
      Cuda
      MIT License
      83812k400Updated Apr 29, 2025Apr 29, 2025
    • [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      MIT License
      3532.9k340Updated Apr 22, 2025Apr 22, 2025
    • MIT License
      12k90k12025Updated Apr 9, 2025Apr 9, 2025
    • Python
      MIT License
      16k97k3938Updated Apr 9, 2025Apr 9, 2025
    • EPLB

      Public
      Expert Parallelism Load Balancer
      Python
      MIT License
      1931.2k51Updated Mar 24, 2025Mar 24, 2025
    • Analyze computation-communication overlap in V3/R1.
      1431k100Updated Mar 21, 2025Mar 21, 2025
    • DualPipe

      Public
      A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
      Python
      MIT License
      2962.8k40Updated Mar 10, 2025Mar 10, 2025
    • smallpond

      Public
      A lightweight data processing framework built on DuckDB and 3FS.
      Python
      MIT License
      4164.7k226Updated Mar 5, 2025Mar 5, 2025
    • DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
      Python
      MIT License
      1.8k4.9k9515Updated Feb 26, 2025Feb 26, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      MIT License
      2.2k17k15124Updated Feb 1, 2025Feb 1, 2025
    • DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
      MIT License
      5184.9k773Updated Sep 25, 2024Sep 25, 2024
    • DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
      MIT License
      8915.8k503Updated Sep 24, 2024Sep 24, 2024
    • Python
      MIT License
      23352280Updated Aug 16, 2024Aug 16, 2024
    • DeepSeek Coder: Let the Code Write Itself
      Python
      MIT License
      2.5k22k10320Updated May 21, 2024May 21, 2024
    • DeepSeek-VL: Towards Real-World Vision-Language Understanding
      Python
      MIT License
      5693.8k412Updated Apr 24, 2024Apr 24, 2024
    • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
      Python
      MIT License
      5162.7k321Updated Apr 15, 2024Apr 15, 2024
    • A curated list of open-source projects related to DeepSeek Coder
      20169800Updated Apr 3, 2024Apr 3, 2024
    • DeepSeek LLM: Let there be answers
      Makefile
      MIT License
      1k6.4k262Updated Feb 4, 2024Feb 4, 2024
    • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
      Python
      MIT License
      2791.7k173Updated Jan 16, 2024Jan 16, 2024