Hey There 👋
I'm Kye Gomez, Founder of Swarms. Our mission at Swarms is to build the agentic economy, enabling startups, organizations, and institutions to build fully autonomous organizations with multi-agent collaboration. Swarms provides a vast array of developer tools for Python, Rust, and various other ecosystems!
-
OpenMythos
A theoretical reconstruction of the Claude Mythos architecture, built from first principles using the available research literature.
Python ★ 14k 28d ago
-
swarms
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
Python ★ 6.9k 18h ago
-
tree-of-thoughts
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Python ★ 4.6k 10mo ago
-
BitNet
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch
Python ★ 1.9k 1d ago
-
awesome-multi-agent-papers
A compilation of the best multi-agent papers
TeX ★ 1.6k 7d ago
-
Open-AF3
Implementation of AlphaFold 3 from the paper: "Accurate structure prediction of biomolecular interactions with AlphaFold3" in PyTorch
Python ★ 803 1d ago
-
LongNet
Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"
Python ★ 720 2y ago
-
zeta
Build high-performance AI models with modular building blocks
Python ★ 595 1mo ago
-
RT-2
Democratization of RT-2 "RT-2: New model translates vision and language into action"
Python ★ 578 1y ago
-
VisionMamba
Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model". It's 2.8x faster than DeiT and saves 86.8% GPU memory when performing batch inference to extract features on high-res images.
Python ★ 495 18h ago
-
MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
Python ★ 473 1d ago
-
Gemini
The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google
Python ★ 466 1mo ago
-
Med-PaLM
Towards Generalist Biomedical AI
Python ★ 431 2y ago
-
ScreenAI
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
Python ★ 385 1d ago
-
Sophia
Effortless plug-and-play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.
Python ★ 381 2y ago
-
CM3Leon
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multimodal AI that uses just a decoder to generate both text and images
Python ★ 365 2y ago
-
PALM-E
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
Python ★ 337 2y ago
-
NaViT
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
Python ★ 272 1d ago
-
RT-X
PyTorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"
Python ★ 244 1d ago
-
MambaTransformer
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
Python ★ 224 1d ago
-
LFM
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models
Python ★ 222 1d ago
-
Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
Python ★ 217 1d ago
-
Vit-RGTS
Open source implementation of "Vision Transformers Need Registers"
Python ★ 216 18h ago
-
Python-Package-Template
An easy, reliable, fluid template for Python packages, complete with docs, testing suites, READMEs, GitHub workflows, linting, and much more
Shell ★ 201 1d ago
-
AttentionIsOFFByOne
Implementation of "Attention Is Off By One" by Evan Miller
Python ★ 198 2y ago
-
Andromeda
An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
Python ★ 150 1y ago
-
swarms-pytorch
Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊
Python ★ 147 18h ago
-
PALI3
Implementation of PALI3 from the paper: "PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"
Python ★ 147 1d ago
-
the-compiler
Seed, Code, Harvest: Grow Your Own App with Tree of Thoughts!
Python ★ 145 2y ago
-
SwitchTransformers
Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity"
Python ★ 142 1d ago
-
MORPHEUS-1
Implementation of "MORPHEUS-1" from Prophetic AI and "The world’s first multi-modal generative ultrasonic transformer designed to induce and stabilize lucid dreams."
Python ★ 133 1d ago
-
MoE-Mamba
Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in PyTorch and Zeta
Python ★ 130 18h ago
-
MambaByte
Implementation of MambaByte from the paper: "MambaByte: Token-free Selective State Space Model" in PyTorch and Zeta
Python ★ 127 1d ago
-
Mixture-of-Depths
Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Python ★ 121 18h ago
-
xLSTM
Implementation of xLSTM in PyTorch from the paper: "xLSTM: Extended Long Short-Term Memory"
Python ★ 118 1d ago
-
FlashAttention20
Get down and dirty with FlashAttention2.0 in PyTorch: plug in and play, no complex CUDA kernels
Python ★ 115 2y ago
-
Lets-Verify-Step-by-Step
"Improving Mathematical Reasoning with Process Supervision" by OpenAI
Python ★ 115 1mo ago
-
Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
Python ★ 101 2y ago
-
SparseAttention
PyTorch implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers"
Python ★ 97 18h ago
-
PALI
Democratization of "PaLI: A Jointly-Scaled Multilingual Language-Image Model"
Python ★ 95 2y ago
-
RoboCAT
Implementation of DeepMind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation". A next-generation robot LLM.
Python ★ 90 2y ago
-
Kosmos2.5
My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model"
Python ★ 75 1d ago
-
phi-1
Plug in and play implementation of "Textbooks Are All You Need", ready for training, inference, and dataset generation
Python ★ 73 2y ago
-
LiqudNet
Implementation of Liquid Nets in PyTorch
Python ★ 71 18h ago
-
HLT
Implementation of the transformer from the paper: "Real-World Humanoid Locomotion with Reinforcement Learning"
Python ★ 65 18h ago
-
Gamba
PyTorch implementation of "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"
Python ★ 65 8mo ago
-
StarlightVision
A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.
Python ★ 64 2y ago
-
movie-gen
An open source community implementation of the model from the paper: "Movie Gen: A Cast of Media Foundation Models". Join our community to help implement this model!
Python ★ 60 18h ago
-
Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in PyTorch
Python ★ 59 18h ago
-
Finetuning-Suite
Finetune any model on HF in less than 30 seconds
Jupyter Notebook ★ 57 18h ago
-
Griffin
Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
Python ★ 57 7mo ago
-
Sora
Implementation of the premier Text to Video model from OpenAI
Python ★ 57 1y ago
-
FlashAttention20Triton
Triton implementation of Flash Attention2.0
Python ★ 54 2y ago
-
LUMIERE
Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research
Python ★ 52 1y ago
-
qformer
Implementation of Qformer from BLIP2 in Zeta Lego blocks.
Python ★ 51 1y ago
-
Blockwise-Parallel-Transformer
32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.
Python ★ 50 3y ago
-
NeoSapiens
The next evolution of Agents
Python ★ 48 1d ago
-
SingLoRA
This repository provides a minimal, single-file implementation of SingLoRA (Single Matrix Low-Rank Adaptation) as described in the paper "SingLoRA: Low Rank Adaptation Using a Single Matrix" by Bensaïd et al.
Python ★ 46 18h ago
-
AutoRT
Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"
Python ★ 44 1y ago
-
WhiteRock
The world's first fully automated VC fund.
Python ★ 42 18h ago
-
DifferentialTransformer
An open source community implementation of the model from the "DIFFERENTIAL TRANSFORMER" paper by Microsoft.
Python ★ 41 18h ago
-
ViTAR
Implementation of ViTAR from the paper: "ViTAR: Vision Transformer with Any Resolution" in PyTorch
Python ★ 40 1y ago
-
AthenaOS
AthenaOS is a next generation AI-native operating system managed by Swarms of AI Agents
Rust ★ 40 2y ago
-
SimpleMamba
Implementation of a modular, high-performance, and simplistic mamba for high-speed applications
Python ★ 40 1y ago
-
metnet3
An implementation of "metnet3" in PyTorch
Python ★ 39 1d ago
-
AudioFlamingo
Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities"
Python ★ 39 1y ago
-
LIMoE
Implementation of "the first large-scale multimodal mixture of experts models" from the paper: "Multimodal Contrastive Learning with LIMoE: the Language-Image Mixture of Experts"
Python ★ 37 18h ago
-
MultiModalCrossAttn
The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"
Python ★ 37 2y ago
-
USM
Implementation of Google's USM speech model in PyTorch
Python ★ 36 18h ago
-
SIMA
PyTorch implementation of DeepMind's SIMA: "Scaling Instructable Agents Across Many Simulated Worlds"
Python ★ 35 2y ago
-
GPT4o
Community Open Source Implementation of GPT4o in PyTorch
Shell ★ 32 1d ago
-
MegaVIT
The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"
Python ★ 32 2mo ago
-
OpenStrawberry
An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
Python ★ 31 18h ago
-
MHMoE
Community implementation of the paper: "Multi-Head Mixture-of-Experts" in PyTorch
Python ★ 31 18h ago
-
OpenR1
An open source implementation of R1
Python ★ 31 1d ago
-
EvoVLM-JP
Plug in & Play PyTorch implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI
Python ★ 31 1y ago
-
attn_res
A clean, single-file PyTorch implementation of Attention Residuals (Kimi Team, MoonshotAI, 2026), integrated with Grouped Query Attention (GQA), SwiGLU feed-forward networks, and Rotary Position Embeddings (RoPE).
Python ★ 30 3mo ago
-
Simba
A simpler PyTorch + Zeta implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series"
Python ★ 29 1y ago
-
Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
Python ★ 28 18h ago
-
MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
Python ★ 28 18h ago
-
awesome-robotic-foundation-models
A vast array of Multi-Modal Embodied Robotic Foundation Models!
★ 28 2y ago
-
Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minimal learning curve.
Python ★ 27 1y ago
-
PROMPTS.md
Understanding CLAUDE.md, MEMORY.md, SKILLS.md, SOUL.md, and Related Prompting Mechanisms
★ 27 1mo ago
-
LFM2
A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid AI in PyTorch
Python ★ 27 18h ago
-
MC-ViT
Implementation of the model MC-ViT from the paper: "Memory Consolidation Enables Long-Context Video Understanding"
Python ★ 27 1d ago
-
Athena-for-Search
The World's First AI-Enabled Multi-Modality Native Search Engine
TypeScript ★ 26 2y ago
-
BRAVE-ViT-Swarm
Implementation of the paper: "BRAVE: Broadening the visual encoding of vision-language models"
Python ★ 26 2mo ago
-
SayCan
Implementation of "Do As I Can, Not As I Say: Grounding Language in Robotic Affordances" by Google
Python ★ 25 18h ago
-
TTL
PyTorch implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"
Python ★ 25 1d ago
-
Aurora
Implementation of the paper: "Aurora: A Foundation Model of the Atmosphere" in PyTorch
Python ★ 24 18h ago
-
Paper-Implementation-Template
A simple reproducible template to implement AI research papers
★ 24 1y ago
-
MuonClip
This repository is an open source implementation of the MuonClip strategy from the KIMI K2 Model from Moonshot AI
★ 23 7mo ago
-
open-moonvit
This is an ultra-simple, single-file PyTorch implementation of MoonViT, the native-resolution vision encoder from Kimi-VL.
Python ★ 22 1mo ago
-
MambaFormer
Implementation of MambaFormer in PyTorch + Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks"
Python ★ 22 18h ago
-
GPT3
An implementation of the base GPT-3 model architecture from the paper by OpenAI: "Language Models are Few-Shot Learners"
Python ★ 22 2y ago
-
FlashMHA
A simple PyTorch implementation of Flash MultiHead Attention
Jupyter Notebook ★ 22 2y ago
-
CogNetX
CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video processing into one unified framework.
Python ★ 21 18h ago
-
Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
Python ★ 20 1d ago
-
forest-of-thoughts
A forest of autonomous agents.
Python ★ 20 1y ago
-
dev-swarm
A swarm of LLM agents that will help you test, document, and productionize your code!
Python ★ 19 18h ago
-
TeraGPT
Train a production-grade GPT in less than 400 lines of code. Better than Karpathy's version and GIGAGPT
Python ★ 17 1d ago
-
PaLM2-VAdapter
Implementation of "PaLM2-VAdapter" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter"
Python ★ 17 1y ago
-
MELLE
An open source community implementation of the model MELLE from the paper: "Autoregressive Speech Synthesis without Vector Quantization"
Shell ★ 16 1d ago
-
LOGICGUIDE
Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%
Python ★ 16 3y ago
-
Hedgehog
Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"
Python ★ 16 2y ago
-
OmniByteFormer
OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing traditional tokenization or specific data-type encodings.
Python ★ 16 18h ago
-
MGQA
The open source implementation of multi-grouped query attention from the paper "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints"
Python ★ 16 2y ago
-
VLM-Mamba
We introduce VLM-Mamba, the first Vision-Language Model built entirely on State Space Models (SSMs), specifically leveraging the Mamba architecture.
Python ★ 15 5mo ago
-
VisionLLaMA
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
Python ★ 15 1y ago
-
Prometheus
Welcome to Prometheus, the revolutionary AI model that allows you to generate DNA sequences for any creature you can imagine. Whether it’s a pink panda, an elephant-sized turtle, or a completely new lifeform from your wildest dreams, Prometheus decodes the mystery of biology and synthesizes genetic blueprints with precision.
Python ★ 15 1d ago
-
AudioMamba
Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in PyTorch
Shell ★ 15 1d ago
-
VortexFusion
Transformers + Mambas + LSTMs, all in one model
Python ★ 15 18h ago
-
HSSS
Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling"
Python ★ 15 1y ago
-
netwatch
NETWATCH is a terminal-based security monitoring tool that gives you continuous, live visibility into every active network connection on your machine, with automatic risk scoring, GeoIP intelligence, VPN status, process path validation, and new-connection alerting.
Python ★ 14 1mo ago
-
poetry-cheatsheet
A super simple cheatsheet for poetry because I forget easily.
★ 14 2y ago
-
Tiktokx
Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrastive cross-modality dependency encoding to achieve superior performance compared to existing state-of-the-art multi-model recommenders.
Python ★ 14 2y ago
-
M2PT
Implementation of M2PT in PyTorch from the paper: "Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities"
Python ★ 14 2y ago
-
AGI
Welcome to AGI, the cutting-edge project dedicated to building the core components of Artificial General Intelligence.
Shell ★ 13 18h ago
-
Qwen-VL
My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't released model code yet sooo...
Python ★ 13 2y ago
-
ShallowFF
Zeta implementation of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"
Python ★ 13 18h ago
-
VideoVIT
Open source implementation of a vision transformer that can understand videos, using MaxViT as a foundation.
Python ★ 12 2y ago
-
VARC
An open source community implementation of VARC: "ARC Is a Vision Problem!"
Python ★ 12 7mo ago
-
COT-SC
Plug in and Play Prompt Technique to Boost Model reasoning by 40%
Python ★ 11 3y ago
-
TableMamba
TableMamba is a Mamba-based sequential recommender that encodes a user's interaction history with selective state-space blocks and a multi-interest head of M parallel SSM readers at log-spaced timescales, scoring items by their best-matching interest in O(T).
Python ★ 11 22d ago
-
PolymorphicHardDriveEncryption
A dynamic, self-healing, and AI-driven hard drive encryption algorithm inspired by Captain America
Python ★ 11 1d ago
-
OpioidRL
OpioidRL is a cutting-edge reinforcement learning (RL) library that simulates drug addiction behaviors within RL agents. Inspired by the addictive properties of drugs like methamphetamine and crack cocaine, OpioidRL offers a unique environment where agents experience reward dependency, high-risk decision-making, and compulsive behaviors — pushing
Python ★ 11 1d ago
-
weather-agent
Multi-Agent Template App
Python ★ 11 1d ago
-
AlphaDev
Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ultra fast sorting algorithm.
Python ★ 11 2y ago
-
VisionDatasets
Open source scripts to create large scale datasets with rich detail for multi-modal models
Python ★ 11 2y ago
-
Brainformer
My Implementation of "Decoding speech perception from non-invasive brain recordings"
Makefile ★ 10 1y ago
-
Open-Kimi
This repository is a straightforward attempt to implement the base Kimi K2 Reasoning model architecture in pure PyTorch as simply as possible.
Python ★ 10 7mo ago
-
GPT4
The open source implementation of the base model behind GPT-4 from OpenAI [Language + Multi-Modal]
Python ★ 10 2y ago
-
AgentGym
A framework making it effortless to convert any LLM into a reasoning agent like o1 or DeepSeek's R1
Shell ★ 10 1d ago
-
synchro
Synchronize your requirements.txt and pyproject.toml at the push of a button!
Python ★ 10 1d ago
-
gradient-ascent
A new optimizer, Gradient Ascent: Gradient Ascent adjusts the parameters in the direction of the gradient to maximize some objective function.
Python ★ 10 2y ago
-
Meta-Tree-Of-Thoughts
Tree of Thoughts with a meta prompt for a 50% boost in model reasoning
Python ★ 10 3y ago
-
cogvit
A simple, open PyTorch implementation of the ViT from the GLM paper: “GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents”
Python ★ 9 1mo ago
-
VO-ROPE
An implementation of the all-new RoPE from Jianlin
Python ★ 9 8mo ago
-
swarms-marketplace-api
The Python API for the Swarms marketplace: add, update, and query agents, tools, prompts, and more
Shell ★ 9 18h ago
-
Gated-Slot-Attention
An implementation of the paper: "Gated Slot Attention for Efficient Linear-Time Sequence Modeling" in PyTorch
Python ★ 9 1d ago
-
Speculative-Decoding
My own implementation of "Fast Inference from Transformers via Speculative Decoding"
Python ★ 9 2y ago
-
APACAI
The APAC AI Hub for Documents, Product Briefs, Plans, and SOPs. We're currently raising a $100M SAFE at a $1 billion valuation.
Shell ★ 9 2y ago
-
Koi
A simple PyTorch implementation of a meta-learning algorithm from OpenAI: "Reptile: A scalable meta-learning algorithm"
Python ★ 9 2y ago
-
EchoPulse
An all-new model architecture to detect devices on ranges of radio frequencies using transformers and mambas
Python ★ 9 1y ago
-
swarms-home
A conversational platform for the swarms framework.
TypeScript ★ 8 1d ago
-
CogVLM2
Implementation of "CogVLM2: Visual Language Models for Image and Video Understanding" in PyTorch
Shell ★ 8 1d ago
-
Reaper
Reaper is a simple polymorphic malware algorithm
Python ★ 8 2y ago
-
Zamba
Implementation of the paper: "Zamba: A Compact 7B SSM Hybrid Model" in PyTorch
Python ★ 8 1d ago
-
JaxTransformer
This repository demonstrates how to build a Decoder-Only Transformer with Multi-Query Attention in JAX.
Python ★ 8 1d ago
-
Fusion3D
An extremely experimental model that intakes images and generates 3D scenes of those images using Diffusion
Python ★ 8 18h ago
-
ai-reading-list
This collection brings together the highest-signal research papers in modern AI, from the invention of the Transformer to the frontier work of 2024–2025, into a single, curated map of the field.
★ 8 6mo ago
-
GeminiEmbeddingModel
This module implements the Gemini Embedding model as described in the research paper "Gemini Embedding: Generalizable Embeddings from Gemini" (2025).
Python ★ 8 8mo ago
-
SVMS
The Sonar Vision Mapping System is an advanced environmental mapping and visualization technology. It utilizes high-frequency sound pulses to create a real-time, three-dimensional map of an environment. This system is designed for applications in security, surveillance, and search-and-rescue operations.
Python ★ 8 7mo ago
-
GATS
Implementation of GATS from the paper: "GATS: Gather-Attend-Scatter" in PyTorch and Zeta
Python ★ 8 1y ago
-
StarForce-X
StarForce-X, an enterprise-grade, AI-driven model engineered to intercept, analyze, and decode alien radio frequencies and communications with unparalleled precision.
Python ★ 7 18h ago
-
SwarmsDiscord
A Discord bot that can do anything.
Python ★ 7 2y ago
-
Open-Olmo
Unofficial open-source PyTorch implementation of the OLMo Hybrid architecture introduced by the Allen Institute for AI (Ai2).
Python ★ 7 3mo ago
-
Open-NAMM
An open source implementation of the paper: "AN EVOLVED UNIVERSAL TRANSFORMER MEMORY"
Python ★ 7 8mo ago
-
Math-Arxviv
All mathematics research papers sourced from arXiv and meticulously curated for LLM pretraining purposes.
★ 7 2y ago
-
Open-Cursor-Agent
An open-source autonomous AI agent implementation inspired by Cursor Agent, built with the Swarms framework. This production-grade agent can autonomously plan, execute, and complete complex tasks using a combination of Large Language Model reasoning and tool execution.
Python ★ 7 7mo ago
-
AttentionGrid
A network of attention mechanisms at your fingertips. Unleash the potential of attention mechanisms for diverse AI applications. AttentionGrid is all you need!
Python ★ 7 2y ago
-
CLIPQ
A simple implementation of a CLIP that splits up an image into quadrants and then gets the embeddings for each quadrant
Python ★ 7 1y ago
-
awesome-omni-modal-papers
An awesome list of omni-modality LLM models that can perceive and generate images, videos, audios, and more all at once
★ 7 1y ago
-
DSH
A novel implementation of a Drone Swarm Hivemind model that can accept inputs from an arbitrary number of drones and execute actions on each drone. A multi-input, multi-output model.
Shell ★ 7 2y ago
-
Midas
Implementation of Midas from [Towards Robust Monocular Depth Estimation] in PyTorch and Zeta
Shell ★ 7 2y ago
-
ShortCircuit
Implementation of the paper "SHORTCIRCUIT: ALPHAZERO-DRIVEN CIRCUIT DESIGN" in PyTorch
Python ★ 6 1d ago
-
AoA-torch
Implementation of Attention on Attention in Zeta
Python ★ 6 18h ago
-
kyegomez
Advancing Humanity with Multi-Modality AI
★ 6 3mo ago
-
Expanding-Godel-s-Ontological-Proof
This paper presents a comprehensive analysis of Kurt Gödel’s ontological proof for the existence of God, with particular focus on contemporary expansions and refinements of the original framework
★ 6 10mo ago
-
CrysisAI
Implementation of the AI model from Crysis with Heat Imaging and multi-object detection using YOLO.
Python ★ 6 2y ago
-
ModelHub
The Ultimate Hub of AI Models - Simplified, Streamlined, and Scalable for Production-Grade Deployment.
Shell ★ 5 1d ago
-
CNNGPT
This CNN-based language model leverages causal and dilated convolutions, gated activations, residual connections, and layer normalization to effectively model textual data for generation tasks.
Python ★ 5 1d ago
-
ai-civilization ⑂
Building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
★ 5 2y ago
-
Odin
SOTA Classification at scale for UAVs, Drones, and much more
Python ★ 5 2y ago
-
MambaDecoderBlock
MambaDecoderBlock is a novel decoder architecture that replaces traditional self-attention mechanisms with Mamba state space models, augmented by Mixture of Experts (MoE) layers.
Python ★ 5 6mo ago
-
Toto
A PyTorch implementation of the paper: "Toto: Time Series Optimized Transformer for Observability"
Shell ★ 5 1d ago
-
NanoCAD ⑂
open-source nanotech CAD
Python ★ 5 3y ago
-
Athena-Create
Athena is an all-in-one app that allows easy creation of various things like images, videos, music, and soon, even apps, using natural language.
HCL ★ 5 3y ago
-
Tree-Attention-Torch
An implementation of Tree-Attention in PyTorch because it's in JAX for some reason
Python ★ 5 1d ago
-
ChronoFormer
A production-grade implementation of a memory-efficient transformer specifically designed for tabular time series data.
Python ★ 5 1y ago
-
HGRN2
Implementation of the paper: "HGRN2: Gated Linear RNNs with State Expansion" in PyTorch
Shell ★ 5 1y ago
-
swarm-home
No description.
TypeScript ★ 5 2y ago
-
Multi-Model-Training
An experimental repository on research for training multiple models all at once in an evolutionary capacity!
Python ★ 4 1d ago
-
GPTBot
Open source implementation of a super reliable web crawler powered by LLMs by OpenAI
★ 4 2y ago
-
OmniAlignNet
An open-source PyTorch implementation of OmniAlignNet from the OmniVinci paper, designed to align vision and audio embeddings in a shared omni-modal space.
Python ★ 4 8mo ago
-
Modeling-Economic-Systems-as-Neural-Networks
This paper presents a groundbreaking framework that models economic systems as intelligent neural networks, offering a novel approach to understanding how economies learn, adapt, and self-organize.
★ 4 1y ago
-
Shadows-of-Other-Worlds
Shadows of Other Worlds: Detecting Multiversal Interference in Quantum Measurements
★ 4 1y ago
-
mcs-platform
Your Personal Hospital Powered by Swarms.ai
TypeScript ★ 3 4mo ago
-
Optimus-Prime ⑂
A simple but complete full-attention transformer with a set of promising experimental features from various papers
Python ★ 3 2y ago
-
AutoGPT ⑂
An experimental open-source attempt to make GPT-4 fully autonomous.
★ 3 2y ago
-
Chai-1
A free and open source community implementation of Chai-1 in PyTorch
Python ★ 3 1d ago
-
SSM-As-VLM-Bridge
An exploration into leveraging SSMs as bridge/adapter layers for VLMs
Python ★ 3 18h ago
-
primus
A multimodal foundation model for humanoid robotics that integrates multiple input modalities—text, speech, vision (images and videos), and outputs both actions and speech simultaneously like a transformer.
★ 3 1y ago
-
SuperPromptOmega ⑂
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents. I made a prompt optimization swarm and updated it hundreds of times. This is the 400th iteration.
★ 3 1y ago
-
A-Theory-on-Value-Creation
This paper introduces a comprehensive and formal theory on value creation, bridging the gap between classical economic models and contemporary economic realities driven by technology, innovation, and intangible assets.
★ 3 1y ago
-
open_qwen
A non-official implementation of Qwen 3.5, as there doesn’t seem to be a paper or any code available that I can find, so I decided to implement it just for fun.
Python ★ 2 3mo ago
-
kyegomez-com
No description.
MDX ★ 2 3mo ago
-
agents
A Definitive and Unified AI Agents Framework to Automate Anything and Everything
★ 2 3y ago
-
KNOTX
KNOTX is a new and theoretical activation function designed for ultra-fast multi-modality learning utilizing knot theory + dynamical systems modeling.
★ 1 3y ago
-
segment-geospatial ⑂
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
★ 1 3y ago