Members
-
AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Jupyter Notebook ★ 14k 6mo agoExplain → -
AI-Scientist-v2
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Python ★ 6.6k 6mo agoExplain → -
continuous-thought-machines
Continuous Thought Machines, because thought takes time and reasoning is a process.
Python ★ 2.0k 5mo agoExplain → -
evolutionary-model-merge
Official repository of Evolutionary Optimization of Model Merging Recipes
Python ★ 1.4k 1y agoExplain → -
text-to-lora
Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input
Python ★ 1.3k 1y agoExplain → -
ShinkaEvolve
ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution 🧬
Python ★ 1.2k 12d agoExplain → -
self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
Python ★ 1.2k 1y agoExplain → -
doc-to-lora
Hypernetworks that update LLMs to remember factual information
Python ★ 756 6d agoExplain → -
treequest
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
Python ★ 552 4mo agoExplain → -
asal
Automating the Search for Artificial Life with Foundation Models!
Jupyter Notebook ★ 474 8mo agoExplain → -
RLT
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
Python ★ 362 1y agoExplain → -
evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
Python ★ 359 1y agoExplain → -
AI-Scientist-ICLR2025-Workshop-Experiment
No description.
Python ★ 301 1y agoExplain → -
sparser-faster-llms
Cuda kernels for leveraging LLM sparsity to improve throughput and decrease the memory requirements during inference and training.
Cuda ★ 245 1mo agoExplain → -
DiffusionBlocks
DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation
Python ★ 229 4mo agoExplain → -
DroPE
Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding
Python ★ 218 5mo agoExplain → -
drq
Digital Red Queen: Adversarial Program Evolution in Core War with LLMs
Red ★ 205 5mo agoExplain → -
DiscoPOP ⑂
Code for Discovering Preference Optimization Algorithms with and for Large Language Models
Python ★ 197 2y agoExplain → -
ALE-Bench
The official repository of ALE-Bench
Python ★ 188 17d agoExplain → -
natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
Jupyter Notebook ★ 171 10mo agoExplain → -
TinySwallow-ChatUI
Browser-based chat UI for TinySwallow-1.5B that runs without API calls.
CSS ★ 136 6mo agoExplain → -
TAID
Official implementation of "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
Python ★ 123 8mo agoExplain → -
ab-mcts-arc2
No description.
Python ★ 115 11mo agoExplain → -
robust-kbench
No description.
Python ★ 98 7mo agoExplain → -
kame
No description.
Python ★ 92 1mo agoExplain → -
digital-ecosystem
Interactive multi-agent NCA ecosystem simulation
JavaScript ★ 82 2mo agoExplain → -
repo
RePo: Language Models with Context Re-Positioning
Python ★ 77 2mo agoExplain → -
petri-dish-nca
No description.
Python ★ 58 7mo agoExplain → -
TinySwallow-ChatUI-Local
Python-based chat demo for TinySwallow-1.5B that works completely offline
Python ★ 58 1y agoExplain → -
CycleQD
CycleQD is a framework for parameter space model merging.
Python ★ 49 1y agoExplain → -
IASC
LLMs for Constructed Languages
HTML ★ 48 2mo agoExplain → -
shachi
Reimagining Agent-based Modeling with Large Language Model Agents via Shachi
Python ★ 46 6d agoExplain → -
edinet2dataset
edinet2dataset is a tool to construct financial dataset using EDINET.
Python ★ 40 3mo agoExplain → -
EDINET-Bench
[ICLR 2026] Evaluating the performance of LLMs on Japanese challenging financial tasks.
Python ★ 35 3mo agoExplain → -
Kamon
Data and code for understanding and generation of Kamon.
Python ★ 35 3mo agoExplain → -
vllm ⑂
A high-throughput and memory-efficient inference and serving engine for LLMs
★ 35 2y agoExplain → -
kame_finetune
No description.
Python ★ 30 1mo agoExplain → -
L2D
Large language models to diffusion finetuning code
Python ★ 27 1y agoExplain → -
TransEvalnia
Reasoning-based Evaluation and Ranking of Translations.
Python ★ 20 18d agoExplain → -
fast-weight-product-key-memory
Code for Fast-weight Product Key Memory (FwPKM)
Python ★ 19 3mo agoExplain → -
neuroevolution-for-ai
Neuroevolution Community
★ 14 7mo agoExplain → -
AC-DC
No description.
Python ★ 11 2mo agoExplain → -
nca-alife
Learning Neural Cellular Automata that produce Open-Ended Alife!
Jupyter Notebook ★ 11 1y agoExplain → -
rl-razor-mnist
Replication of the MNIST experiments from paper **RL's Razor: Why Online Reinforcement Learning Forgets Less**
Python ★ 9 3mo agoExplain → -
Sudoku-Bench
An AI benchmark for creative, human-like problem solving using Sudoku variants
★ 7 17d agoExplain → -
DreamCubed
No description.
Jupyter Notebook ★ 6 1mo agoExplain → -
google-code-golf-2025
No description.
Python ★ 6 7mo agoExplain → -
orcaclaw ⑂
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞 Now with more orca.
★ 5 4mo agoExplain → -
ike
A DeepSpeed-based framework for distributed training and inference of language models.
Python ★ 2 3mo agoExplain → -
BALROG ⑂
Benchmarking Agentic LLM and VLM Reasoning On Games
★ 2 10mo agoExplain → -
AC-DC-eval_harness
No description.
Python ★ 1 2mo agoExplain → -
mle-bench-shinka-agent ⑂
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Python ★ 1 7mo agoExplain → -
LanguageEvolution
No description.
Python ★ 0 9d agoExplain → -
KamonBench
KamonBench: A Grammar-Based Dataset for Evaluating Compositional Factor Recovery in Vision-Language Models
Python ★ 0 1mo agoExplain →
No repos match these filters.