1-day current streak·11-day longest streak
-
llm_context_benchmarks ★ PINNED
📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimized for Apple Silicon), visual performance charts.
Python ★ 68 6d agoExplain → -
wine_variety_classification ★ PINNED
Examples on how to use various LLM providers with a Wine Classification problem
Python ★ 129 1mo agoExplain → -
qwen-image-mps ★ PINNED
Qwen Image models through MPS
Python ★ 271 5mo agoExplain → -
z-image-mps ★ PINNED
Z Image models through MPS
Python ★ 166 6mo agoExplain → -
chatbot-ollama ★ PINNED
Chatbot Ollama is an open source chat UI for Ollama.
TypeScript ★ 1.9k 9mo agoExplain → -
kubernetes-the-hard-way-on-azure ⑂
Bootstrap Kubernetes the hard way on Microsoft Azure Platform. No scripts.
Makefile ★ 457 2y agoExplain → -
prompt-eng-ollama-interactive-tutorial
Ollama's Interactive Prompt Engineering Tutorial
Jupyter Notebook ★ 264 1y agoExplain → -
lmstudio_hf
A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.
Python ★ 88 7mo agoExplain → -
autogram
Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.
Python ★ 82 2y agoExplain → -
ai-toolkit ⑂
The ultimate training toolkit for finetuning diffusion models
Python ★ 34 4mo agoExplain → -
fasterliveportrait-mlx
Apple MLX port of FasterLivePortrait for Apple Silicon
Python ★ 25 27d agoExplain → -
hn_local_image
Turn the Hacker News front page into local AI art. Powered by mlx-vlm, MFlux, and Apple Silicon.
Python ★ 17 4d agoExplain → -
asitop ⑂
Perf monitoring CLI tool for Apple Silicon
Python ★ 16 2y agoExplain → -
mlx-openbench ⑂
Provider-agnostic, open-source evaluation infrastructure for language models
Python ★ 12 3mo agoExplain → -
ml-ssd-mlx ⑂
No description.
Python ★ 11 16d agoExplain → -
easy-azure-opensource
OpenSource deployment made easy
Shell ★ 10 11y agoExplain → -
XBai-o4 ⑂
No description.
Python ★ 6 10mo agoExplain → -
ds4 ⑂
DeepSeek 4 Flash local inference engine for Metal
C ★ 4 22d agoExplain → -
autoresearch-mlx-local ⑂
Apple Silicon (MLX) port of Karpathy's autoresearch — autonomous AI research loops on Mac, no PyTorch required.
★ 3 1mo agoExplain → -
unsloth-mlx ⑂
Bringing the Unsloth experience to Mac users via Apple's MLX framework
★ 3 5mo agoExplain → -
awesome-llm-apps ⑂
Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.
★ 3 1y agoExplain → -
canalsplat
CanalSplat - An interactive gallery showcasing Canaletto's Venice paintings as 3D Gaussian Splats. Explore iconic Venetian views in an immersive web experience, featuring a curated selection of Canaletto masterpieces rendered as interactive 3D scenes.
CSS ★ 3 6mo agoExplain → -
benchmarks_llm_silicon
A place to store benchmarks results on Apple Silicon Mx machines running MLX, llama.cpp, LM Studio, Ollama or anything else relevant for the community.
★ 3 1y agoExplain → -
mlx-lm ⑂
Run LLMs with MLX
Python ★ 2 24d agoExplain → -
ollama_tic_tac_toe_agent
No description.
Python ★ 2 1y agoExplain → -
mcp-server-rabbitmq ⑂
MCP server for interacting with RabbitMQ
Python ★ 2 1y agoExplain → -
vllm-metal ⑂
Community maintained hardware plugin for vLLM on Apple Silicon
Python ★ 2 2mo agoExplain → -
Claude-Cowork ⑂
OpenSource Claude Cowork. A desktop AI assistant that helps you with programming, file management, and any task you can describe.
★ 2 5mo agoExplain → -
jan ⑂
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.
★ 2 4mo agoExplain → -
mlx-ui ⑂
A simple UI / Web / Frontend for MLX mlx-lm using Streamlit.
Python ★ 2 1y agoExplain → -
mlx_simple_benchmarks
Very simple benchmarks around Apple mlx
Python ★ 2 2y agoExplain → -
gemma-cookbook ⑂
A collection of guides and examples for the Gemma open models from Google.
Jupyter Notebook ★ 1 8h agoExplain → -
mlx-audio ⑂
A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
Python ★ 1 15d agoExplain → -
mistral.rs ⑂
Fast, flexible LLM inference
★ 1 17d agoExplain → -
chatbot-ui ⑂
The open-source AI chat app for everyone.
TypeScript ★ 1 2y agoExplain → -
ToolCall-15 ⑂
No description.
TypeScript ★ 1 2mo agoExplain → -
prompt-eng-interactive-tutorial ⑂
Anthropic's Interactive Prompt Engineering Tutorial
★ 1 1y agoExplain → -
mac-studio-server ⑂
Optimized Ollama LLM server configuration for Mac Studio and other Apple Silicon Macs. Headless setup with automatic startup, resource optimization, and remote management via SSH.
★ 1 1y agoExplain → -
parallax ⑂
Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere
★ 1 6mo agoExplain → -
tinker-cookbook ⑂
Post-training with Tinker
★ 1 7mo agoExplain → -
hf-mem ⑂
A CLI to estimate inference memory requirements for Hugging Face models, written in Python.
★ 1 5mo agoExplain → -
ai-engineering-hub ⑂
No description.
★ 1 1y agoExplain → -
distilabel ⑂
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
★ 1 9mo agoExplain → -
mlx-knife ⑂
ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)
Python ★ 1 10mo agoExplain → -
OLMo ⑂
Modeling, training, eval, and inference code for OLMo
★ 1 2y agoExplain → -
CrewAI ⑂
No description.
Python ★ 1 1y agoExplain → -
chat-with-mlx ⑂
Chat with your data natively on Apple Silicon using MLX Framework.
Python ★ 1 1y agoExplain → -
PicoMLXServer ⑂
No description.
★ 1 2y agoExplain → -
SiLLM ⑂
No description.
Python ★ 1 2y agoExplain → -
ResearchPlot ⑂
No description.
★ 1 2y agoExplain → -
mactop ⑂
mactop - Apple Silicon Monitor Top
★ 0 5d agoExplain → -
coreai-model-zoo ⑂
Community model zoo + knowledge base for Apple Core AI (iOS/macOS 27): Qwen3.5 & Gemma 4 converted end-to-end, verified on-device (iPhone 17 Pro GPU/ANE), conversion gotchas, custom Metal kernels, Swift runner
★ 0 6d agoExplain → -
mlx-vlm ⑂
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
Python ★ 0 3d agoExplain → -
repoprompt-classic ⑂
Archived educational snapshot of the Repo Prompt app
★ 0 16d agoExplain → -
repoprompt-ce ⑂
Community edition of RepoPrompt: a native macOS context engineering app for AI coding agents, with an MCP CLI.
★ 0 8d agoExplain → -
GemmaDesktop ⑂
An experiment, what if Gemma had a Desktop app tuned for the model and offline scenarios?
TypeScript ★ 0 9d agoExplain → -
deep-swe ⑂
Measuring frontier coding agents on original, long-horizon engineering tasks
★ 0 12d agoExplain → -
omlx ⑂
LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar
Python ★ 0 13d agoExplain → -
ratel-bench ⑂
Benchmark harness for Ratel: BM25 retrieval evaluation + agent-campaign with LLM judge
★ 0 1mo agoExplain → -
pi-mono ⑂
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
★ 0 3mo agoExplain → -
dflash-mlx ⑂
Lossless DFlash speculative decoding for MLX on Apple Silicon
Python ★ 0 1mo agoExplain → -
orbitals ⑂
No description.
★ 0 1mo agoExplain → -
BenchLocal ⑂
No description.
TypeScript ★ 0 2mo agoExplain → -
hermes-agent ⑂
The agent that grows with you
Python ★ 0 2mo agoExplain → -
llama-benchy ⑂
llama-benchy - llama-bench style benchmarking tool for all backends
Python ★ 0 2mo agoExplain → -
pi-vs-claude-code ⑂
Comparison between open source PI agent and closed source Claude Code agent
★ 0 3mo agoExplain → -
reap-mlx ⑂
REAP expert pruning for MoE LLMs on Apple Silicon via MLX
★ 0 3mo agoExplain → -
maclocal-api ⑂
'afm' command cli: macOS server and single prompt mode that exposes Apple's Foundation and MLX Models and other APIs running on your Mac through a single aggregated OpenAI-compatible API endpoint. Supports Apple Vision and single command (non-server) inference with piping as well . Now with Web Browser and local AI API aggregator
Swift ★ 0 3mo agoExplain → -
pocket-server ⑂
An OS for your agents, built for your pocket.
★ 0 8mo agoExplain → -
gabliteration ⑂
Automated hyperparameter search for optimal Gabliteration configurations on large language models
Python ★ 0 3mo agoExplain → -
mlx-lm-lora-example-notebooks ⑂
this repo has all official MLX-LM-LoRA example notebooks for training on Apple Silicon
★ 0 4mo agoExplain → -
kooka-server ⑂
mlx-lm server wrapper for agentic harness
Python ★ 0 5mo agoExplain → -
exo ⑂
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Python ★ 0 5mo agoExplain → -
SHARP-ML ⑂
No description.
★ 0 5mo agoExplain → -
mlx-lm-lora ⑂
Train Large Language Models on MLX.
Python ★ 0 7mo agoExplain → -
intelligence-per-watt ⑂
No description.
★ 0 7mo agoExplain → -
mlx-lm-lens ⑂
Find the hidden meaning of LLMs
Python ★ 0 7mo agoExplain → -
mlx-swift-examples ⑂
Examples using MLX Swift
Swift ★ 0 7mo agoExplain → -
OpenDevin ⑂
🐚 OpenDevin: Code Less, Make More
★ 0 2y agoExplain → -
consumer-tflop-database ⑂
Gets your TFLOPs for your GPU quickly
★ 0 8mo agoExplain → -
mlx-engine ⑂
LM Studio Apple MLX engine
Python ★ 0 8mo agoExplain → -
mlx_parallm ⑂
Fast parallel LLM inference for MLX
★ 0 9mo agoExplain → -
agno ⑂
Lightweight framework for building Agents with memory, knowledge, tools and reasoning.
Python ★ 0 9mo agoExplain → -
Hands-On-Large-Language-Models ⑂
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
★ 0 11mo agoExplain → -
tflops_mps
TFLOPs testing on MPS and CUDA
Python ★ 0 9mo agoExplain → -
fastmlx ⑂
FastMLX is a high performance production ready API to host MLX models.
★ 0 1y agoExplain → -
HRM ⑂
Hierarchical Reasoning Model Official Release
★ 0 10mo agoExplain → -
anycoder ⑂
No description.
Python ★ 0 10mo agoExplain → -
mlx-gui ⑂
MLX-GUI MLX Inference Server
★ 0 11mo agoExplain → -
mlx ⑂
MLX: An array framework for Apple silicon
C++ ★ 0 1y agoExplain → -
MetaGPT ⑂
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
★ 0 2y agoExplain → -
llm-perfbench ⑂
No description.
★ 0 1y agoExplain → -
human_applesilicon ⑂
Human Eval scores evaluated locally using a 3090
★ 0 1y agoExplain → -
mlx-examples ⑂
Examples in the MLX framework
Python ★ 0 1y agoExplain → -
macos-core-to-core-latency ⑂
Core-to-core latency benchmark that works on MacOS without hard affinity
C++ ★ 0 1y agoExplain → -
msft_analysis
No description.
TypeScript ★ 0 1y agoExplain → -
mlxcli ⑂
Run large models from the terminal using Apple MLX.
★ 0 2y agoExplain → -
llm-mlx ⑂
Support for MLX models in LLM
Python ★ 0 1y agoExplain → -
llm ⑂
Access large language models from the command-line
★ 0 1y agoExplain → -
SnakeBench ⑂
No description.
Python ★ 0 1y agoExplain → -
mlx-omni-server ⑂
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. It implements OpenAI-compatible API endpoints, enabling seamless integration with existing OpenAI SDK clients while leveraging the power of local ML inference.
★ 0 1y agoExplain → -
crewAI-tools ⑂
No description.
Python ★ 0 1y agoExplain → -
automatic-learning-amplifier ⑂
MLX-based QA pair generator and LLM finetuning tool in Streamlit
★ 0 1y agoExplain → -
Phi-3CookBook ⑂
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
Jupyter Notebook ★ 0 1y agoExplain → -
lobe-icons ⑂
🥨 Lobe Icons - Popular AI / LLM Model Brand SVG Logo and Icon Collection.
★ 0 1y agoExplain → -
llama-recipes ⑂
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
★ 0 1y agoExplain → -
f5-tts-mlx ⑂
Implementation of F5-TTS in MLX
★ 0 1y agoExplain → -
theWrongRoom ⑂
Interrogate LLMs to solve corporate mysteries.
★ 0 1y agoExplain → -
smol-vision ⑂
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
★ 0 1y agoExplain → -
aider ⑂
aider is AI pair programming in your terminal
Python ★ 0 1y agoExplain → -
chat-logger ⑂
No description.
★ 0 1y agoExplain → -
MLX-vs-Pytorch ⑂
Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs
Python ★ 0 1y agoExplain → -
mflux ⑂
A MLX port of FLUX based on the Huggingface Diffusers implementation.
★ 0 1y agoExplain → -
huggingface.js ⑂
Utilities to use the Hugging Face Hub API
★ 0 1y agoExplain → -
ChatMLX ⑂
ChatMLX is a large model real-time conversation app implemented using MLX.🚧
Swift ★ 0 1y agoExplain → -
crewAI-examples ⑂
No description.
Python ★ 0 1y agoExplain → -
mlx-tuning-fork ⑂
Very basic framework for parameterized large language model (Q)LoRa fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture for systematic running of easily parameterized fine-tunes
Python ★ 0 1y agoExplain → -
rlx ⑂
A reinforcement learning framework based on MLX.
Python ★ 0 1y agoExplain → -
lightning-whisper-mlx ⑂
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
Python ★ 0 1y agoExplain → -
DIY-Astra ⑂
No description.
★ 0 2y agoExplain → -
lamini-sdk ⑂
No description.
★ 0 2y agoExplain → -
mlx-benchmark ⑂
Benchmark of Apple's MLX operations on mlx gpu, cpu, torch mps and cuda.
Python ★ 0 2y agoExplain → -
Hermes-Function-Calling ⑂
No description.
Python ★ 0 2y agoExplain → -
LibreChat ⑂
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
TypeScript ★ 0 2y agoExplain → -
react-complete-guide-code ⑂
Code snapshots and materials for our "React - The Complete Guide" course (https://acad.link/reactjs)
★ 0 2y agoExplain → -
mlx-image ⑂
mlx image models for Apple Silicon machines
Jupyter Notebook ★ 0 2y agoExplain → -
laserRMT ⑂
This is our own implementation of 'Layer Selective Rank Reduction'
Python ★ 0 2y agoExplain → -
yet-another-applied-llm-benchmark ⑂
No description.
★ 0 2y agoExplain → -
ollama-webui ⑂
ChatGPT-Style Web UI Client for Ollama 🦙
Svelte ★ 0 2y agoExplain → -
helix ⑂
Create your own AI by fine-tuning open source models
★ 0 2y agoExplain → -
mlx-moe ⑂
Scripts to create your own moe models using mlx
Python ★ 0 2y agoExplain → -
semantic-kernel ⑂
Integrate cutting-edge LLM technology quickly and easily into your apps
★ 0 2y agoExplain → -
plock ⑂
From anywhere you can type, query and stream the output of an LLM or any other script
★ 0 2y agoExplain → -
LLM-automator ⑂
No description.
Python ★ 0 2y agoExplain → -
LinAlg4DataScience ⑂
Code that accompanies the book "Linear Algebra for Data Science"
★ 0 2y agoExplain → -
ollama ⑂
Get up and running with Llama 2 and other large language models locally
Go ★ 0 2y agoExplain → -
xformers ⑂
Hackable and optimized Transformers building blocks, supporting a composable construction.
★ 0 2y agoExplain → -
crewai-experiments ⑂
Experiments with local as well as models available through an api
★ 0 2y agoExplain → -
LLM-FTC-sampling ⑂
First token cutoff sampling inference example
Python ★ 0 2y agoExplain → -
gguf-tools ⑂
GGUF implementation in C as a library and a tools CLI program
★ 0 2y agoExplain → -
nanoGPT_mlx ⑂
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
★ 0 2y agoExplain → -
streaming-openai-fastapi-langchain
No description.
★ 0 2y agoExplain → -
mongo ⑂
The MongoDB Database
★ 0 2y agoExplain → -
continue ⑂
⏩ the open-source autopilot for software development—bring the power of ChatGPT to VS Code and JetBrains
★ 0 2y agoExplain → -
python-mastery ⑂
Advanced Python Mastery (course by @dabeaz)
★ 0 2y agoExplain → -
langchain ⑂
⚡ Building applications with LLMs through composability ⚡
Python ★ 0 2y agoExplain → -
MassTransit ⑂
Distributed Application Framework for .NET
C# ★ 0 2y agoExplain → -
awesome-chatgpt-prompts ⑂
This repo includes ChatGPT prompt curation to use ChatGPT better.
★ 0 3y agoExplain → -
workflow-core ⑂
Lightweight workflow engine for .NET Standard
C# ★ 0 3y agoExplain → -
kubernetes-elastic-stack ⑂
How to set up the Elastic stack on Kubernetes
Shell ★ 0 8y agoExplain → -
ServiceStack ⑂
Thoughtfully architected, obscenely fast, thoroughly enjoyable web services for all
C# ★ 0 9y agoExplain →
No repos match these filters.