Tyler Michael Smith

@tlrmchlsmth

vLLM Core Maintainer | MTS at Red Hat

47 repos
152 followers
65 following

Just 29%
Python 19%
Shell 14%
Go 10%
Dockerfile 10%

1.5k contributions in the last year

13-day current streak·14-day longest streak

‹ swipe through months ›

Jun 2025

SMTWTFS123456789101112131415161718192021222324252627282930

Jul 2025

SMTWTFS12345678910111213141516171819202122232425262728293031

Aug 2025

SMTWTFS12345678910111213141516171819202122232425262728293031

Sep 2025

SMTWTFS123456789101112131415161718192021222324252627282930

Oct 2025

SMTWTFS12345678910111213141516171819202122232425262728293031

Nov 2025

SMTWTFS123456789101112131415161718192021222324252627282930

Dec 2025

SMTWTFS12345678910111213141516171819202122232425262728293031

Jan 2026

SMTWTFS12345678910111213141516171819202122232425262728293031

Feb 2026

SMTWTFS12345678910111213141516171819202122232425262728

Mar 2026

SMTWTFS12345678910111213141516171819202122232425262728293031

Apr 2026

SMTWTFS123456789101112131415161718192021222324252627282930

May 2026

SMTWTFS12345678910111213141516171819202122232425262728293031

Jun 2026

SMTWTFS123456789101112131415161718192021222324252627282930

Less More

All public repos (47)

Show forks Show archived Sort

vllm ★ PINNED ⑂

A high-throughput and memory-efficient inference and serving engine for LLMs

Python ★ 0 1d ago
Explain →
llm-d ★ PINNED ⑂

llm-d is a Kubernetes-native high-performance distributed LLM inference framework

Makefile ★ 0 1mo ago
Explain →
blis ★ PINNED ⑂

BLAS-like Library Instantiation Software Framework

C ★ 1 3y ago
Explain →
coolS

A LaTeX package to use the Cool S as a symbol in math equations

TeX ★ 9 8y ago
Explain →
momms

Multilevel Optimized Matrix-matrix Multiplication Sandbox

C ★ 8 7y ago
Explain →
j-pareto

No description.

Python ★ 3 1mo ago
Explain →
j-llm-d

Justfile harness for llm-d

Just ★ 2 3d ago
Explain →
tms_submod

Routines for submodular set function minimization

C++ ★ 2 6y ago
Explain →
prefill-decode-experiments

No description.

Just ★ 1 1y ago
Explain →
nvshmem-guide

No description.

★ 1 1y ago
Explain →
vllm-dp-lws

No description.

Dockerfile ★ 1 11mo ago
Explain →
blas_gemm_rust_driver

Just drivers to time mkl & blis dgemm written in Rust

Rust ★ 1 9y ago
Explain →
nightly-eval

No description.

Python ★ 0 8h ago
Explain →
claudectx

kubectx for AI coding agents — switch paired Claude Code + Codex CLI contexts (settings, tokens, skills, MCP servers) and translate config between them

Go ★ 0 15d ago
Explain →
dotfiles

No description.

Shell ★ 0 15d ago
Explain →
tv ⑂

No description.

★ 0 26d ago
Explain →
DeepGEMM ⑂

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

★ 0 1mo ago
Explain →
vllm-skills

No description.

Shell ★ 0 2mo ago
Explain →
flashinfer ⑂

FlashInfer: Kernel Library for LLM Serving

★ 0 1mo ago
Explain →
llmd-routing-bench

Benchmarking tool for the llm-d routing sidecar (P/D disaggregation overhead)

Go ★ 0 3mo ago
Explain →
DeepEP ⑂

DeepEP: an efficient expert-parallel communication library

★ 0 2mo ago
Explain →
vllm-dev-env

No description.

★ 0 5mo ago
Explain →
combine_traces

No description.

Python ★ 0 5mo ago
Explain →
llm-d-dev-img

No description.

Just ★ 0 6mo ago
Explain →
my_pods

No description.

Dockerfile ★ 0 9mo ago
Explain →
guidellm ⑂

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

★ 0 9mo ago
Explain →
llm-d-modelservice ⑂

No description.

★ 0 11mo ago
Explain →
benchmark-pod-interactive ⑂

Pod for benchmarking interactive in llm-d

★ 0 11mo ago
Explain →
llm-d-infra ⑂

llm-d helm charts and deployment examples

★ 0 10mo ago
Explain →
ptgq_fp8

No description.

Python ★ 0 11mo ago
Explain →
llm-d-inference-scheduler ⑂

Inference scheduler for llm-d

★ 0 2mo ago
Explain →
canhazgpu ⑂

A simple GPU reservation tool for single host shared development systems

★ 0 11mo ago
Explain →
ci-infra ⑂

This repo hosts code for vLLM CI & Performance Benchmark infrastructure.

★ 0 1y ago
Explain →
literate-bassoon

No description.

Just ★ 0 1y ago
Explain →
ibgda-repro

No description.

★ 0 1y ago
Explain →
llm-d-project-template

No description.

Just ★ 0 1y ago
Explain →
pd_examples

No description.

Just ★ 0 1y ago
Explain →
LMCache ⑂

Redis for LLMs

Python ★ 0 1y ago
Explain →
lmcache-tests ⑂

No description.

★ 0 1y ago
Explain →
lmcache-server ⑂

No description.

★ 0 1y ago
Explain →
lmcache-vllm ⑂

The driver for LMCache core to run in vLLM

★ 0 1y ago
Explain →
torchac_cuda ⑂

No description.

★ 0 1y ago
Explain →
cutlass ⑂

CUDA Templates for Linear Algebra Subroutines

★ 0 1y ago
Explain →
transformers ⑂

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python ★ 0 1y ago
Explain →
flash-attention ⑂

Fast and memory-efficient exact attention

★ 0 1y ago
Explain →
flux ⑂

A fast communication-overlapping library for tensor parallelism on GPUs.

C++ ★ 0 1y ago
Explain →
momms_exper_driver

No description.

Shell ★ 0 9y ago
Explain →

No repos match these filters.