Asim A.Osman

@Asimawad ·Cape Town ·www.linkedin.com/in/asim-awad-hussein-osman-35174b336

56 repos
4 followers
7 following

Jupyter Notebook 53%
Python 29%
HTML 9%
Java 3%
Assembly 3%

444 contributions in the last year

17-day longest streak

Jun 2025

15161718192021222324252627282930

Jul 2025

12345678910111213141516171819202122232425262728293031

Aug 2025

12345678910111213141516171819202122232425262728293031

Sep 2025

123456789101112131415161718192021222324252627282930

Oct 2025

12345678910111213141516171819202122232425262728293031

Nov 2025

123456789101112131415161718192021222324252627282930

Dec 2025

12345678910111213141516171819202122232425262728293031

Jan 2026

12345678910111213141516171819202122232425262728293031

Feb 2026

12345678910111213141516171819202122232425262728

Mar 2026

12345678910111213141516171819202122232425262728293031

Apr 2026

123456789101112131415161718192021222324252627282930

May 2026

12345678910111213141516171819202122232425262728293031

Jun 2026

1234567891011121314151617181920

Hi, I'm Asim AI Research Engineer based in Cape Town, South Africa 🇿🇦 I specialize in Multi-Agent Reinforcement Learning and LLM Agents & Engineering Currently at InstaDeep working on MARL…

Hi, I'm Asim

AI Research Engineer based in Cape Town, South Africa 🇿🇦

I specialize in Multi-Agent Reinforcement Learning and LLM Agents & Engineering

Currently at InstaDeep working on MARL research, I'm currently focused on combining Contrastive Goal Conditioned Reinforcement Learnining and Unsupervised Environment Design (UED) in Multi Agent settings.

🎓 MSc in AI from University of Cape Town & AIMS South Africa (Google DeepMind Scholar)

---

🔬 What I Work On

Multi-Agent RL — Contrastive learning, goal-conditioned RL, and curriculum strategies in JAX
LLM Agents — Autonomous agents for ML engineering, scientific discovery, and code generation
Inference-Time Scaling — Making open-source LLMs competitive with proprietary models
LLM Engineering — Fine-tuning, RLHF (PPO/GRPO/DPO), vLLM serving, distributed training

Skills

I'm good with Python JAX/Flax PyTorch vLLM HuggingFace TRL Unsloth LangGraph/LangSmith TPU/GPU

![Website](https://asimawad.github.io)
![LinkedIn](https://www.linkedin.com/in/asim-awad-hussein-osman-35174b336/)
![Email](mailto:[email protected])

All public repos (56)

Show forks Show archived