Chrome2api
OpenAI-compatible local API wrapper for Chrome Gemini Nano / ChromeML
Wraps Chrome's built-in Gemini Nano AI model behind an OpenAI-compatible local HTTP API so any OpenAI-ready tool can use on-device AI without an internet connection.
Chrome2api is a wrapper that lets you use the AI model built into Google Chrome as if it were a standard AI API. Google Chrome ships with a small on-device language model called Gemini Nano that runs entirely on your local computer, no internet connection required. This project exposes that model through an interface that matches the format used by OpenAI's API, which means any tool or application that already knows how to talk to a standard AI API can be pointed at this local server instead.
The way it works: a local HTTP server runs on your machine and listens for requests in the OpenAI format. When a request arrives, it passes the prompt to a small runner program that talks directly to Chrome's internal AI runtime. The AI processes the prompt and sends the response back through the same chain. The whole thing stays on your machine, using Chrome's model weights and GPU drivers that are already installed on your system.
An important detail is that the project does not include the actual AI model files or Chrome's proprietary runtime libraries. You must source those yourself from a local Chrome installation. The README explains exactly where to find them on a Windows machine and where to place them in the project folder. The runner program itself is open-source and needs to be compiled from the included C++ source code.
The project currently runs on Windows only. It supports text prompts, multi-turn conversations, image inputs (from local file paths), and one audio file at a time. It handles both standard and streaming response formats. The README is clear that this is experimental and does not replicate all the features or permissions handling that Chrome's official browser-based AI API offers.
The project is released under the MIT license for its own code. The Chrome runtime files and model weights remain under Google's terms and are not redistributed here.
Where it fits
- Point any OpenAI-compatible tool at a local server to run AI inference entirely on your Windows machine without sending data to the cloud
- Add offline AI responses to a personal app by swapping the OpenAI API endpoint for this local server
- Test multi-turn conversations and image prompts against Chrome's on-device Gemini Nano model for free