8-day longest streak
-
pdfLLM
pdfLLM is a completely open source, proof of concept RAG app.
Python ★ 187 10mo agoExplain → -
exaOCR
A simple CPU only OCR for pdf/images/word/excel to markdown. With streamlit.
Python ★ 51 5mo agoExplain → -
qwen3-2b-ocr-app
A simple streamlit app to play with qwen3-2b-VL to perform OCR. Dockerized set up, tested with 3060 12 GB.
Python ★ 32 7mo agoExplain → -
qwen2.5VLM-OCR
A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk.
Python ★ 26 10mo agoExplain → -
hunyuan-1b-ocr-app
The hunyuan 1B OCR model is pretty promising when it comes to OCR. It is lightweight, and very effective.
Python ★ 14 7mo agoExplain → -
Turbo1bit ⑂
Turbo1Bit: Combining 1-bit LLM weights (Bonsai) with TurboQuant KV cache compression for maximum inference efficiency. 4.2x KV cache compression + 16x weight compression = ~10x total memory reduction.
C ★ 4 3mo agoExplain → -
qwen3-4b-horace
This is the qwen3 4b instruct model. Being fine tuned for construction data.
Python ★ 4 10mo agoExplain → -
exaMath
Simple Accounting System for Contractors.
TypeScript ★ 2 3d agoExplain → -
backblaze-lancedb-0.17.0
This is a pet project to use backblaze b2-s3 protocol with LanceDB for vector storage directly. Not too shabby.
Python ★ 1 6mo agoExplain → -
malv-exaPipeline
This repo is only meant for MALV session usage and improvements.
Python ★ 1 5mo agoExplain → -
exaPipeline
exaPipeline is a sophisticated pipeline to process raw files (pdfs etc) into synthetic data for training. View exaOCR and exaPipelineDashboard
Python ★ 1 6mo agoExplain → -
data-wizard ⑂
Extract Structured Data from PDFs, Word Docs and Images. Embeddable directly into your application, regardless of the stack.
★ 1 1y agoExplain → -
page-agent ⑂
JavaScript in-page GUI agent. Control web interfaces with natural language.
★ 0 2d agoExplain → -
agentic-trading-desk ⑂
AI-assisted trading desk for short-term technical analysis on stocks & ETFs via Robinhood MCP. Deterministic Python engines score each asset on a three-pillar framework (Trend · Momentum · Macro-Sentiment) using EMA, RSI, MACD, TRIX & Bollinger Bands. The AI fetches data; the scripts compute; the human approves every order.
★ 0 2d agoExplain → -
DeepRead ⑂
[DeepRead] This is the official implementation of the DeepRead paper.
★ 0 1mo agoExplain → -
vllm-gfx906-mobydick ⑂
A high-throughput and memory-efficient inference and serving engine for LLMs - Optimized for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
★ 0 1mo agoExplain → -
frappe_docker ⑂
Docker environment for developing, deploying, and running Frappe applications (ERPNext and custom apps) in production and development
Python ★ 0 1mo agoExplain → -
erpnext ⑂
Free and Open Source Enterprise Resource Planning (ERP)
★ 0 1mo agoExplain → -
myBidMania
A personalized bid mania for contractors. Easily log your project, add team members, and configure your email settings for automated notifications as you choose. Dockerized so you can easily deploy this anywhere.
TypeScript ★ 0 2mo agoExplain → -
zvec ⑂
A lightweight, lightning-fast, in-process vector database
★ 0 4mo agoExplain → -
glm-ocr-3060-vllm
This is a dockerized set up to run z.ai/GLM-OCR on your 3060 machine. It will require 575 drivers.
Dockerfile ★ 0 4mo agoExplain → -
qwen3-4b-vl-instruct-fp8-OCR-app
This is a production ready OCR Application with Streamlit for testing. Meant to be ran on an L40S 48 GB GPU. The model is Qwen3-4B-Instruct-FP8 for faster inference.
Python ★ 0 4mo agoExplain → -
paddleOCR_rtx6000
WIP testing.
Python ★ 0 4mo agoExplain → -
qwen3-7b-vl-ocr-app
This is strictly to be ran on a RTX Pro 6000 Blackwell. It is a FastAPI + Streamlit application meant to quickly test, and be scaled for batch OCR.
Python ★ 0 4mo agoExplain → -
rtx6000_ocr
test
Python ★ 0 4mo agoExplain → -
clawdbot ⑂
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
★ 0 5mo agoExplain → -
bio ⑂
The world's most powerful open-source bio AI assistant - Access academic literature, clinical trials, drug labels, and more, all through natural conversation.
★ 0 5mo agoExplain → -
agent-farm ⑂
30 MCP tools for local AI agent swarms - runs entirely on qwen3:4b
★ 0 5mo agoExplain → -
vllm-gfx906 ⑂
vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
★ 0 6mo agoExplain → -
exaTokenizer
Uses OpenAI Tokenizer + exaOCR and essentially gets the markdown format content and counts the token. Streamlit is there for interaction as always.
Python ★ 0 6mo agoExplain → -
openshorts ⑂
Generate clips from a video or youtube link
★ 0 6mo agoExplain → -
llama.cpp-gfx906 ⑂
llama.cpp-gfx906
★ 0 6mo agoExplain → -
exaPipelineDashboard
This streamlit app is a demo of exaPipeline as a demo. Feel free to recreate in whatever frontend you like.
Python ★ 0 6mo agoExplain → -
qwen3-4b-vllm-docker-mi50
This docker-compose.yml is going to pull the qwen3-4b-instruct-2507-awq and run it in vLLM. For ease of deployment on a AMD Instinct Mi50 (32GB)
★ 0 6mo agoExplain → -
dgraph ⑂
high-performance graph database for real-time use cases
★ 0 11mo agoExplain → -
OCRmyPDF ⑂
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
★ 0 10mo agoExplain → -
langextract ⑂
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
★ 0 10mo agoExplain →
No repos match these filters.