big-list-of-naughty-strings ★ PINNED
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
Python ★ 48k 2y ago
simpleaichat ★ PINNED
Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
Python ★ 3.5k 2y ago
automl-gs ★ PINNED
Provide an input CSV and a target field to predict, generate a model + code to run it.
Python ★ 1.9k 6y ago
gpt-2-simple ★ PINNED
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
Python ★ 3.4k 3y ago
stylecloud ★ PINNED
Python package + CLI to generate stylistic wordclouds, including gradients and icon shapes!
Python ★ 841 5y ago
aitextgen ★ PINNED
A robust Python tool for text-based AI training and generation using GPT-2.
Python ★ 1.8k 3y ago
textgenrnn
Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
Python ★ 4.9k 4y ago
hacker-news-undocumented
Some of the hidden norms about Hacker News not otherwise covered in the Guidelines and the FAQ.
★ 3.8k 1y ago
facebook-page-post-scraper ▣
Data scraper for Facebook Pages, and also code accompanying the blog post How to Scrape Data From Facebook Page Posts for Statistical Analysis
Python ★ 2.1k 7y ago
person-blocker
Automatically "block" people in images (like Black Mirror) using a pretrained neural network.
Python ★ 2.0k 3y ago
gpt-3-experiments
Test prompts for OpenAI's GPT-3 API and the resulting AI-generated texts.
Python ★ 696 6y ago
video-to-gif-osx
A set of utilities that allow the user to easily convert video files to very-high-quality GIFs on OS X.
Shell ★ 394 7y ago
copy-syntax-highlight-osx
Copy Syntax Highlight for OS X is an OS X service which copies the selected text to the clipboard, with proper syntax highlighting for the given language.
★ 379 10y ago
gemimg
Lightweight wrapper for generating and editing images from Gemini 2.5 Flash Image/Nano Banana
Python ★ 356 6mo ago
gpt-2-cloud-run
Text-generation API via GPT-2 for Cloud Run
HTML ★ 312 5y ago
reactionrnn
Python module + R package to predict the reactions to a given text using a pretrained recurrent neural network.
Python ★ 300 7y ago
gpt-2-keyword-generation
Method to encode text for GPT-2 to generate text based on provided keywords
Python ★ 262 5y ago
miditui
An interactive terminal app/UI for MIDI composing, mixing, and playback—written in Rust
Rust ★ 250 5mo ago
download-tweets-ai-text-gen
Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation.
Python ★ 226 6y ago
tweet-generator
Train a neural network optimized for generating tweets based off of any number of Twitter users.
Python ★ 222 7y ago
char-embeddings
A repository containing 300D character embeddings derived from the GloVe 840B/300D dataset, plus code that uses these embeddings to train a deep learning model in Keras to generate Magic: The Gathering cards
Python ★ 215 9y ago
magic-the-gifening
A Twitter bot which tweets Magic: The Gathering cards with appropriate GIFs superimposed onto them.
Python ★ 214 8y ago
system-dashboard
Minimalist Win/OSX/Linux System Dashboard using Flask and Freeboard
HTML ★ 202 9y ago
imgmaker
Create high-quality images programmatically with easily-hackable templates.
Python ★ 191 1y ago
imgbeddings
Python package to generate image embeddings with CLIP without PyTorch/TensorFlow
Python ★ 161 4y ago
ctrl-gce
Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.
Shell ★ 151 6y ago
facebook-ad-library-scraper
A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.
Python ★ 139 6y ago
ai-generated-pokemon-rudalle
Python script to preprocess images of all Pokémon to finetune ruDALL-E
Python ★ 139 4y ago
get-all-hacker-news-submissions-comments
Simple Python scripts to download all Hacker News submissions and comments and store them in a PostgreSQL database.
Python ★ 127 9y ago
mtg-gpt-2-cloud-run
Code and UI for running a Magic card text generator API via GPT-2
HTML ★ 125 7y ago
hacker-news-gpt-2
Dump of generated texts from GPT-2 trained on Hacker News titles
★ 120 7y ago
reddit-bigquery
Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily
R ★ 111 10y ago
llm-write-better-code
Conversation logs with Claude 3.5 Sonnet to try and iteratively optimize code
Jupyter Notebook ★ 101 1y ago
ballin
A colorful interactive physics simulator with thousands of balls, but in your terminal!
Rust ★ 100 5mo ago
optillusion-animation
Python code to submit rotated images to the Cloud Vision API + R code for visualizing it
Python ★ 98 7y ago
chatgpt_api_test
Demos utilizing the ChatGPT API
Jupyter Notebook ★ 94 3y ago
gpt-3-client
A client for OpenAI's GPT-3 API for ad hoc testing of prompts without using the web interface.
Python ★ 88 5y ago
stylistic-word-clouds
Python scripts for creating stylistic word clouds
Python ★ 87 10y ago
stable-diffusion-negative-prompt
Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.
Jupyter Notebook ★ 87 3y ago
gpt3-blog-title-optimizer
Python code for building a GPT-3 based technical blog post optimizer.
Jupyter Notebook ★ 85 3y ago
amazon-spark
R Code + R Notebook for analyzing millions of Amazon reviews using Apache Spark
HTML ★ 85 9y ago
twcloud
Python package + CLI to generate wordclouds of Twitter tweets.
Python ★ 78 6y ago
twitter-cloud-run
A (relatively) minimal configuration app to run Twitter bots on a schedule that can scale to unlimited bots.
Python ★ 78 5y ago
get-profile-data-of-repo-stargazers
This repository contains a script used to get the GitHub profile information of all the people who've starred a given GitHub repository
Python ★ 68 7y ago
deep-learning-cpu-gpu-benchmark
Repository to benchmark the performance of Cloud CPUs vs. Cloud GPUs on TensorFlow and Google Compute Engine.
HTML ★ 66 9y ago
mtg-embeddings
Code used to create text embeddings of all Magic: The Gathering cards.
Jupyter Notebook ★ 64 1y ago
icon-image
Python script to quickly generate a Font Awesome icon imposed on a background for steering AI image generation.
Python ★ 55 3y ago
gpt-j-6b-experiments
Test prompts for GPT-J-6B and the resulting AI-generated texts
★ 53 5y ago
hacker-news-download-all-stories
Download *ALL* the submissions from Hacker News
Python ★ 51 12y ago
ml-data-generator
Python script to generate fake datasets optimized for testing machine learning/deep learning workflows
Python ★ 50 7y ago
clickbait-cluster
Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly
HTML ★ 46 5y ago
youtube-video-scraper
Tools for scraping YouTube video metadata (mostly for training AI on video titles)
Python ★ 42 5y ago
keras-cntk-docker
Docker container for keras + cntk intended for nvidia-docker
Python ★ 42 8y ago
foursquare-venue-scraper
A Foursquare data scraper that gathers all venues within a specified geographic area.
Python ★ 39 7y ago
interactive-facebook-reactions
Jupyter notebook + Code for processing Facebook Reactions data and making Interactive Charts
HTML ★ 38 10y ago
langchain-problems
Demos of some issues with LangChain.
Jupyter Notebook ★ 32 3y ago
minimaxir.github.io
Blog Posts and Theme for https://minimaxir.com
HTML ★ 32 1mo ago
nyc-taxi-notebook
R Code + Jupyter notebook for analyzing and visualizing NYC Taxi data
R ★ 31 10y ago
sdxl-experiments
Jupyter Notebooks for experimenting with Stable Diffusion XL 1.0
Jupyter Notebook ★ 30 2y ago
yelp-review-analysis
Repository containing script on how I processed and charted Yelp data.
R ★ 29 11y ago
autotweet-from-googlesheet
A minimal proof-of-concept Python script to tweet human-curated Tweets on a schedule.
Python ★ 27 5y ago
subreddit-generator
Train a neural network optimized for generating Reddit subreddit posts
Python ★ 27 8y ago
predict-reddit-submission-success
Repository w/ Jupyter + R Notebooks for creating a model to predict the success of Reddit submissions with Keras.
HTML ★ 27 9y ago
tritonize
Convert images to a styled, minimal representation, quickly with NumPy
Python ★ 27 8y ago
frames-to-gif-osx
An application that allows the user to easily convert frames to very-high-quality GIFs on OS X.
★ 26 10y ago
keras-cntk-benchmark
Code for Benchmarking CNTK performance on Keras vs. TensorFlow
Python ★ 26 8y ago
chatgpt-structured-data
Demos of ChatGPT's function calling/structured data support.
Jupyter Notebook ★ 25 2y ago
pokemon-3d
Code + Visualizations processing and visualizing Pokémon data in 3D
HTML ★ 25 10y ago
ggplot-tutorial
Repository for ggplot2 tutorial
R ★ 24 11y ago
sf-arrests-when-where
R Code + Jupyter notebook for replicating analysis of when and where arrests in San Francisco occur.
R ★ 23 10y ago
legaladvice-gpt2
Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles
★ 23 7y ago
chatgpt-tips-analysis
Jupyter Notebooks for testing the impact of tip incentives for ChatGPT
Jupyter Notebook ★ 22 2y ago
pokemon-ai
A text-generating AI to generate Pokémon names.
Python ★ 20 7y ago
pokemon-embeddings
Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.
Jupyter Notebook ★ 20 2y ago
facebook-keyword-regression-analysis
Regression Analysis for Facebook keywords.
R ★ 20 13y ago
ai-generated-magic-cards
Tools for encoding Magic: The Gathering cards into a form suitable for AI text generation
Python ★ 19 5y ago
get-heart-rate-csv
A small Python script to get the heart rate data generated from an Apple Watch in a CSV form
Python ★ 19 8y ago
subreddit-related
Code and visualizations for related/similar subreddits
Jupyter Notebook ★ 19 10y ago
get-bars-from-foursquare
A quick pair of Python scripts to retrieve all bars within a given area, then retrieve metadata and process it.
Python ★ 19 12y ago
stylecloud-examples
Examples of stylistic word clouds generated via the stylecloud Python package
Python ★ 19 6y ago
stack-overflow-survey
Code + Visualizations for processing 2016 Stack Overflow Survey Data
Jupyter Notebook ★ 19 10y ago
mtg-card-creator-api
Code for running a Magic card image generator API
Python ★ 18 7y ago
reddit-gpt-2-cloud-run
Reddit title generator API based on GPT-2
HTML ★ 18 6y ago
tensorflow-multiprocess-ray
Proof of concept on how to use TensorFlow for prediction tasks in a multiprocess setting.
Python ★ 18 7y ago
lists ⑂
The definitive list of lists (of lists) curated on GitHub
Python ★ 18 11y ago
reddit-comment-length
R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization
R ★ 17 11y ago
reddit-graph
Jupyter notebook + Code for reproducing Reddit Subreddit graphs
Jupyter Notebook ★ 17 10y ago
automl-gs-examples
Examples + Visualizations of datasets modeled using automl-gs
Python ★ 16 7y ago
ncaa-basketball
R Code + R Notebook on how to process and visualize NCAA basketball data.
R ★ 16 8y ago
get-data-from-photos-from-instagram-tags
Processes data from images which are tagged with the specified Instagram tag.
Python ★ 15 12y ago
resetera-gpt-2
Scraper of ResetEra threads and posts to get them into a format suitable for feeding them into GPT-2.
Python ★ 15 7y ago
sfba-compensation
Jupyter notebook + Code for scraping AngelList data and making an interactive chart of SFBA salaries/equity
HTML ★ 14 10y ago
hn-heatmaps
Code and data necessary to reproduce heatmaps relating HN Submission time to submission score.
R ★ 13 11y ago
char-tsne-visualization
Visualizations of character embeddings from derived character vectors.
HTML ★ 13 9y ago
hacker-news-comment-analysis
Code used for analysis of Hacker News comments.
R ★ 12 11y ago
imdb-data-analysis
R Code + R Notebook on how to process and visualize the official IMDb datasets.
★ 12 8y ago
sf-crimes-covid
Spot checking impact of SF shelter-in-places on crime reporting.
★ 12 6y ago
nano-banana-tests
Notebooks testing Nano Banana
Jupyter Notebook ★ 11 7mo ago
notebooks
This GitHub Repository stores my R Notebooks, allowing GitHub Pages to serve the R Notebooks on my website
HTML ★ 11 6y ago
neural-doodle ⑂
Turn your two-bit doodles into fine artworks with deep neural networks, generate seamless textures from photos, transfer style from one image to another, perform example-based upscaling, but wait... there's more! (An implementation of Semantic Style Transfer.)
Python ★ 11 10y ago
imgur-decline
R Code + R Notebook for analyzing the decline of Imgur on Reddit.
HTML ★ 11 9y ago
gpt-2-fanfiction
Experiments with generating GPT-2 fanfiction on specified topics.
★ 11 7y ago
all-marvel-comics-characters
Creates a .csv of all Marvel Comics Characters + Statistics via the Marvel API
Python ★ 10 12y ago
movie-gender
Data and code for analyzing Movie Lead Gender.
Jupyter Notebook ★ 10 10y ago
llm-use
Miscellaneous chat logs of conversations with Claude 3.5/3.7 Sonnet
★ 9 1y ago
reddit-subreddit-keywords
Code + Jupyter notebook for analyzing and visualizing means and medians of keywords in the top Reddit Subreddits.
R ★ 9 10y ago
online-class-charts
Code needed to reproduce data analysis and charts for MIT/Harvard Online Course Data
R ★ 9 12y ago
ggplot2-web
R Code + R Notebook on how to make high quality data visualizations on the web with ggplot2.
HTML ★ 9 9y ago
icon-to-image
High-performance Rust library with Python bindings for rendering Font Awesome icons to images.
Jupyter Notebook ★ 8 4mo ago
aggregate-data-from-likes-of-friends
Aggregates the data from a Facebook user's friends' likes.
★ 8 13y ago
gpt-4o-audio-tests
Testing GPT-4o Audio Generation capabilities
Jupyter Notebook ★ 8 1y ago
sf-arrests-predict
R Code + R Notebook for predicting arrest types in San Francisco.
HTML ★ 8 9y ago
breach-network
R Code + R Notebook for creating an interactive graph network of Have I Been Pwned data using R and Plotly.
HTML ★ 8 9y ago
reddit-mean-score
Quick data visualization for Reddit Mean Submission Score by Subreddit
★ 8 6y ago
modeling-link-aggregators
R Code + R Notebook on how to process and visualize both Reddit and Hacker News data.
★ 8 7y ago
apps
Repo for my webpage-only apps.
HTML ★ 7 4y ago
nndex
In-memory nearest neighbor search engine for Python, implemented in Rust.
Rust ★ 7 4mo ago
ru-dalle ⑂
Generate images from texts. In Russian
Jupyter Notebook ★ 7 4y ago
get-number-of-friends-facebook-likes
Gets the number of Facebook Likes for each of your friends, and outputs them in a .tsv
R ★ 7 13y ago
get-github-repo-descriptions
Gets the descriptions for a specified number of public GitHub repositories.
Python ★ 7 12y ago
movie-revenue-ratings
Code + Visualizations for Movie Review Aggregator Ratings Have No Relationship with Box Office Success
Jupyter Notebook ★ 7 10y ago
stack-overflow-questions
R Code + R Notebook on how to process and analyze Stack Overflow data.
★ 7 8y ago
fast-style-transfer ⑂
Fast Style Transfer in TensorFlow ⚡🖥🎨🖼
Python ★ 6 9y ago
display-justin-timberlake-sexiness
An HTML snippet that displays a handsome picture of Justin Timberlake in the lower-right corner.
★ 5 13y ago
bootstrap-resample-notebook
Jupyter notebook + additional R code for replicating Bootstrap Resampling
R ★ 5 10y ago
interactive-network
R Code + R Notebook for creating an interactive graph network using R and Plotly.
HTML ★ 5 9y ago
treemap-of-reddit-top-subreddits
This script creates a tree map of Reddit's Top 100 Subreddits by # of Link submissions to the subreddit.
R ★ 4 12y ago
sfo-jfk-flights
Visualizations of Flights between SFO and JFK
★ 4 6y ago
facebook-reactions
Notebook + Data for my blog post Facebook Reactions and the Problem With Quantifying Likes Differently
Jupyter Notebook ★ 4 10y ago
developer-graphs
Creates charts of developer skills using R and ggplot2
R ★ 3 12y ago
LightGBM ⑂
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.
C++ ★ 3 9y ago
imdb-embeddings
No description.
Jupyter Notebook ★ 3 1y ago
llm-person-identification
No description.
Jupyter Notebook ★ 3 11mo ago
mesh-transformer-jax ⑂
Model parallel transformers in JAX and Haiku
Jupyter Notebook ★ 3 5y ago
youtube-category-duration-chart
Code and data needed to reproduce The Relationship Between YouTube Video Category and Length of the Video
R ★ 3 11y ago
get-statuses-from-facebook-page-for-stat-analysis
Gets all the statuses for a brand's Facebook page, tabulates their likes/comment count, and outputs a .tsv
R ★ 3 13y ago
first-comment
R Code + R Notebook for querying, analyzing, and visualizing the Reddit data to determine the impact of the first comment in a Reddit thread.
HTML ★ 3 9y ago
claude-haiku-jailbreak
No description.
Jupyter Notebook ★ 2 8mo ago
youtube_scraper_opus
No description.
Jupyter Notebook ★ 2 4mo ago
StyleCLIP ⑂
No description.
Jupyter Notebook ★ 2 5y ago
techcrunch-1999
A CSS snippet used with Stylish to take TechCrunch back to the past.
★ 2 13y ago
octoflat ⑂
An Octopress theme based off Twitter Bootstrap and Designmodo's Flat-UI
JavaScript ★ 2 13y ago
techcrunch-recent-comments
Retrieves the comments from recent TechCrunch posts made by a specified User.
Python ★ 2 12y ago
founder-distribution-data
Code and methodology for reproducing Gender Founder data.
R ★ 2 12y ago
MediumFox ⑂
A theme for Octopress that is simple, focused, and clean. Influenced by the clean style of Medium and FoxSlide
JavaScript ★ 2 11y ago
youtube-duration-date
Code and data needed to reproduce The Relationship Between YouTube Video Category and Length of the Video
R ★ 2 11y ago
lets-code-1
Repository for Let's Code 1 scripts and images.
R ★ 2 10y ago
movie-data-sanity-checking
Notebook + Result Data for Sanity Checking movie data from OMDb
Jupyter Notebook ★ 2 10y ago
atari-ai ⑂
No description.
★ 2 10y ago
gpt-2 ⑂
Code for the paper "Language Models are Unsupervised Multitask Learners"
Python ★ 2 7y ago
reddit-imgur-animation
R + ggplot2 code for an animation of Imgur's decline on Reddit
TSQL ★ 2 6y ago
llm-blueberry
No description.
Jupyter Notebook ★ 1 10mo ago
List-of-all-Foods ⑂
Exhaustive list of all foods & food items in the world. Crawled from Wikipedia.
★ 1 9y ago
gender-course
Repository containing code which reproduces the gender analysis on MIT/Harvard's Online Courses
R ★ 1 12y ago
show-hn
Data for Hacker News Show HN submissions since 7/11/14 + analysis
R ★ 1 12y ago
sigmajs-test
Test sigma.js and GitHub Pages
JavaScript ★ 1 10y ago
agdq-2016
Code and Visualizations for processing AGDQ Donation Data
Jupyter Notebook ★ 1 10y ago
kubectl-proxy ⑂
A `kubectl proxy` sidecar
Dockerfile ★ 1 7y ago