-
pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Python ★ 10k 4d ago
-
markovify
A simple, extensible Markov chain generator.
Python ★ 3.4k 2y ago
-
waybackpack
Download the entire Wayback Machine archive for a given URL.
Python ★ 3.2k 1y ago
-
nbpreview
Render Jupyter/IPython notebooks without running a notebook server.
CSS ★ 309 1y ago
-
notebookjs
Render Jupyter/IPython notebooks on the fly, in the browser. (Or on the command line, if you'd like.)
JavaScript ★ 296 1y ago
-
spectra
Easy color scales and color conversion for Python.
Python ★ 262 1y ago
-
envplus ▣
Combine your Python virtualenvs.
Python ★ 115 9y ago
-
weightedcalcs
Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.
Python ★ 113 1y ago
-
reporter
Literate data analysis with IPython notebooks and Jekyll.
Ruby ★ 92 12y ago
-
intro-to-visidata
Source files for "An Introduction to VisiData"
HTML ★ 80 1y ago
-
twick ▣
Twitter, quick. Fetch and store tweets on short notice.
Python ★ 79 9y ago
-
visidata-plugins
A place for me to share VisiData plugins I've written.
Python ★ 39 4y ago
-
mplstyle
A simple API for setting matplotlib styles, as well as a repository of nice styles.
Python ★ 33 12y ago
-
visidata-cheat-sheet
A one-page cheat sheet for VisiData, available in multiple languages.
HTML ★ 30 2y ago
-
gekyll
A Jekyll plugin for using Git repositories as posts, giving you access to a post's commits, diffs, and more.
Ruby ★ 25 13y ago
-
tab-bankrupter
A Chrome extension for declaring "tab bankruptcy" without losing all your links.
JavaScript ★ 22 4y ago
-
nbexec
A simple tool for executing Jupyter notebooks from the command line.
Python ★ 22 3y ago
-
Backbone.Table ▣
Render any Backbone.js Collection as an HTML table.
JavaScript ★ 20 14y ago
-
buzzfeed-news-trending-strip
Dataset: BuzzFeed News “Trending” Strip, 2018–2023
Python ★ 18 3y ago
-
astronomer
Fetch information about the users who've starred a given GitHub repository.
Python ★ 16 12y ago
-
nicar-2024-pdfplumber-workshop
No description.
Jupyter Notebook ★ 14 2y ago
-
txtbirds
‾‾\/‾‾
JavaScript ★ 14 13y ago
-
virtualenv-recipes
Recipes for useful Python virtualenvs.
Shell ★ 13 11y ago
-
fbpagefeed
A library and command-line tool for fetching Facebook Pages' published posts.
Python ★ 13 9y ago
-
tinyapi
Python wrapper around TinyLetter's publicly accessible — but undocumented — API.
Python ★ 13 9y ago
-
data-tactics
Half-baked idea: Conceptual building blocks for data analysis.
★ 12 11y ago
-
vinejs
Somewhere between a total joke and a useful library for fetching Vine.co videos.
JavaScript ★ 11 13y ago
-
tinystats
Command-line tool for fetching message, URL, and subscriber data for the TinyLetter newsletters you own.
Python ★ 11 10y ago
-
google-table-converter
A browser-based tool for converting Google Spreadsheets into responsive HTML <table>s.
HTML ★ 10 10y ago
-
mta-colors
CSS & JSON files to help developers use the official colors of New York's Metropolitan Transportation Authority.
CSS ★ 10 12y ago
-
lede-2023
No description.
Jupyter Notebook ★ 9 2y ago
-
nicar-2025-pdfplumber-workshop
No description.
Jupyter Notebook ★ 9 1y ago
-
compleat
Fetch autocomplete suggestions from Google Search.
Python ★ 8 12y ago
-
nicar-2015-schedule
NICAR 2015 conference schedule as CSV and JSON, plus the underlying Python scraper.
Python ★ 8 11y ago
-
gifparse
[Work in progress.] Parse the GIF 89a file format, down to the minor details. Pure Python, no dependencies.
Python ★ 8 12y ago
-
lede-2024
No description.
Jupyter Notebook ★ 7 1y ago
-
nicar-2023-pdfplumber-workshop
No description.
Jupyter Notebook ★ 7 2y ago
-
nicar-2017-schedule
NICAR 2017 conference schedule as JSON and CSV, plus the underlying Python scraper.
Python ★ 6 9y ago
-
WRIT1-CE9741
WRIT1-CE9741, Fall 2013, NYU School of Continuing and Professional Studies
Ruby ★ 6 12y ago
-
csvcat
Efficiently concatenate CSVs (or other tabular text files), stripping extra header lines.
Shell ★ 6 11y ago
-
pdfminer.six ⑂
Community maintained fork of pdfminer
Python ★ 5 3y ago
-
babynames
CSVs and parsers for the Social Security Administration's historical baby name data.
Python ★ 5 12y ago
-
visidata ⑂
A terminal spreadsheet multitool for discovering and arranging data
★ 4 4y ago
-
statusfiles
IDEA: A simple, structured, standardized, technology-agnostic way to represent the status of things.
★ 4 9y ago
-
macmailer
Command-line utility and Ruby library for creating/sending messages in OSX's Mail.app program.
Ruby ★ 4 13y ago
-
minicard
A bare-bones CSS stylesheet for creating "card"-style elements.
CSS ★ 4 12y ago
-
csv-diff ⑂
Python CLI tool and library for diffing CSV and JSON files
Python ★ 3 5y ago
-
nicar-now
Your unofficial guide to what's happening next at NICAR 2020.
★ 3 6y ago
-
nicar-2018-schedule
Your unofficial guide to what's happening next at NICAR 2018.
Python ★ 3 3y ago
-
docs2csv ⑂
Scan a folder of document files of all types and extract the text into a CSV suitable for Overview
Ruby ★ 3 10y ago
-
text-toggle
Let readers toggle between two versions of a text.
JavaScript ★ 3 14y ago
-
parabear
An experiment in stupid-simple HTML article text extraction.
JavaScript ★ 2 14y ago
-
tabletop ⑂
Tabletop.js gives spreadsheets legs
JavaScript ★ 2 14y ago
-
linstapaper
Article-list and site files for linstapaper.com
JavaScript ★ 2 13y ago
-
fidget
Fidget.js is a small, configurable JavaScript library that resizes blocks of text to fit their containers.
JavaScript ★ 2 15y ago
-
glat-glong
Find the precise latitude and longitude of any point on Google Maps. A Chrome extension.
JavaScript ★ 2 12y ago
-
klaxon ⑂
Klaxon enables reporters and editors to monitor scores of sites on the web for newsworthy changes.
Ruby ★ 2 9y ago
-
download-all-attachments-from-a-gmail-conversation
Two methods that *seem* to work...
★ 2 9y ago
-
csvkit ⑂
A suite of utilities for converting to and working with CSV, the king of tabular file formats.
Prolog ★ 2 10y ago
-
gmap-button
A JavaScript library for adding buttons to embedded Google Maps.
JavaScript ★ 2 12y ago
-
jub
As in, "get the jub done." Or as in, "jQuery, Underscore, Backbone." It's a shell script that automatically grabs the latest versions of those libraries, so that you can get on with prototyping.
Shell ★ 2 13y ago
-
crochet
Hook into and/or monkeypatch any Ruby class- or instance-method. Provides 'before' and 'after' hooks, plus their destructive evil twins.
Ruby ★ 2 13y ago
-
round-robin
Goofing around with collaborative storytelling. Fork me!
★ 1 13y ago
-
learninglunches ⑂
Materials for a series of learning lunches on news development topics.
★ 1 13y ago
-
chrollusion ⑂
Collusion for Chrome.
JavaScript ★ 1 14y ago
-
2015-11-lottery-simulations
An attempt to simulate the long-term net profit/loss of people who buy New York state lottery tickets on a regular basis.
★ 1 10y ago
-
warn-scraper ⑂
Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites
★ 1 4y ago
-
weddingroulette
The code behind http://weddingroulette.com/
Ruby ★ 1 13y ago
-
fbiter
A simple library for iterating through paginated Facebook API endpoints.
Python ★ 1 9y ago
-
nicar-2019-schedule
The NICAR 2019 conference schedule as JSON and CSV files, plus the underlying Python scraper.
Python ★ 1 3y ago
-
nbtemplate
Render IPython notebooks to other layouts, via templates. Library and command-line tool.
Python ★ 1 12y ago
-
cartodb-nodejs ⑂
CartoDB Node.js OAuth example
JavaScript ★ 1 14y ago
-
learn.jquery.com ⑂
learn.jquery.com web site
JavaScript ★ 1 14y ago
-
zombie ⑂
Insanely fast, full-stack, headless testing using node.js
CoffeeScript ★ 1 14y ago
-
griddle
Griddle.js is a lightweight tool for creating and manipulating programmable, fluid, shift-able grids.
JavaScript ★ 1 15y ago
-
jekyll-auto-s3
Automatically sync your Jekyll project to S3 on every (re)build.
Ruby ★ 0 13y ago
-
declanrjb-dlp-inmate-complaints ⑂
Volunteer data cleaning for the Data Liberation Project, focused on national inmate complaints dataset
★ 0 1y ago
-
aphis-inspection-reports-flags ⑂
Citations data from USDA's Animal and Plant Health Inspection Service, flagged for various phenomena of public interest.
Jupyter Notebook ★ 0 2y ago
-
mortgage-application-analysis-for-futuro-investigates
Code and data supporting Futuro Investigates’ examination of mortgage application outcomes in New Jersey
Jupyter Notebook ★ 0 2y ago
-
LoungeStats
Profitgraph for CSGOLounge and DOTA2Lounge
JavaScript ★ 0 10y ago
-
SDI-Health ⑂
Dissemination of harmonization code and data for SDI Health surveys
Stata ★ 0 7y ago
-
posix-spawn ⑂
Fast Process::spawn for Rubys >= 1.8.7 based on the posix_spawn() system interfaces
Ruby ★ 0 13y ago