Websites I Run
Data Repositories I Run
Software Frameworks / Libraries
nsq.io - realtime distributed message processing at scale
gomrjob - Go framework for running MapReduce jobs on Hadoop or Dataproc
git-open-pull - convert an issue to a pull request from the CLI
private_s3_httpd - HTTP server for private Amazon S3 content
lru - Go library for caching arbitrary data with a least-recently-used (LRU) eviction strategy
sortdb - HTTP API for querying data in a sorted CSV file
data_hacks - command-line tools for data analysis
urlnorm - Python library for URL normalization
json2csv - convert a stream of JSON messages to CSV
little_bigtable - emulator for Google Bigtable with sqlite3 persistence
Archived Projects