Flood Map Reality Gap
Quantifying where FEMA flood maps have fallen behind reality — 79.4M daily precipitation observations, 1,099 county map dates, and a lag index that scores where outdated maps meet worsening rainfall.
Every research dive, from data pulls to published findings.
Quantifying where FEMA flood maps have fallen behind reality — 79.4M daily precipitation observations, 1,099 county map dates, and a lag index that scores where outdated maps meet worsening rainfall.
Comprehensive market analysis across 26,297 ZIP codes using data from Zillow, Redfin, Census, FRED, and BLS. 13 research entries covering affordability, investment opportunities, price trends, and market dynamics.
A data-driven look at the structure of the US banking system — 4,408 FDIC-insured banks, $25.5 trillion in assets, and the striking concentration of financial power in a handful of institutions.
Mining 10-K risk factors, 8-K filings, and proxy statements to track language contagion across the S&P 500 — when novel risk phrases first appear and how they spread industry by industry.
Parsing every SEC Form 4 filing to detect cluster buying and selling behavior, 10b5-1 plan timing anomalies, and director interlock contagion across company boards.
Combining FAA aircraft registration data with ADS-B flight tracking to map which corporate jets visited which airports — a leading indicator for M&A, executive hiring, and PE deal sourcing.
Tracking patent ownership changes to reveal distressed IP sales before bankruptcy, NPE acquisition patterns, and which university research actually gets commercialized.
Using GH Archive and the GitHub API to determine whether 'open source' projects are actually corporate-staffed, and producing a maintainer reality index for the top 500 npm and PyPI packages.
Analyzing Wikipedia's full edit history and hourly pageview counts to detect edit war patterns and pageview anomalies that sometimes precede news events by hours.
Using the CourtListener/RECAP archive to analyze patent venue migration post-TC Heartland, MDL formation timing, and the rise of Southern District of Texas as the new mega-bankruptcy venue.
Extracting and structuring political ad order data from FCC Public Inspection Files — more granular than FEC data, showing which station, which time slot, and which dollar amount for every political buy.
Mapping grant networks, executive compensation, and board interlocks across US nonprofits using IRS Form 990 data — who funds whom, and which directors sit at the center of the donor class network.
A rigorous causal study of how food environment changes affect health outcomes, using Dollar General county entry as a natural experiment with difference-in-differences estimation.
Joining CMS Open Payments (every pharma payment to a physician >$10) with Medicare Part D prescribing data to quantify the dose-response relationship between payments received and drugs prescribed.
Tracking skill demand decay curves for AI-displaced occupations and growth curves for AI-adjacent roles, using BLS data and job posting indices to produce a skill half-life chart for 20 occupations.
A graph-first analysis of US capital: where money sits, where it flows, and which entities exert disproportionate control over money they don't technically own.
Cross-referencing clinical trial registrations with FDA adverse event reports to find where public data diverges from press releases, and detecting safety signals before they appear on drug labels.