danielmiessler/Substrate


Substrate

Infrastructure for Human Knowledge & Progress

GitHub last commit License: MIT Stars TypeScript Bun

What Is It • Data • Quick Start • Docs • Contribute • Roadmap


🎯 What Is Substrate?


A Shared Foundation for Human Progress

Think of Substrate as the base layer: the common ground where we can all work together to understand problems and drive solutions forward. Instead of fragmented discussions and reinvented wheels, Substrate gives us one place to:

  • Document problems → What's actually broken, with evidence
  • Track solutions → What works, what doesn't, with results
  • Connect progress → Link the people, organizations, and projects actually moving things forward
  • Build on each other → Arguments and ideas that build on shared evidence
  • Measure outcomes → Did it work? What actually changed?

The Purpose: Accelerate Human Progress

We can't solve problems we don't understand. We can't build on solutions we can't find. Substrate provides:

  • 🎯 Shared understanding → One place to understand what's wrong and what works
  • 🚀 Faster progress → Build on existing knowledge instead of starting over
  • 📊 Evidence-based action → Ground decisions in authoritative data, not opinions
  • 🔗 Connected knowledge → See how problems, solutions, people, and data interconnect
  • 🌍 Collective intelligence → Human insight + AI analysis working together toward progress

An open-source framework connecting 17+ knowledge components:

```mermaid
graph TB
    subgraph "🌍 The Substrate"
        A[🧩 Problems]
        B[💡 Solutions]
        C[📊 Data Sources]
        D[🗣️ Arguments]
        E[📋 Claims]
        F[👥 People]
        G[🏢 Organizations]
        H[🚀 Projects]
        I[📈 Plans]
        J[🎯 Values]
        K[💭 Ideas]
    end

    L[👤 Human Contributors] --> A
    L --> B
    L --> D
    M[🤖 AI Analysis] --> A
    M --> B
    M --> D

    A -.connects to.-> B
    B -.connects to.-> C
    D -.connects to.-> C
    E -.connects to.-> C
    F -.connects to.-> G
    G -.connects to.-> H
    H -.connects to.-> B

    A --> N[🔍 Shared Understanding]
    B --> N
    C --> N
    D --> N
```

🏗️ Structured Components

  • Problems - Documented challenges with evidence
  • Solutions - Proven approaches with results
  • Arguments - Reasoning chains with quality scores
  • Claims - Assertions linked to evidence
  • Plans - Actionable strategies with metrics
  • Ideas - Frameworks and concepts
  • People & Organizations - Who's working on what
  • Projects - Active initiatives with outcomes

📊 Authoritative Data

  • 13 Data Sources across 100+ years
  • Library science methodology (8 dimensions)
  • Government agencies + verified databases
  • TypeScript automation with Bun runtime
  • Full documentation for every dataset
  • Free APIs with excellent access
  • Human wellbeing indicators beyond GDP
  • Real-time to annual update frequencies

🚀 What's New

Tip

October 2025 - Massive Data Expansion!

We just added 8 authoritative data sources measuring human wellbeing beyond traditional economic metrics. Now track financial stress, health crises, social connection, worker agency, and environmental health.

📅 Latest Updates

🔥 October 27, 2025 - Human Wellbeing Infrastructure

5 NEW US Wellbeing Data Sources:

| Added | Source | What It Measures |
|---|---|---|
| 🆕 | DS-00004 FRED Economic Wellbeing | Debt stress, unemployment, inequality |
| 🆕 | DS-00005 CDC WONDER Mortality | Overdoses, suicides, deaths of despair |
| 🆕 | DS-00006 Census ACS Social | Living alone, commute times, isolation |
| 🆕 | DS-00007 BLS JOLTS Labor | Worker "permission to quit" index |
| 🆕 | DS-00008 EPA Air Quality | PM2.5, ozone, environmental health |

Why This Matters:

"If we measure only GDP and unemployment, we will miss the slow-motion collapse of human thriving happening in plain sight."

These sources capture:

  • 💰 Leading indicators - Financial stress predicts recessions
  • 🎭 Behavioral truth - Actions reveal what surveys miss
  • 🏗️ Structural determinants - Air quality you can't "self-care" out of
  • 🚨 Crisis detection - County-level suffering visibility
  • 🆓 Worker agency - Quit rate = economic freedom measure

October 25, 2025 - Core Data Updates

  • ✅ Pulitzer Prize, GDP, inflation data refreshed
  • ✅ Knowledge Worker Salaries validation completed

October 16-18, 2025 - Infrastructure

  • 🏗️ Library science methodology (8-dimension evaluation)
  • ⚡ TypeScript automation with auto-discovery
  • 📊 Central logging and health monitoring
  • 📚 Complete documentation suite

📜 Previous Updates

September 2024 - Community Growth

  • ✅ Claims, Arguments, and Values frameworks
  • ✅ 6+ community contributors
  • ✅ 10+ merged pull requests
  • ✅ TELOS integration

August 2024 - Framework Expansion

  • ✅ Extended component types (Plans, Ideas, Models)
  • ✅ Improved documentation structure
  • ✅ Enhanced contribution guidelines

July 2024 - Foundation

  • ✅ Single-repo structure with 17+ object types
  • ✅ Core framework and documentation
  • ✅ Public launch with initial datasets
  • ✅ Community contribution framework

→ View Full Changelog


📊 Data & Evidence

Note

All data sources include complete library science cataloging with 8-dimension evaluation: Authority, Currency, Objectivity, Accuracy, Methodology, Coverage, Reliability, and Provenance.
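
One way to make the eight dimensions machine-checkable is a typed record, sketched below. The dimension names come from the note above; the 1-5 integer scale, the `Evaluation` type, and both helper functions are assumptions made for illustration, not the repository's actual schema.

```typescript
// Dimension names are taken from the note above.
// The 1-5 integer scale is an assumption, not something the source specifies.
const DIMENSIONS = [
  "authority", "currency", "objectivity", "accuracy",
  "methodology", "coverage", "reliability", "provenance",
] as const;

type Dimension = (typeof DIMENSIONS)[number];
type Evaluation = Record<Dimension, number>;

// Returns the dimensions whose score is missing or not an integer in 1..5.
function invalidDimensions(e: Partial<Evaluation>): Dimension[] {
  return DIMENSIONS.filter((d) => {
    const score = e[d];
    return score === undefined || !Number.isInteger(score) || score < 1 || score > 5;
  });
}

// Unweighted mean across all eight dimensions (a hypothetical summary statistic).
function meanScore(e: Evaluation): number {
  const total = DIMENSIONS.reduce((sum, d) => sum + e[d], 0);
  return total / DIMENSIONS.length;
}
```

An unweighted mean treats all dimensions as equally important; a real catalog might weight Authority or Provenance more heavily.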

Important

We know data can be controversial. That's exactly why we:

  • 📊 Collect from multiple sources - Cross-reference data from different authoritative providers
  • 🔍 Provide complete transparency - Every source fully documented with provenance, methodology, and limitations
  • 📝 Full logging - All data pulls logged with timestamps, source versions, and processing steps
  • 🔓 Open source everything - TypeScript update scripts show exactly how data is fetched and transformed

You can verify, audit, and challenge our data. That's the point.

Core Datasets (Data/)

| Dataset | Time Span | Data Points | Source | Status |
|---|---|---|---|---|
| 🇺🇸 US GDP | 1929-2025 | 96 years + 314 quarters | FRED/BEA | ✅ Active |
| 💵 US Inflation | 1947-2025 | 945 months | FRED/BLS | ✅ Active |
| 🦠 Bay Area COVID Wastewater | 2022-2025 | 161 weeks | CDPH | ✅ Active |
| 🏆 Pulitzer Prize Winners | 1918-2024 | 249 winners | Wikidata | ✅ Active |
| 💼 Knowledge Worker Salaries | Global | Multi-region | Research | ✅ Active |

Wellbeing Data Sources (Data-Sources/) 🆕

🌍 Global Health & Development

| ID | Name | Coverage | Indicators | Update |
|---|---|---|---|---|
| DS-00001 | WHO Global Health Observatory | 194 countries | 2,000+ health indicators | Quarterly |
| DS-00002 | UN SDG Indicators | 193 countries | 231 SDG indicators | Biannual |
| DS-00003 | World Bank Open Data | Global | Development metrics | Varies |

🇺🇸 US Human Wellbeing (October 2025)

| Source | Key Indicators | Why It Matters | Update Freq |
|---|---|---|---|
| DS-00004 FRED Economic | Debt service ratio; Credit card delinquency; Financial stress index; Underemployment (U-6); Consumer sentiment; GINI inequality | 💰 Economic Security Foundation. Debt >12% = stress, >14% = crisis. Leading indicator of recession. | Weekly to Annual |
| DS-00005 CDC WONDER | Drug overdose deaths; Opioid-specific deaths; Suicide rates; All-cause mortality | 🚨 Crisis Detection. Deaths of despair reveal community breakdown surveys miss. County-level visibility. | Annual (1-2yr lag) |
| DS-00006 Census ACS | Living alone (isolation); Commute times (time poverty); No internet (digital divide); Household demographics | 🤝 Social Connection. Structural isolation and time poverty constrain flourishing. Census tract granularity. | Annual |
| DS-00007 BLS JOLTS | Quit rate (worker agency); Job openings; Hire rate; Layoff/discharge rate | 🆓 "Permission to Quit". People only quit when they have options. Measures economic freedom and dignity. | Monthly |
| DS-00008 EPA Air Quality | PM2.5 (particulate matter); Ozone (O3); SO2, CO, NO2; AQI exceedances | 🌫️ Environmental Health. You can't "self-care" out of breathing toxic air. Structural wellbeing constraint. | Real-time |
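
The thresholds quoted for DS-00004 ("Debt >12% = stress, >14% = crisis") can be read as a simple classification rule. The sketch below assumes they apply to the debt service ratio expressed in percent, and that values of exactly 12 or 14 fall in the lower band. The function and type names are hypothetical.

```typescript
type DebtLevel = "normal" | "stress" | "crisis";

// Classifies a debt ratio (in percent) using the >12 / >14 thresholds quoted above.
// Assumption: the thresholds refer to the debt service ratio; boundaries are exclusive.
function classifyDebtRatio(ratioPercent: number): DebtLevel {
  if (!Number.isFinite(ratioPercent) || ratioPercent < 0) {
    throw new RangeError("ratioPercent must be a non-negative finite number");
  }
  if (ratioPercent > 14) return "crisis";
  if (ratioPercent > 12) return "stress";
  return "normal";
}
```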

Composite Wellbeing Indices

Tip

Combine multiple data sources to create powerful wellbeing metrics:

  • 💸 Financial Stress Composite - Debt + delinquency + evictions + stress index
  • 🚨 Crisis Alert Composite - Overdoses + suicides + long-term unemployment
  • 🤝 Community Health Composite - Living alone + commute + digital divide (inverted)
  • 🆓 Worker Agency Index - Quit rate + job openings / unemployment
  • 🌫️ Environmental Health Index - PM2.5 + ozone (inverted)
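
The list above names the inputs to each composite but not how they are combined. One common approach, sketched here as an assumption rather than the project's method, is to convert each indicator to a z-score against its own history, flip the sign of indicators marked "(inverted)", and average the results. The `Indicator` shape and both function names are hypothetical.

```typescript
interface Indicator {
  name: string;
  value: number;    // latest observation
  mean: number;     // historical mean used as the baseline (assumed to be available)
  stdDev: number;   // historical standard deviation, must be > 0
  invert?: boolean; // true when a higher raw value means worse wellbeing
}

// Standardizes one indicator; inverted indicators have their sign flipped
// so that a higher score always means better wellbeing.
function zScore(ind: Indicator): number {
  if (ind.stdDev <= 0) throw new RangeError(`stdDev must be positive for ${ind.name}`);
  const z = (ind.value - ind.mean) / ind.stdDev;
  return ind.invert ? -z : z;
}

// Equal-weight average of the standardized indicators.
function compositeIndex(indicators: Indicator[]): number {
  if (indicators.length === 0) throw new RangeError("at least one indicator is required");
  const total = indicators.reduce((sum, ind) => sum + zScore(ind), 0);
  return total / indicators.length;
}
```

Standardizing first keeps indicators with large raw units (for example, job openings in thousands) from dominating those expressed as small percentages.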

→ Wellbeing Implementation Guide | → Data Directory


🤖 Human & AI Collaboration

Substrate provides the pieces. You and your AI create the connections.

👤 Humans Contribute

  • Document problems and solutions
  • Add authoritative data sources
  • Create arguments and claims
  • Link entities explicitly
  • Validate AI suggestions
  • Rate quality of evidence

🤖 AI Analyzes

  • Scan thousands of components
  • Suggest relationships automatically
  • Detect patterns across datasets
  • Score argument quality
  • Find contradictions
  • Generate knowledge graphs

Everything is designed for dual consumption:

  • ✅ Human-readable - Markdown and CSV anyone can open
  • ✅ Machine-parseable - Consistent formats AI can query
  • ✅ Fully documented - Complete methodology for every dataset
  • ✅ Linked with IDs - Unambiguous entity references
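
Data source IDs in this README follow the pattern `DS-` plus five zero-padded digits (for example `DS-00004`). A minimal sketch of validating and generating such IDs follows. Treating the five-digit width as a fixed rule is an assumption drawn from the examples shown here, and the helper names are hypothetical.

```typescript
// Matches IDs such as "DS-00004". The fixed five-digit width is inferred
// from the examples in this README, not from a published specification.
const DATA_SOURCE_ID = /^DS-(\d{5})$/;

// Returns the numeric part of a data source ID, or null if the ID is malformed.
function parseDataSourceId(id: string): number | null {
  const match = DATA_SOURCE_ID.exec(id);
  return match ? Number(match[1]) : null;
}

// Builds an ID from a sequence number, e.g. 8 becomes "DS-00008".
function formatDataSourceId(n: number): string {
  if (!Number.isInteger(n) || n < 0 || n > 99999) {
    throw new RangeError("sequence number must be an integer in 0..99999");
  }
  return `DS-${String(n).padStart(5, "0")}`;
}
```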

Use Cases

🔍 OSINT & Investigation

  • Cross-reference public records, corporate filings, government data
  • Link people → organizations → transactions → policies
  • Surface connections invisible in isolated databases
  • Build evidence chains from claims to verifiable records
  • Pattern detection for fraud, corruption, illicit networks

📊 Research & Analysis

  • Track claims against authoritative data sources
  • Evaluate argument quality based on evidence backing
  • Compare solutions across different implementations
  • Measure progress toward stated goals with real metrics
  • Cross-correlate economic, health, social, environmental data

🌐 Public Accountability

  • Verify political claims against documented evidence
  • Track campaign promises → policy outcomes → measured results
  • Link donations → voting records → policy positions
  • Monitor government spending against stated objectives
  • Environmental justice analysis (who breathes toxic air?)
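
Several of the use cases above amount to following explicit links from one entity to another. As a rough illustration, the sketch below runs a breadth-first search over directed links and returns the shortest chain between two entities. The `Link` shape, the `findChain` name, and the entity labels in the usage note are all hypothetical; the README does not describe how links are stored.

```typescript
type Link = { from: string; to: string };

// Breadth-first search over directed links.
// Returns the shortest chain from start to goal, or null if none exists.
function findChain(links: Link[], start: string, goal: string): string[] | null {
  // Build an adjacency list: entity -> entities it links to.
  const next = new Map<string, string[]>();
  for (const { from, to } of links) {
    const targets = next.get(from) ?? [];
    targets.push(to);
    next.set(from, targets);
  }

  // cameFrom records how each entity was first reached, for path reconstruction.
  const cameFrom = new Map<string, string | null>([[start, null]]);
  const queue: string[] = [start];

  while (queue.length > 0) {
    const node = queue.shift() as string;
    if (node === goal) {
      const path: string[] = [];
      for (let cur: string | null = node; cur !== null; cur = cameFrom.get(cur) ?? null) {
        path.unshift(cur);
      }
      return path;
    }
    for (const neighbor of next.get(node) ?? []) {
      if (!cameFrom.has(neighbor)) {
        cameFrom.set(neighbor, node);
        queue.push(neighbor);
      }
    }
  }
  return null;
}
```

For example, with links `person:a → org:b → project:c → solution:d`, calling `findChain(links, "person:a", "solution:d")` yields the four-step chain in order.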

🚀 Quick Start

View the Data (No Installation Required)

All datasets are available as CSV and Markdown files you can browse directly:

```bash
# Clone the repository
git clone https://github.com/danielmiessler/Substrate.git
cd Substrate

# Explore core datasets
ls Data/*/

# Explore wellbeing data sources
ls Data-Sources/*/
```

Run the Automation (Optional)

Warning

Requires Bun runtime. Install: curl -fsSL https://bun.sh/install | bash

```bash
# Install dependencies (run from the repository root)
bun install

# Update a specific dataset (the subshell returns you to the repository root)
(cd Data/US-GDP && bun run update.ts)

# Update a wellbeing source (requires API key)
export FRED_API_KEY="your_key_here"
(cd "Data-Sources/DS-00004—FRED_Economic_Wellbeing" && bun run update.ts)

# Update all datasets
bun run scripts/update-all.ts
```

Get API Keys (Free)

| Data Source | Get Key | Rate Limit |
|---|---|---|
| FRED Economic | fred.stlouisfed.org/docs/api | 120 req/min |
| Census ACS | api.census.gov/data/key_signup | 500 req/day |
| EPA Air Quality | Email: aqs.support@epa.gov | 10 req/min |
| BLS JOLTS | bls.gov/developers/home | 500 req/day |
| CDC WONDER | No key required | Fair use |
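
To stay within these limits, an update script can space its requests evenly and back off when a call fails. The sketch below only computes the delays; it is not the repository's implementation. The function names, the one-second base delay, and the 30-second cap are assumptions.

```typescript
type RateWindow = "minute" | "day";

const WINDOW_MS: Record<RateWindow, number> = { minute: 60_000, day: 86_400_000 };

// Minimum spacing between requests (in ms) that keeps an evenly paced
// client within a limit such as "120 req/min" or "500 req/day".
function minIntervalMs(requests: number, per: RateWindow): number {
  if (!Number.isFinite(requests) || requests <= 0) {
    throw new RangeError("requests must be a positive number");
  }
  return Math.ceil(WINDOW_MS[per] / requests);
}

// Exponential backoff schedule: baseMs, 2*baseMs, 4*baseMs, ... capped at capMs.
// The default base and cap are illustrative values, not documented settings.
function backoffDelaysMs(retries: number, baseMs = 1_000, capMs = 30_000): number[] {
  return Array.from({ length: retries }, (_, attempt) =>
    Math.min(capMs, baseMs * 2 ** attempt),
  );
}
```

With the limits in the table, `minIntervalMs(120, "minute")` gives 500 ms for FRED and `minIntervalMs(10, "minute")` gives 6000 ms for EPA Air Quality.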

→ Complete Getting Started Guide | → Quick Reference


💡 Contribute

Important

Anyone can submit components. No gatekeeping on ideas, just structured formats.

What You Can Add

📋 Problems

  • Water quality issues
  • Healthcare access gaps
  • Climate change impacts

💡 Solutions

  • Filtration systems
  • Telemedicine networks
  • Infrastructure adaptation

📈 Plans

  • Political platforms
  • Policy proposals
  • Action roadmaps

🗣️ Arguments

  • "This works because X, Y, Z"
  • "This failed due to A, B"
  • (We don't judge; the community rates)

📊 Data

  • Public records
  • Research datasets
  • Compiled statistics

💭 Ideas

  • Theoretical frameworks
  • Novel measurement approaches
  • Conceptual models

👥 People & Orgs

  • Researchers on problems
  • Organizations implementing
  • Projects with outcomes

📏 Metrics

  • Success criteria
  • Measurement frameworks
  • Progress indicators

🎯 Values

  • Guiding principles
  • Evaluation criteria

How to Submit

  1. Fork the repository on GitHub
  2. Add your component in the appropriate directory (Problems/, Solutions/, etc.)
  3. Follow the format in that directory's README
  4. Submit a Pull Request

Note

We're building a web interface to make this easier for non-technical contributors!

→ Contribution Guidelines


📚 Documentation

Getting Started

Technical


🗺️ Roadmap

✅ Completed

Phase 1: Foundation (July 2024)
  • ✅ Single-repo structure with 17+ object types
  • ✅ Core framework and documentation
  • ✅ Public launch with initial datasets
  • ✅ Community contribution framework
Phase 2: Community (Aug-Sep 2024)
  • ✅ Claims, Arguments, and Values frameworks
  • ✅ 6+ community contributors
  • ✅ 10+ merged pull requests
  • ✅ TELOS integration
Phase 3: Data Infrastructure (Oct 2025) 🔥
  • ✅ 13 authoritative data sources (5 core + 8 wellbeing)
  • ✅ Library science methodology (8-dimension evaluation)
  • ✅ TypeScript automation system with Bun runtime
  • ✅ 6,000+ lines of documentation across all sources
  • ✅ Comprehensive wellbeing indicators (economic, health, social, labor, environmental)
  • ✅ Free API access with rate limiting and retry logic

🚧 Planned

Phase 4: Enhanced Access (Q4 2025 - Q1 2026)

  • 🎨 Web-based contribution interface (no coding required)
  • 📊 Interactive data visualizations
  • 🔌 RESTful API for programmatic access
  • 📱 Mobile-friendly exploration

Phase 5: Dataset Expansion (2026)

  • 🌍 Additional international sources (UNICEF, OECD, IHME)
  • ⚡ Real-time data feeds integration
  • 🗳️ Community-driven dataset requests
  • 🤝 Partnerships with research institutions

Phase 6: Intelligence Layer (2026+)

  • 🤖 Automated relationship discovery
  • 📈 Confidence scoring for AI-suggested links
  • 🎯 Pattern detection algorithms
  • 🔔 Email/Slack notifications for data updates
  • 📚 Machine-readable catalog (DCAT/CKAN compliance)

🔗 Integration with TELOS

Substrate provides evidence. TELOS provides intention.

| TELOS (Goals & Strategy) | Substrate (Evidence & Solutions) |
|---|---|
| Goals - What you want to achieve | Problems - What stands in the way |
| Strategies - How you'll pursue it | Solutions - Proven approaches |
| Challenges - Obstacles you face | Data - Measured evidence |
| Metrics - Progress tracking | Plans - Implementation roadmaps |

Together: a complete system for intention + evidence-based action.


🙏 Credits

Created By

Daniel Miessler • July 2024

Special Thanks

Inspiration & Contributions:

  • Jonathan Dunn @xssdoctor - Similar vision and collaboration
  • Joel Parish - Structure wisdom and guidance
  • Joseph Thacker - Continuous flow of innovative ideas

Community Contributors:

@ThatNateGuy • @JaymanW • @karai114 • @DesertEaglePWN • @ktfth


📊 Repository Stats

Data: 13 sources • 1,700+ data points • 107-year span (1918-2025)

Code: TypeScript automation • 6,000+ lines of documentation • GitHub Actions

Community: 6+ contributors • 10+ PRs merged • 17 object types


Built with ❤️ for human understanding and progress

Powered by TypeScript • Bun • Open Data
