
Clojars Download Stats

A complete SQLite database "mirroring" Clojars download statistics (Nov 2012 - present). (Well, the source SQL for it; please read on.)

Why This Exists

Clojars publishes daily download stats. Querying them on the fly can hammer the Clojars servers unnecessarily, and I wanted to run some queries over time. So I downloaded it all and made an SQLite database from it. Then I thought that maybe someone else wants this database too. Therefore this repo provides up-to-date daily download stats as SQL exports, one file per day, plus scripts to import and update.

Batteries Included

  • a Babashka task that creates a fully populated SQLite database for your local querying. The import takes a few minutes once you have cloned the repository to your machine. The same task also keeps your database up to date from the upstream repository.
  • a standalone update (Babashka) script, letting you eject from the upstream repo (saving disk space).

Database layout

erDiagram
    artifacts {
        INTEGER id PK
        TEXT group_id
        TEXT artifact_id
    }
    versions {
        INTEGER id PK
        TEXT version
    }
    downloads {
        TEXT date PK
        INTEGER artifact_id PK,FK
        INTEGER version_id PK,FK
        INTEGER downloads
    }
    artifacts ||--o{ downloads : "has"
    versions ||--o{ downloads : "has"
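The ER diagram above translates to SQLite roughly like this; a minimal sketch with column types and keys taken from the diagram (not the repo's actual DDL), plus one illustrative row and a join across all three tables:

```python
import sqlite3

# Schema inferred from the ER diagram above (a sketch, not the repo's exact DDL)
con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE artifacts (
    id INTEGER PRIMARY KEY,
    group_id TEXT,
    artifact_id TEXT
);
CREATE TABLE versions (
    id INTEGER PRIMARY KEY,
    version TEXT
);
CREATE TABLE downloads (
    date TEXT,
    artifact_id INTEGER REFERENCES artifacts(id),
    version_id INTEGER REFERENCES versions(id),
    downloads INTEGER,
    PRIMARY KEY (date, artifact_id, version_id)
);
""")

# One illustrative row per table, then a join across all three
con.execute("INSERT INTO artifacts VALUES (1, 'hiccup', 'hiccup')")
con.execute("INSERT INTO versions VALUES (1, '1.0.5')")
con.execute("INSERT INTO downloads VALUES ('2024-12-01', 1, 1, 42)")

rows = con.execute("""
SELECT d.date, a.group_id || '/' || a.artifact_id AS artifact, v.version, d.downloads
FROM downloads d
JOIN artifacts a ON a.id = d.artifact_id
JOIN versions  v ON v.id = d.version_id
""").fetchall()
print(rows)  # [('2024-12-01', 'hiccup/hiccup', '1.0.5', 42)]
```

The composite primary key on downloads means one row per (date, artifact, version), which is what makes the import idempotent.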

Usage

Create your database

As of December 2025:

  • Project size: ~6GB, mostly SQL files
  • Import time: ~4 minutes (no progress output while it runs, sorry!)
  • Database size: 2.9GB
# Clone the repo
git clone https://github.com/PEZ/clojars-download-stats
cd clojars-download-stats

# Create a local database from SQL exports
bb files.import ./clojars-downloads.sqlite

# Done! Query with your favorite SQLite tool
sqlite3 ./clojars-downloads.sqlite "SELECT * FROM artifacts LIMIT 5"

Keeping Your Database Up To Date

Option A: Keep the repo (simplest)

After the initial import, just pull and reimport:

git pull
bb files.import ./clojars-downloads.sqlite

The import is incremental, reimporting only the missing month(s) with new data. A daily update takes seconds.

Option B: Eject from the repo (saves disk space)

If you don't want to keep the ~6GB repo around, you can "eject" after the initial import:

# Initial import (from the cloned repo)
bb files.import ~/data/clojars.sqlite

# Copy the standalone update script
cp update_clojars_stats.clj ~/data/

# Now you can delete the cloned repo, if you fancy

To update your database later, run the standalone script:

bb ~/data/update_clojars_stats.clj ~/data/clojars.sqlite

The script fetches new days directly from Clojars—no repo needed. It's idempotent and fills any gap since your last update. If it's a big gap, consider cloning the repo and using bb files.import instead.

For maintainers of the repository

Tasks

# Database tasks (require SQLite)
bb files.import <db-path>           # SQL files → SQLite (incremental if DB exists)
bb db.export <db-path>              # Database → daily SQL files (slow, ~1 hour)
bb clojars.fetch <db-path>          # Fetch missing dates from Clojars → database
bb db.export.update <db-path>       # Fetch + export (for manual updates)
bb db.export.status <db-path>       # Show database and export status
bb db.export.generate-state <db-path> # Generate state.edn from database

# CI tasks (no database required, use state.edn)
bb clojars.export.update            # Fetch missing days → append to SQL files
bb clojars.export.day <date>        # Fetch one specific day → append to SQL files
bb db.export.status                 # Show state.edn summary (no args)

Database Schema

Normalized schema storing ALL Clojars data with version granularity:

artifacts (id, group_id, artifact_id)                 -- ~34K unique libraries
versions (id, version)                                -- ~31K unique version strings
downloads (date, artifact_id, version_id, downloads)  -- ~36M rows

SQL Export Format

Monthly files in data/monthly/YYYY-MM.sql:

-- 2024-12.sql
INSERT INTO artifacts (id, group_id, artifact_id) VALUES ...
INSERT INTO versions (id, version) VALUES ...
INSERT INTO downloads (date, artifact_id, version_id, downloads) VALUES ...
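Conceptually, importing these exports is just executing each monthly file against the database in date order (the filenames sort chronologically). This is a sketch of that idea, not the actual `bb files.import` implementation, which is also incremental; the demo file here includes a `CREATE TABLE` for self-containment, whereas the real exports are INSERT-only as shown above:

```python
import sqlite3
import tempfile
from pathlib import Path

def import_monthly(db_path, monthly_dir):
    """Apply each monthly SQL export in date order (YYYY-MM filenames sort that way).
    Conceptual sketch only; the real bb task also skips months already imported."""
    con = sqlite3.connect(db_path)
    for sql_file in sorted(Path(monthly_dir).glob("*.sql")):
        con.executescript(sql_file.read_text())
    con.commit()
    con.close()

# Demo with a tiny fake export (real files live in data/monthly/)
workdir = Path(tempfile.mkdtemp())
(workdir / "2024-12.sql").write_text(
    "CREATE TABLE IF NOT EXISTS artifacts (id INTEGER PRIMARY KEY, group_id TEXT, artifact_id TEXT);\n"
    "INSERT INTO artifacts (id, group_id, artifact_id) VALUES (1, 'hiccup', 'hiccup');\n"
)
db = str(workdir / "clojars.sqlite")
import_monthly(db, workdir)

count = sqlite3.connect(db).execute("SELECT COUNT(*) FROM artifacts").fetchone()[0]
print(count)  # 1
```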

How CI Updates Work

The CI workflow runs daily and uses state.edn to track ID mappings—no database rebuild required:

  1. Fetch Clojars stats via HTTP (fills any gaps since last run)
  2. Assign IDs for any new artifacts/versions using state.edn
  3. Write SQL file for each new date (data/YYYY/MM/DD.sql)
  4. Update state.edn with new IDs and latest date
  5. Commit changes to git
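The ID-assignment step (2) boils down to: known (group, artifact) pairs keep their IDs across runs, new pairs get the next free ID. A sketch in Python with plain dicts standing in for state.edn; the key names here are illustrative, not the actual state.edn keys:

```python
def assign_ids(state, pairs):
    """Give each (group_id, artifact_id) pair a stable integer ID.
    `state` plays the role of state.edn: IDs persist across runs,
    unseen pairs get the next free ID. Key names are made up for this sketch."""
    ids = dict(state.get("artifact-ids", {}))
    next_id = state.get("next-artifact-id", 1)
    for pair in pairs:
        if pair not in ids:
            ids[pair] = next_id
            next_id += 1
    return {"artifact-ids": ids, "next-artifact-id": next_id}

# 'hiccup' is already known, so only 'ring-core' gets a new ID
state = {"artifact-ids": {("hiccup", "hiccup"): 1}, "next-artifact-id": 2}
state = assign_ids(state, [("hiccup", "hiccup"), ("ring", "ring-core")])
print(state["artifact-ids"][("ring", "ring-core")])  # 2
```

Because IDs are never reassigned, the SQL files emitted on different days stay mutually consistent without rebuilding the database.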

Known missing data on Clojars

These dates have no data on Clojars:

2015-12-27
2021-01-05 through 2021-01-22 (18 consecutive days)
2021-02-20, 2021-06-19, 2021-08-18, 2021-10-16
2022-01-15, 2022-02-13
2023-03-26, 2023-04-23, 2023-11-26
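Gaps like these can be found by walking the calendar and checking each date against the distinct dates present in downloads; a small sketch with illustrative data (covering the 2022-01-15 gap listed above):

```python
from datetime import date, timedelta

def missing_dates(present, start, end):
    """Return ISO dates in [start, end] that are absent from `present`,
    e.g. the distinct dates in the downloads table."""
    d, out = start, []
    while d <= end:
        if d.isoformat() not in present:
            out.append(d.isoformat())
        d += timedelta(days=1)
    return out

# Illustrative: pretend these are the only dates with rows
present = {"2022-01-14", "2022-01-16"}
print(missing_dates(present, date(2022, 1, 14), date(2022, 1, 16)))
# ['2022-01-15']
```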

If You See Something, Say Something

The code in this repo is mostly slop-coded. I've tried to monitor the AI and its output, but it produced a lot of code, and I ran out of time to be super vigilant about it... If you see something particularly funny, please file an issue. 🙏

License

MIT
