fix: enable Gunicorn worker recycling with graceful analytics flush #6762

Closed

gagantrivedi wants to merge 1 commit into main from fix/gunicorn-max-requests-graceful-flush

Conversation

@gagantrivedi (Member) commented Feb 24, 2026

Contributes to https://github.com/Flagsmith/pulumi/issues/162


Summary

  • Enable Gunicorn --max-requests (default 1000) and --max-requests-jitter (default 100) to recycle workers periodically, mitigating memory leaks
  • Add atexit handler to flush in-process analytics caches (APIUsageCache, FeatureEvaluationCache) via the task processor before a worker exits, preventing data loss during recycling
  • Rename internal flush methods for clarity: _flush_through_thread (hot path) vs _flush_through_task_processor (shutdown path)
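The shutdown path in the last two bullets can be sketched as follows. This is a minimal illustration with simplified cache internals and the task-processor enqueue stubbed out; the real `APIUsageCache` and `FeatureEvaluationCache` live in the Flagsmith codebase and differ in detail:

```python
import atexit
import logging

logger = logging.getLogger(__name__)


class APIUsageCache:
    """Simplified stand-in for the real in-process analytics cache."""

    def __init__(self) -> None:
        self._cache: dict = {}
        self.flushed = False

    def _flush_through_task_processor(self) -> None:
        # Shutdown path: enqueue via the task processor (.delay()) rather
        # than .run_in_thread(), which is unsafe during interpreter teardown.
        for key, value in self._cache.items():
            pass  # track_request.delay(kwargs={...}) in the real code
        self._cache = {}

    def flush_on_shutdown(self) -> None:
        self._flush_through_task_processor()
        self.flushed = True


api_usage_cache = APIUsageCache()
feature_evaluation_cache = APIUsageCache()  # stand-in; a separate class in reality


def flush_analytics_caches() -> None:
    # Registered via atexit in AppAnalyticsConfig.ready(); swallows
    # exceptions so one failing cache cannot abort the other flushes.
    for cache in (api_usage_cache, feature_evaluation_cache):
        try:
            cache.flush_on_shutdown()
        except Exception:
            logger.exception("Failed to flush analytics cache on shutdown")


atexit.register(flush_analytics_caches)
```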

Context

Without --max-requests, Gunicorn workers never recycle, so leaked memory accumulates indefinitely. Enabling worker recycling requires flushing any buffered analytics data before exit; otherwise, counts are silently lost. The shutdown flush uses .delay() (task queue enqueue) rather than .run_in_thread() to avoid thread-safety issues during Python interpreter teardown.

Both GUNICORN_MAX_REQUESTS and GUNICORN_MAX_REQUESTS_JITTER remain configurable via environment variables. Setting GUNICORN_MAX_REQUESTS=0 disables recycling entirely.
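Assuming a typical entrypoint script translates these variables into CLI flags (the script itself is not shown in this PR, so the exact mapping is an assumption), the logic might look like:

```shell
# Hypothetical entrypoint sketch: map env vars to Gunicorn flags.
GUNICORN_MAX_REQUESTS="${GUNICORN_MAX_REQUESTS:-1000}"
GUNICORN_MAX_REQUESTS_JITTER="${GUNICORN_MAX_REQUESTS_JITTER:-100}"

EXTRA_ARGS=""
if [ "${GUNICORN_MAX_REQUESTS}" != "0" ]; then
    # GUNICORN_MAX_REQUESTS=0 disables worker recycling entirely
    EXTRA_ARGS="--max-requests ${GUNICORN_MAX_REQUESTS} --max-requests-jitter ${GUNICORN_MAX_REQUESTS_JITTER}"
fi

echo "gunicorn app.wsgi:application ${EXTRA_ARGS}"
```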

Test plan

  • Unit tests for flush_on_shutdown on both cache classes (populated and empty)
  • Unit tests for flush_analytics_caches atexit handler (happy path and exception handling)
  • Unit test for AppAnalyticsConfig.ready() atexit registration
  • Manual verification: ran Gunicorn with --max-requests 3, confirmed atexit handler fires and logs flush on worker recycle

fix: enable Gunicorn worker recycling with graceful analytics flush

Enable --max-requests (default 1000) and --max-requests-jitter (default
100) for Gunicorn workers to mitigate memory leaks. Add atexit handler
to flush in-process analytics caches via the task processor before a
worker exits, preventing data loss during recycling.
@gagantrivedi gagantrivedi requested a review from a team as a code owner February 24, 2026 08:01
@gagantrivedi gagantrivedi requested review from Zaimwa9 and removed request for a team February 24, 2026 08:01
@vercel bot commented Feb 24, 2026

The latest updates on your projects.

3 Skipped Deployments

Project                      Deployment  Actions  Updated (UTC)
docs                         Ignored     Ignored  Feb 24, 2026 8:01am
flagsmith-frontend-preview   Ignored     Ignored  Feb 24, 2026 8:01am
flagsmith-frontend-staging   Ignored     Ignored  Feb 24, 2026 8:01am


github-actions bot added the api (Issue related to the REST API) and fix labels on Feb 24, 2026
@github-actions bot (Contributor) commented Feb 24, 2026

Docker builds report

Image                                              Build Status  Security report
ghcr.io/flagsmith/flagsmith-e2e:pr-6762            Finished ✅    Skipped
ghcr.io/flagsmith/flagsmith-api-test:pr-6762       Finished ✅    Skipped
ghcr.io/flagsmith/flagsmith-frontend:pr-6762       Finished ✅    Results
ghcr.io/flagsmith/flagsmith-api:pr-6762            Finished ✅    Results
ghcr.io/flagsmith/flagsmith-private-cloud:pr-6762  Finished ✅    Results

@codecov bot commented Feb 24, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 98.26%. Comparing base (deee405) to head (30c0df1).
⚠️ Report is 32 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #6762   +/-   ##
=======================================
  Coverage   98.25%   98.26%           
=======================================
  Files        1312     1313    +1     
  Lines       48568    48642   +74     
=======================================
+ Hits        47722    47796   +74     
  Misses        846      846           


name = "app_analytics"

def ready(self) -> None:
    atexit.register(flush_analytics_caches)
Member commented:
Is gunicorn's worker_exit hook more suitable for this?

@gagantrivedi (Member, Author) replied Feb 25, 2026:

I don't see much difference between them for our use case. Can you elaborate a bit more on your reasoning?

Member replied:

IMO it'll better document our intent as we do not need to force-flush outside of worker context, and lower the mental fatigue of mapping out the worker lifecycle as we already manage it here.

On a technical level, worker_exit is a stronger guarantee that the code will run when we need it to run (i.e. when a worker is marked for recycling).
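For reference, the suggested hook would look roughly like this in a gunicorn.conf.py. worker_exit(server, worker) is a real Gunicorn server hook that runs in the worker process as it exits; the flush function here is a stub standing in for the app's handler:

```python
# gunicorn.conf.py sketch of the suggestion above.

flushed = []  # test scaffolding; the real handler talks to the task processor


def flush_analytics_caches() -> None:
    # Stub standing in for app_analytics' actual flush handler.
    flushed.append(True)


def worker_exit(server, worker):
    # Gunicorn calls this in the worker process just before it exits,
    # including when the worker hits max_requests and is recycled.
    flush_analytics_caches()
```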

@Zaimwa9 Zaimwa9 removed their request for review February 25, 2026 08:10
@Zaimwa9 Zaimwa9 assigned khvn26 and unassigned Zaimwa9 Feb 25, 2026
Comment on lines +43 to +54
for key, value in self._cache.items():
    track_request.delay(
        kwargs={
            "resource": key.resource.value,
            "host": key.host,
            "environment_key": key.environment_key,
            "count": value,
            "labels": dict(key.labels),
        }
    )
self._cache = {}
self._last_flushed_at = timezone.now()
Contributor commented:

Couldn't we defer the iteration to the task processor by just sending over self._cache itself (as JSON, for example)? I don't know how big this cache could be, but I don't love the idea of creating an indefinite number of tasks on a regular basis.

@gagantrivedi (Member, Author) replied:

Yeah, good shout! I will add a task for bulk tracking
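A bulk variant along those lines might hand the whole cache to a single task. This is a sketch: the bulk task name and payload shape are assumptions, and the task processor is stubbed so the example is self-contained:

```python
from dataclasses import dataclass, field


@dataclass
class _StubTask:
    """Records .delay() calls; stands in for a task-processor task."""

    calls: list = field(default_factory=list)

    def delay(self, kwargs: dict) -> None:
        self.calls.append(kwargs)


track_request_bulk = _StubTask()  # hypothetical bulk tracking task


def flush_cache_bulk(cache: dict) -> None:
    # One task for the whole cache: O(1) enqueues instead of O(n),
    # regardless of how many distinct keys accumulated between flushes.
    if cache:
        track_request_bulk.delay(
            kwargs={
                "requests": [
                    {"key": key, "count": count} for key, count in cache.items()
                ]
            }
        )
    cache.clear()
```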

@gagantrivedi (Member, Author) commented:

After further thought, --max-requests would just mask underlying issues rather than address them properly. We should be more proactive about identifying and resolving the root causes. We've tracked the issues we've found so far in https://github.com/Flagsmith/pulumi/issues/162 and will work through those. Closing this in favour of that approach.

Labels

api (Issue related to the REST API), fix

4 participants