Commit f5b896e

Add links to batch invariance and RFC in documentation

Authored by Bram Wasti
Signed-off-by: Bram Wasti <bwasti@fb.com>
1 parent 1fefbf2 commit f5b896e

1 file changed, 2 insertions(+), 2 deletions(-)

_posts/2025-11-10-bitwise-exact-rl.md

Lines changed: 2 additions & 2 deletions
@@ -44,11 +44,11 @@ Running the demonstration associated with this blog post we see exactly the issu
 
 ## How It’s Done & What’s Next
 
-We tackled not only invariance in the same framework, but across two different frameworks. This was a challenging task as it required effectively auditing every single invocation of every kernel. We heavily leveraged the forward pass kernels from vLLM’s recent batch invariance work and wrote simple backward passes for these.
+We tackled not only invariance in the same framework, but across two different frameworks. This was a challenging task as it required effectively auditing every single invocation of every kernel. We heavily leveraged the forward pass kernels from vLLM’s [recent batch invariance](https://docs.vllm.ai/en/latest/features/batch_invariance/) work and wrote simple backward passes for these.
 
 Then, we wrote a generic reinforcement learning script using GSM8K and a correctness reward. We run everything synchronously, alternating between trainer and generator on a single host. This is demonstrative of exactly on-policy execution, but is not very common in large scale runs.
 
-While building this, testing was straightforward as we are able to use exact bitwise checks to ensure the forward logprobs and the perplexity generated by the trainer are identical. We will continue to improve the performance of vLLM and simplify the integration to support all TorchTitan models. To follow this work, please see the linked RFC: #28326.
+While building this, testing was straightforward as we are able to use exact bitwise checks to ensure the forward logprobs and the perplexity generated by the trainer are identical. We will continue to improve the performance of vLLM and simplify the integration to support all TorchTitan models. To follow this work, please see the linked RFC: [#28326](https://github.com/vllm-project/vllm/issues/28326).
 
 Acknowledgements
 Bram Wasti, Teja Rao, Paul Zhang, Tianyu Liu, Zhuohan Li, Natalia Gimelshein
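
For context on the changed paragraphs: below is a minimal sketch of what an exact bitwise logprob check can look like in PyTorch. It is illustrative only, not code from this commit or the linked post; `bitwise_equal` is a hypothetical helper, only `torch.equal`, `Tensor.view`, and `Tensor.contiguous` are real PyTorch APIs, and the trainer/generator loop in the trailing comment uses made-up object and method names.

```python
import torch

def bitwise_equal(a: torch.Tensor, b: torch.Tensor) -> bool:
    """True only if `a` and `b` match bit-for-bit.

    Viewing float32 storage as int32 makes the comparison strictly
    bitwise; plain float equality would treat -0.0 == 0.0 and NaN != NaN.
    """
    if a.shape != b.shape or a.dtype != b.dtype:
        return False
    if a.dtype == torch.float32:
        a = a.contiguous().view(torch.int32)
        b = b.contiguous().view(torch.int32)
    return torch.equal(a, b)

# Self-check: -0.0 equals 0.0 as a float but differs in its sign bit.
z = torch.zeros(3)
assert torch.equal(z, -z)
assert not bitwise_equal(z, -z)

# Hypothetical shape of the synchronous, exactly on-policy loop the
# diff describes (all objects and methods below are stand-ins):
#
#   for _ in range(num_steps):
#       prompts, answers = sample_gsm8k_batch()
#       completions, gen_logprobs = generator.generate(prompts)          # vLLM side
#       rewards = grade_correctness(completions, answers)
#       train_logprobs = trainer.forward_logprobs(prompts, completions)  # TorchTitan side
#       assert bitwise_equal(train_logprobs, gen_logprobs)
#       trainer.update(prompts, completions, rewards)
#       generator.load_weights(trainer.state_dict())  # keep generation on-policy
```

Alternating trainer and generator on one host, as the diff describes, is what makes such a bitwise comparison meaningful: the generator's weights are always exactly the trainer's latest, so any bit-level divergence points at the kernels rather than at stale weights.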
