[tx] Support LoRA in the unembedding layer, redux#984
pcmoritz wants to merge 5 commits into NovaSky-AI:main
Conversation
Code Review
This pull request adds support for LoRA in the unembedding layer, which is a great enhancement. The implementation is clean, with the logic correctly handled through a transposed flag in apply_lora and a new implementation for LoRAEmbed.T. The accompanying tests are thorough, covering both the transposed projection itself and the consistency between the forward and transposed passes. I have one minor suggestion to make the test code more idiomatic.
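For readers skimming the thread, here is a minimal single-adapter sketch of the transposed math the review refers to. It is illustrative only: the actual code batches per-adapter weights (see the ragged_dot excerpt further down), which this toy version omits.

```python
import jax.numpy as jnp

def unembed_with_lora(hidden, embedding, lora_A, lora_B):
    # Toy shapes (single adapter): embedding [vocab, hidden],
    # lora_A [vocab, r], lora_B [r, hidden], hidden [batch, hidden].
    # The unembedding applies the adapted weight transposed:
    #   hidden @ (E + A @ B).T == hidden @ E.T + (hidden @ B.T) @ A.T
    base_logits = hidden @ embedding.T
    lora_logits = (hidden @ lora_B.T) @ lora_A.T  # keeps the rank-r bottleneck
    return base_logits + lora_logits
```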
raulchen left a comment
LGTM overall. Maybe also add an e2e test at the model level if possible.
    return lambda hidden_states, adapter_indices=None: hidden_states @ self.embedding[...].T

def project(hidden_states: jax.Array, adapter_indices: jax.Array | None = None) -> jax.Array:
    base_out = hidden_states @ self.embedding[...].T
nits:
- avoid capturing self.
- rename hidden_states to something more general, since this is the general LoRAMixin class (a rough sketch of both points is below).
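One possible shape for that refactor, sketched with hypothetical names (make_unembed_projection, x) rather than the PR's actual code:

```python
import jax
import jax.numpy as jnp

def make_unembed_projection(embedding_weights: jax.Array):
    # Bind the weight explicitly so the returned closure does not capture `self`.
    def project(x: jax.Array, adapter_indices: jax.Array | None = None) -> jax.Array:
        # `x` rather than `hidden_states`, since the mixin is not specific to hidden states.
        del adapter_indices  # the base (non-LoRA) path ignores adapters
        return x @ embedding_weights.T

    return project

# Toy usage with stand-in shapes (vocab=10, hidden=4):
project = make_unembed_projection(jnp.ones((10, 4)))
logits = project(jnp.ones((2, 4)))  # shape (2, 10)
```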
else:
    # x @ A @ B (or A[x] @ B for embeddings via _apply_lora_weight override)
    intermediate = self._apply_lora_weight(self.lora_A[...], x_sorted, adapter_indices_sorted, group_sizes)
    lora_output_sorted = jax.lax.ragged_dot(intermediate, self.lora_B[...], group_sizes)
Since this is a mixin class, instead of the _apply_lora_weight abstraction, I feel it'd be cleaner to handle both the lookup-based and matmul-based paths in this class, and let subclasses choose which one to use with a flag.
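A hypothetical sketch of that flag-based design, simplified to a single adapter and without the ragged per-adapter batching the real code does; names are placeholders, not the actual tx LoRAMixin:

```python
import jax.numpy as jnp

class LoRAPathMixin:
    """Illustrative only; not the actual tx LoRAMixin."""

    # Subclasses set this instead of overriding a private helper:
    # True for embedding-style layers (A[x] @ B), False for dense layers (x @ A @ B).
    lora_a_is_lookup: bool = False

    def _lora_intermediate(self, lora_A, x):
        if self.lora_a_is_lookup:
            # Lookup path: `x` holds integer token ids, so rows of A are gathered.
            return jnp.take(lora_A, x, axis=0)
        # Matmul path: `x` holds activations.
        return x @ lora_A

class DenseLoRA(LoRAPathMixin):
    lora_a_is_lookup = False

class EmbedLoRA(LoRAPathMixin):
    lora_a_is_lookup = True
```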
This is a better version of #969, supporting LoRA in the unembedding layer as well.
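For context on the jax.lax.ragged_dot call shown in the diff excerpt above: it performs one matmul per contiguous group of rows, which is how the per-adapter lora_B matrices are applied. A toy example with made-up shapes, not the tx shapes:

```python
import jax
import jax.numpy as jnp

m, k, n, g = 6, 4, 8, 3  # 6 tokens, rank 4, hidden size 8, 3 adapters
lhs = jnp.ones((m, k))     # token rows sorted by adapter (the `intermediate` in the excerpt)
rhs = jnp.ones((g, k, n))  # one lora_B matrix per adapter
group_sizes = jnp.array([2, 3, 1], dtype=jnp.int32)  # rows 0-1 -> adapter 0, 2-4 -> adapter 1, 5 -> adapter 2

# Each contiguous block of rows in `lhs` is multiplied by its adapter's matrix in `rhs`.
out = jax.lax.ragged_dot(lhs, rhs, group_sizes)
print(out.shape)  # (6, 8)
```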