
Conversation

@davidkoski (Collaborator) commented Dec 18, 2025:

Note that this requires changes in mlx-swift (so likely a new tag there):

Proposed changes

Please include a description of the problem or feature this PR is addressing. If there is a corresponding issue, include the issue #.

Checklist

Put an x in the boxes that apply.

  • I have read the CONTRIBUTING document
  • I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
  • I have added tests that prove my fix is effective or that my feature works
  • I have updated the necessary documentation (if needed)

dims: headDim, base: config.ropeTheta, traditional: false,
scalingConfig: config.ropeScaling,
maxPositionEmbeddings: config.maxPositionEmbeddings)
}
@davidkoski (Collaborator, Author) commented:

Picking up changes post initial port: ml-explore/mlx-lm@714157b...main

return suScaledRope(x, offset: offset)
}
return x
}
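The snippet above dispatches to a su-scaled RoPE only in one branch and otherwise returns the input unchanged. A minimal sketch of that selection pattern, with invented names (not the actual mlx-swift-examples API), is:

```swift
// Illustrative sketch: models with SuScaled/LongRoPE scaling switch to
// the scaled rope variant once positions exceed the original training
// context; otherwise standard rope applies. All names here are assumed.
struct RopeSelector {
    let originalContextLength: Int

    // Decide whether the scaled variant should be used for this window.
    func useScaledRope(sequenceLength: Int, offset: Int) -> Bool {
        // Past the original training context, switch to scaled rope.
        offset + sequenceLength > originalContextLength
    }
}

let selector = RopeSelector(originalContextLength: 4096)
print(selector.useScaledRope(sequenceLength: 512, offset: 4000))  // true
print(selector.useScaledRope(sequenceLength: 512, offset: 0))     // false
```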
@davidkoski (Collaborator, Author) commented:

} else {
let (cachedKeys, cachedValues) = cache.update(keys: keys, values: values)
// TODO dkoski
// print("\(cachedKeys.shape) \(cachedValues.shape) \(queries.shape), \(mask.masks?[0].shape ?? [])")
@davidkoski (Collaborator, Author) commented:

WIP debug stuff :-)

_ action: @Sendable (isolated ModelContainer) async throws -> sending R
) async rethrows -> sending R {
try await action(self)
}
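The `perform` signature above uses an `isolated` closure parameter and `sending` return so a caller-supplied closure runs inside the actor's isolation and its result can move out safely. A self-contained sketch of that pattern (with a stand-in actor, not the real ModelContainer) looks like:

```swift
// Sketch (assumed names) of the perform-style pattern: the actor exposes
// one entry point that runs a closure inside its isolation, so all
// mutable state is only touched on the actor. `sending R` lets a
// non-Sendable result transfer out of the isolation safely.
actor ModelBox {
    private var generationCount = 0  // actor-protected mutable state

    func perform<R>(
        _ action: @Sendable (isolated ModelBox) async throws -> sending R
    ) async rethrows -> sending R {
        try await action(self)
    }

    // Synchronous within the actor's isolation.
    func noteGeneration() -> Int {
        generationCount += 1
        return generationCount
    }
}

// Usage: inside the closure the parameter is isolated, so calling
// noteGeneration() needs no await.
func example(box: ModelBox) async {
    let n = await box.perform { box in
        box.noteGeneration()
    }
    print("generation #\(n)")
}
```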
@davidkoski (Collaborator, Author) commented:

@DePasqualeOrg FYI, trying some different things out re your recent cleanups around Sendable and thread safety. I have some tests that repro some threading issues (based on the LLMBasic example I made).

import XCTest

/// Tests for the streamlined API using real models
public class ChatSessionTests: XCTestCase {
@davidkoski (Collaborator, Author) commented:

@DePasqualeOrg FYI moved this into an IntegrationTests directory -- I am not sure this should run on CI as these are rather large, but I think the tests are valuable to run locally.

A contributor commented:

That makes sense. I thought about that when I modified this test, but I didn't realize that it could be excluded from CI.
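One way to keep large integration tests out of the default CI run is a separate test target that CI selects around. A hypothetical Package.swift excerpt (target and path names are illustrative, not the repository's actual manifest):

```swift
// swift-tools-version:5.9
// Hypothetical manifest excerpt: a dedicated IntegrationTests target
// lets CI run only the fast unit tests, while developers run the
// heavy, model-downloading tests locally with
//   swift test --filter IntegrationTests
import PackageDescription

let package = Package(
    name: "MLXLMCommon",
    targets: [
        .target(name: "MLXLMCommon"),
        // Fast unit tests: run everywhere, including CI.
        .testTarget(
            name: "MLXLMTests",
            dependencies: ["MLXLMCommon"]
        ),
        // Large integration tests: excluded from the CI invocation.
        .testTarget(
            name: "IntegrationTests",
            dependencies: ["MLXLMCommon"],
            path: "Tests/IntegrationTests"
        ),
    ]
)
```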

let result = try await session.respond(to: "What is 2+2? Reply with just the number.")
print("One-shot result:", result)
XCTAssertTrue(result.contains("4") || result.lowercased().contains("four"))
func testChatSessionAsyncInterrupt() async throws {
@davidkoski (Collaborator, Author) commented:

@DePasqualeOrg FYI an example of some concurrency issues related to the issues you were working on.

This triggers a variety of crashes:

and a couple of others where no crash occurs but the streaming response is still running for a short time after the loop terminates early, and we end up with concurrent modification of the KVCache.

I will use this to test actual fixes.
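The failure mode described above — a consumer breaking out of a stream early while the producer keeps mutating shared state — can be sketched as follows, with invented names standing in for the real generation and KVCache types. The key points are that all cache mutation goes through an actor and that early termination is propagated back to the producer task via cancellation:

```swift
// Sketch (assumed names) of a cancellation-safe streaming setup: if the
// consumer stops iterating early, onTermination cancels the producer
// task, so the cache is not mutated concurrently with the next request.
import Foundation

actor KVCacheBox {
    private var entries: [Int] = []
    func append(_ token: Int) { entries.append(token) }
    var count: Int { entries.count }
}

func streamTokens(into cache: KVCacheBox) -> AsyncStream<Int> {
    AsyncStream { continuation in
        let task = Task {
            for token in 0..<1_000 {
                try Task.checkCancellation()
                await cache.append(token)   // all mutation on the actor
                continuation.yield(token)
            }
            continuation.finish()
        }
        // Propagate early termination (break, deallocation) back to
        // the producer instead of letting it run on in the background.
        continuation.onTermination = { _ in task.cancel() }
    }
}
```

Without the `onTermination` hook, a `break` in the consuming `for await` loop leaves the producer running briefly, which is exactly the window where concurrent modification can occur.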

Self.llmContainer, instructions: "You are a helpful assistant. Keep responses brief.")
@MainActor
func testViewModel() async throws {
let model = ChatModel(model: model())
@davidkoski (Collaborator, Author) commented:

And this one simulates the activity from LLMBasic which also causes thread safety issues.
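The LLMBasic-style activity being simulated can be sketched as a `@MainActor` view model that forwards prompts to an actor-isolated session and publishes output on the main actor, cancelling any in-flight generation first. All names here are invented stand-ins, not the actual LLMBasic types:

```swift
// Sketch (assumed names): UI state lives on the main actor; generation
// runs through an actor so model state is never touched concurrently.
import Observation

actor Session {
    func respond(to prompt: String) async -> String {
        "echo: \(prompt)"  // stand-in for real generation
    }
}

@MainActor
@Observable
final class ChatModel {
    var output = ""
    var isGenerating = false
    private let session = Session()
    private var generation: Task<Void, Never>?

    func send(_ prompt: String) {
        generation?.cancel()          // stop any in-flight generation
        isGenerating = true
        generation = Task {           // inherits the @MainActor context
            output = await session.respond(to: prompt)
            isGenerating = false
        }
    }
}
```

Rapidly calling `send(_:)` from a UI, as LLMBasic does, is what exercises the cancel-then-restart path where the thread-safety issues show up.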

- ml-explore/mlx-swift-examples#454

- fixes #27
- move ChatSession integration tests into new test target so we can more easily control when it runs
- make a ChatSession _unit_ (more or less) test
- fix Sendable / thread safety issues uncovered by LLMBasic

- collect TestTokenizer and friends into its own file; fix warnings in tests
Development

Successfully merging this pull request may close these issues.

[BUG] gemma3text crashes if the attention mask is used

3 participants