v0.12.8: ToolChoice, Parallel Tool Calls, Strategy Pattern Parser, NPU Fix by DenisovAV · Pull Request #205 · DenisovAV/flutter_gemma

DenisovAV · 2026-03-29T11:08:30Z

Summary

ToolChoice enum (auto/required/none) — control tool calling behavior in createChat()
Parallel Tool Calls — ParallelFunctionCallResponse for multiple function calls in one response, parseAll() extraction
Strategy Pattern Parser — per-model FunctionCallFormat implementations (Gemma, Qwen, DeepSeek, Llama, Phi, FunctionGemma) with FunctionCallFormatFactory routing
<tool_call> format — Qwen/Mistral-style function call parsing
NPU fix — pass nativeLibraryDir to LiteRT-LM Backend.NPU()
Example app — handle ParallelFunctionCallResponse in chat_screen and gemma_input_field
Docs — ToolChoice table, parallel calls example, L2-normalized embeddings note
Version bumped to 0.12.8 (pubspec, podspecs, CLAUDE.md)

These files should not be included in pub.dev package: - test_reports/ directory with test output - macos/Resources/litertlm-server.jar downloaded at build time

…rmats Extract monolithic FunctionCallParser into separate format implementations: - FunctionCallFormat interface + Factory - JsonFunctionCallFormat (gemmaIt, hammer, default) - QwenFunctionCallFormat (<tool_call> XML tags) - DeepSeekFunctionCallFormat (Unicode special tokens) - LlamaFunctionCallFormat (<|python_tag|> syntax) - PhiFunctionCallFormat (<|tool_calls|> JSON arrays) - FunctionGemmaCallFormat (<start_function_call> format) - Shared JsonParsingUtils with parseMultipleJsonObjects() for parallel calls

…ration - ToolChoice.auto/required/none controls system prompt injection - ParallelFunctionCallResponse for multiple tool calls in one response - chat.dart uses parseAll() to detect and return parallel calls - Prompt wording changes per ToolChoice mode in createToolsPrompt()

- NPU: pass nativeLibraryDir to Backend.NPU() for Qualcomm/MediaTek/Tensor - Integration tests for 5 models: FunctionGemma, Gemma 3 1B, Qwen 2.5, DeepSeek R1, Gemma 3n E2B - Tests cover: install, auto, required, none, streaming, parallel calls - Verified parallel function calls on Qwen, DeepSeek, Gemma 3n (2 calls each)

…nse in example app - Version 0.12.8 in pubspec.yaml, podspecs, CLAUDE.md - CHANGELOG.md: add 0.12.8 section - README.md: ToolChoice table, parallel calls example, L2-norm note - Example app: handle ParallelFunctionCallResponse in chat_screen and gemma_input_field - chat.dart: use parseAll() for end-of-stream parallel calls

…ffer detection - Add ModelType.phi enum value and wire PhiFunctionCallFormat in factory - Add toolChoice != ToolChoice.none guard to sync and streaming parsing paths - Use FunctionCallParser.isFunctionCallStart() instead of hardcoded { / ``` check - Increase _maxFunctionBufferLength from 150 to 1024 for verbose formats - Fix mutual exclusion of _pendingFunctionCall/_pendingParallelCall in example app - Add _pendingParallelCall handling in onError callback

…tions, history - C1: Mid-stream buffer uses parseAll() instead of parse() for parallel calls - C2: DeepSeek regex uses [\s\S]*? to cross newlines between tokens - C3: Zero-argument functions return empty args map instead of null - C4: Streaming history records Message.toolCall() for function calls - C5: emittedFunctionCall flag moved before stream loop, set mid-stream - I1: addQueryChunk() skips tool prompt injection for ToolChoice.none

… TFLite DLL copy on Windows/Linux - Switch all embedding tests from fromNetwork to fromAsset (models already in assets) - Switch inference/tool calling tests from fromNetwork to fromFile (models pushed via adb) - Add prepare_test_models.sh script to push models to device via adb - Fix Windows CMakeLists: add POST_BUILD copy for tensorflowlite_c.dll (#200) - Fix Linux CMakeLists: add install rule for TFLite C library (#200) - Remove networkUrl from tool_calling_test Gemma 3n E2B config - Add .litertlm to example .gitignore

DenisovAV added 7 commits March 28, 2026 19:38

Add test_reports/ and litertlm-server.jar to .pubignore

caf3362

These files should not be included in pub.dev package: - test_reports/ directory with test output - macos/Resources/litertlm-server.jar downloaded at build time

DenisovAV mentioned this pull request Mar 29, 2026

EmbeddingGemma on Desktop #200

Closed

DenisovAV merged commit e0c00e4 into main Mar 29, 2026
1 check failed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.12.8: ToolChoice, Parallel Tool Calls, Strategy Pattern Parser, NPU Fix#205

v0.12.8: ToolChoice, Parallel Tool Calls, Strategy Pattern Parser, NPU Fix#205
DenisovAV merged 8 commits intomainfrom
feature/v0.12.8

DenisovAV commented Mar 29, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

DenisovAV commented Mar 29, 2026

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant