JADE Language Ollama Eval Scaffold

Evaluate Ollama models on JADE (Jade Software language) code-generation tasks, then compile/load-check the generated schema using JADE tooling running inside a Parallels VM.

Important note about `pip install jade`

pip install jade is not the JADE programming language toolchain. It is an unrelated astronomy package.

Known working configuration (validated March 2, 2026)

macOS host path: /Users/maxaitel/Documents/school-projects/jade-ai-training/jade-ai
Parallels VM name: Windows 11
Hosted Ollama: http://100.116.25.114:11434
JADE binaries in VM: C:\Jade2025\bin\
JADE system DB in VM: C:\Jade2025\system
JADE ini in VM: C:\Jade2025\system\jade.ini

Setup

Linux/macOS:

./scripts/setup_venv.sh

Windows PowerShell:

.\scripts\setup_venv.ps1

Manual:

python3 -m venv --clear .venv
. .venv/bin/activate
python -m pip install --upgrade pip

What the evaluator does

Reads tasks from JSONL.
Calls Ollama via CLI or HTTP API.
Extracts generated code and writes to each task output_path when --apply-generated is enabled.
Runs compile/load command per task (local shell or Parallels VM).
Logs one JSON record per model/task with compile + model outputs.

Compile command templating

COMPILE_CMD/--compile-cmd supports placeholders per task:

{task_output_path}: relative task output file (for example eval/generated/hard/DomainModel.scm)
{generated_target_file}: absolute host path of generated file (empty if not applied)
{project_path}: absolute host project path

This is how we run real JADE loader checks against each generated file.

Parallels mode behavior

--compile-mode parallels runs compile command through prlctl exec.
Default mapping for project path:
- /Users/<name>/... -> C:\Mac\Home\...
- /Volumes/... -> C:\Mac\Volumes\...
Override with --parallels-project-path if your VM path is different.

Real JADE compile/load command used

This command was validated in VM:

C:\Jade2025\bin\jadloadb.exe path=C:\Jade2025\system ini=C:\Jade2025\system\jade.ini schemaFile={task_output_path} showProgress=false

Run commands

Quick model eval (Parallels + hosted Ollama)

make eval-jade-parallels \
  MODELS="qwen3.5:122b" \
  TASKS="eval/tasks.jade.jsonl" \
  PROJECT="/Users/maxaitel/Documents/school-projects/jade-ai-training/jade-ai" \
  COMPILE_CMD='C:\Jade2025\bin\jadloadb.exe path=C:\Jade2025\system ini=C:\Jade2025\system\jade.ini schemaFile={task_output_path} showProgress=false' \
  OLLAMA_HOST="http://100.116.25.114:11434" \
  OLLAMA_MODE="http" \
  PARALLELS_VM="Windows 11" \
  APPLY=1

Add KEEP=1 to keep generated files.

Harder task pack

python3 eval/run_jade_eval.py \
  --models qwen3.5:122b \
  --tasks-file eval/tasks.jade.hard.jsonl \
  --project-path /Users/maxaitel/Documents/school-projects/jade-ai-training/jade-ai \
  --compile-mode parallels \
  --parallels-vm "Windows 11" \
  --compile-cmd 'C:\Jade2025\bin\jadloadb.exe path=C:\Jade2025\system ini=C:\Jade2025\system\jade.ini schemaFile={task_output_path} showProgress=false' \
  --ollama-host http://100.116.25.114:11434 \
  --ollama-mode http \
  --apply-generated \
  --keep-generated

Hard task file: eval/tasks.jade.hard.jsonl

Proving the harness catches compiler failures

Control task file: eval/tasks.jade.compile_control.jsonl

control_valid_schema uses eval/controls/valid_reportwriter.scm (known valid JADE schema file).
control_invalid_schema uses eval/controls/invalid_schema.scm (intentionally invalid).

Run:

python3 eval/run_jade_eval.py \
  --models qwen3.5:122b \
  --tasks-file eval/tasks.jade.compile_control.jsonl \
  --project-path /Users/maxaitel/Documents/school-projects/jade-ai-training/jade-ai \
  --compile-mode parallels \
  --parallels-vm "Windows 11" \
  --compile-cmd 'C:\Jade2025\bin\jadloadb.exe path=C:\Jade2025\system ini=C:\Jade2025\system\jade.ini schemaFile={task_output_path} showProgress=false' \
  --skip-ollama

Expected result:

control_valid_schema: compile_ok=true
control_invalid_schema: compile_ok=false

Validated log:

logs/eval-20260303-002512.jsonl
logs/eval-20260303-003236.jsonl (same control rerun after README updates)

Online JADE schema/function validation

To verify parser/loader behavior against real public JADE code, we downloaded schema files from github.com/jadesoftwarenz into eval/online_samples/ and ran compile-only checks.

Task file:

eval/tasks.online_compile.jsonl

Run:

python3 eval/run_jade_eval.py \
  --models qwen3.5:122b \
  --tasks-file eval/tasks.online_compile.jsonl \
  --project-path /Users/maxaitel/Documents/school-projects/jade-ai-training/jade-ai \
  --compile-mode parallels \
  --parallels-vm "Windows 11" \
  --compile-cmd 'C:\Jade2025\bin\jadloadb.exe path=C:\Jade2025\system ini=C:\Jade2025\system\jade.ini schemaFile={task_output_path} showProgress=false' \
  --skip-ollama

Validated log:

logs/eval-20260303-010628.jsonl

Observed result:

4 files compiled successfully.
1 file failed with Compile Error 6020 - Unknown schema because it references WebServiceUtilitiesSchema::... (missing dependency in the current DB load context).

Sources used:

Harness proof on online code (valid vs intentionally broken)

Task file:

eval/tasks.online_harness_validation.jsonl

Run:

python3 eval/run_jade_eval.py \
  --models qwen3.5:122b \
  --tasks-file eval/tasks.online_harness_validation.jsonl \
  --project-path /Users/maxaitel/Documents/school-projects/jade-ai-training/jade-ai \
  --compile-mode parallels \
  --parallels-vm "Windows 11" \
  --compile-cmd 'C:\Jade2025\bin\jadloadb.exe path=C:\Jade2025\system ini=C:\Jade2025\system\jade.ini schemaFile={task_output_path} showProgress=false' \
  --skip-ollama

Validated log:

logs/eval-20260303-010714.jsonl

Observed result:

online_valid_xmlwhitepaper: pass (compile_ok=true, exit 0)
online_broken_xmlwhitepaper: fail (compile_ok=false, exit 255)
Broken file error: Compile Error 7841 - Expecting: schema definition at line 1.

Real compile-only run on existing qwen outputs

Task file:

eval/tasks.real_compile_existing.jsonl

This run reuses generated files and skips model inference to avoid waiting:

python3 eval/run_jade_eval.py \
  --models qwen3.5:122b \
  --tasks-file eval/tasks.real_compile_existing.jsonl \
  --project-path /Users/maxaitel/Documents/school-projects/jade-ai-training/jade-ai \
  --compile-mode parallels \
  --parallels-vm "Windows 11" \
  --compile-cmd 'C:\Jade2025\bin\jadloadb.exe path=C:\Jade2025\system ini=C:\Jade2025\system\jade.ini schemaFile={task_output_path} showProgress=false' \
  --skip-ollama

Validated log:

logs/eval-20260303-010129.jsonl

Observed result:

0/4 compile pass on existing qwen outputs (all failed with schema-definition parse errors).

Example model output snippets (qwen3.5:122b)

From logs/eval-20260303-000609.jsonl:

class Customer {
    id: Integer
    firstName: String
    lastName: String

    fullName(): String {
        return firstName + " " + lastName
    }
}

class Order
    property subtotal: Decimal
    property taxRate: Decimal

    method total(): Decimal
        return subtotal + (subtotal * taxRate)
end class

Files added for repeatability

eval/tasks.jade.hard.jsonl
eval/tasks.jade.compile_control.jsonl
eval/controls/invalid_schema.scm
eval/controls/valid_reportwriter.scm

Output logs

Logs are written to logs/eval-<timestamp>.jsonl and include:

model/task IDs
rendered compile command
compile status (compile_ok, compile_exit_code, timeout, duration)
compile stdout/stderr tails
model status/output tails
generation apply metadata

Troubleshooting

If compile fails immediately with INI file not found, ensure ini=C:\Jade2025\system\jade.ini is present in COMPILE_CMD.
If Parallels path mapping is wrong, pass --parallels-project-path explicitly.
If local ollama CLI is not installed, force --ollama-mode http.
If model calls hang, inspect hosted server with:

curl -s http://100.116.25.114:11434/api/ps

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
eval		eval
scripts		scripts
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JADE Language Ollama Eval Scaffold

Important note about `pip install jade`

Known working configuration (validated March 2, 2026)

Setup

What the evaluator does

Compile command templating

Parallels mode behavior

Real JADE compile/load command used

Run commands

Quick model eval (Parallels + hosted Ollama)

Harder task pack

Proving the harness catches compiler failures

Online JADE schema/function validation

Harness proof on online code (valid vs intentionally broken)

Real compile-only run on existing qwen outputs

Example model output snippets (qwen3.5:122b)

Files added for repeatability

Output logs

Troubleshooting

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

JADE Language Ollama Eval Scaffold

Important note about pip install jade

Known working configuration (validated March 2, 2026)

Setup

What the evaluator does

Compile command templating

Parallels mode behavior

Real JADE compile/load command used

Run commands

Quick model eval (Parallels + hosted Ollama)

Harder task pack

Proving the harness catches compiler failures

Online JADE schema/function validation

Harness proof on online code (valid vs intentionally broken)

Real compile-only run on existing qwen outputs

Example model output snippets (qwen3.5:122b)

Files added for repeatability

Output logs

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Important note about `pip install jade`

Packages