Meditron-4

Axolotl configs and Slurm helpers for training and evaluating Meditron models on CSCS.

Prerequisites

  • CSCS account with access to the storage paths referenced in the configs.
  • Python environment described by your EDF file (see ENV below).
  • Clone of the lm-evaluation-harness fork alongside this repo: git clone https://github.com/Xkrilandar/lm-evaluation-harness.

Environment setup

Create a .env file in the repo root with your paths and tokens (do not commit secrets), following the .env.example format.
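The authoritative variable names are in .env.example; the entries below are illustrative assumptions only (STORAGE_ROOT is the one variable this README itself references, in the eval example):

```shell
# Illustrative .env sketch — the real variable names come from .env.example.
# Do not commit this file.
STORAGE_ROOT=/path/to/your/cscs/storage   # storage root referenced by the configs
HF_TOKEN=<your-huggingface-token>         # assumption: a token for gated models
```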

Training

  • Pick a config in axolotl_config/ (for Meditron-4/Qwen-3 use sft_meditron4_qwen3.yaml).
  • Submit via Slurm (self-submits and tails logs):
    bash meditron_train.sh axolotl_config/sft_meditron4_qwen3.yaml
    
    The script:
    • injects your .env values into the template and writes axolotl_config/config.yaml,
    • submits itself with sbatch -J <config-name> ...,
    • tails reports/R-<job>.<jobid>.err once the log appears.
  • Adjust SBATCH resources at the top of meditron_train.sh if you need different GPUs/time.

Script usage

  • meditron_train.sh: submit a training run.

    bash meditron_train.sh axolotl_config/sft_meditron4_qwen3.yaml
    
  • meditron_eval.sh: submit an eval run (data parallel via accelerate).

    bash meditron_eval.sh $STORAGE_ROOT/apertus/huggingface/Apertus8B
    

    Optional flags:

    • --debug adds --limit 100 and sets verbosity to DEBUG.
    • --model_parallelism runs without accelerate and adds parallelize=True to the model args (for the 70B model).
  • summarise_evals.sh: scan eval reports and summarise the results.

    bash summarise_evals.sh
    
  • find_training_errors.sh: scan reports for training errors.

    bash find_training_errors.sh
    
  • slack_helpers.sh: helper functions for other scripts (not meant to be run directly).
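The optional eval flags above could be parsed along these lines. This is a sketch of the flag-to-argument mapping only, not meditron_eval.sh's actual implementation:

```shell
#!/bin/sh
# parse_eval_flags [--debug] [--model_parallelism] — illustrative sketch of
# how the optional flags might map to lm-evaluation-harness arguments.
parse_eval_flags() {
    EXTRA_ARGS=""
    MODEL_ARGS=""
    for arg in "$@"; do
        case "$arg" in
            --debug)
                # Limit to 100 examples and turn up logging.
                EXTRA_ARGS="--limit 100 --verbosity DEBUG" ;;
            --model_parallelism)
                # Skip accelerate; shard the model across GPUs instead.
                MODEL_ARGS="parallelize=True" ;;
        esac
    done
}
```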

Distillation

Quickstart (from repo root):

bash distillation/distill_head.sh distillation/datasets_to_distill.txt \
  --strict-repro \
  --deterministic \
  --seed 42 \
  --model-revision "$DISTILL_MODEL_REVISION"

To prequeue workers immediately (as dependencies on the head job):

bash distillation/submit_distill.sh distillation/datasets_to_distill.txt \
  --strict-repro \
  --deterministic \
  --seed 42 \
  --model-revision "$DISTILL_MODEL_REVISION"

Outputs and logs:

  • Run state: distill_reports/pool-<model>-<timestamp>-<rid>/ (queue.db, summary, events)
  • Distilled shards: alongside each source dataset as *_distillation_<model>.shard-*.jsonl
  • Merged outputs: alongside each source dataset as *_distillation_<model>.jsonl

See distillation/README.md for full details, environment variables, and queue layout.
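Given the shard naming above, the merge step amounts to concatenating the per-shard JSONL files. A minimal sketch, assuming the shards sort correctly by name (the real merge lives in the distillation/ scripts and may differ):

```shell
#!/bin/sh
# merge_shards DATASET MODEL — sketch of merging distilled shards
# (illustrative; see distillation/ for the actual pipeline).
merge_shards() {
    dataset="$1"    # path prefix of the source dataset
    model="$2"      # model name embedded in the shard filenames
    # Concatenate  <dataset>_distillation_<model>.shard-*.jsonl
    # into the merged <dataset>_distillation_<model>.jsonl output.
    cat "${dataset}_distillation_${model}".shard-*.jsonl \
        > "${dataset}_distillation_${model}.jsonl"
}
```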
