Automatically generating deployable models from language instructions for computer vision tasks
AutoMMLab is the first request-to-model AutoML platform for computer vision tasks: it understands a user's natural-language request and executes the entire workflow to output a production-ready model. The AutoMMLab pipeline consists of five main stages, including request understanding, data selection, model selection, model training with hyperparameter optimization (HPO), and model deployment. Based on AutoMMLab, we build a benchmark termed LAMP for evaluating end-to-end prompt-based model production, and also for studying each component of the production pipeline.
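The five stages above can be sketched as a simple sequential pipeline. This is a minimal illustration with stub stage functions; the function names and context fields are our assumptions, not the actual AutoMMLab API, which backs each stage with LLM-based parsing, dataset/model zoos, HPO trials, and MMDeploy export:

```python
# Minimal sketch of the five-stage AutoMMLab pipeline.
# Each stage is a stub that passes a context dict along.

def request_understanding(ctx):
    ctx["parsed_request"] = f"parsed({ctx['request']})"  # LLM-parsed request
    return ctx

def data_selection(ctx):
    ctx["dataset"] = "ImageNet"  # e.g. chosen from the dataset zoo
    return ctx

def model_selection(ctx):
    ctx["model"] = "resnet50"  # e.g. chosen from the model zoo
    return ctx

def train_with_hpo(ctx):
    ctx["checkpoint"] = "best.pth"  # trained weights after HPO trials
    return ctx

def deploy(ctx):
    ctx["artifact"] = "model.onnx"  # exported deployable model
    return ctx

STAGES = [request_understanding, data_selection, model_selection,
          train_with_hpo, deploy]

def run_pipeline(request):
    """Thread a context dict through all five stages in order."""
    ctx = {"request": request}
    for stage in STAGES:
        ctx = stage(ctx)
    return ctx
```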
Jan. 29, 2024: AutoMMLab is now open source.
```shell
# download the code
git clone git@github.com:yang-ze-kang/AutoMMLab.git

# create python environment
cd AutoMMLab
conda create -n autommlab python=3.9
source activate autommlab
pip install -r requirements.txt
```

Download the datasets of the dataset zoo:
| Task | Dataset | URL |
|---|---|---|
| Cls. | ImageNet | https://www.image-net.org/challenges/LSVRC/index.php |
| Det. | COCO | https://cocodataset.org/#download |
| Seg. | Cityscapes | https://www.cityscapes-dataset.com/ |
| Kpt. | COCO | https://cocodataset.org/#download |
| Kpt. | AP-10K | https://github.com/AlexTheBad/AP-10K?tab=readme-ov-file#download |
Then change the dataset paths in the configuration file (autommlab/configs.py) to your own paths:
```python
DATASET_ZOO = {
    'ImageNet': 'sh1984:s3://openmmlab/datasets/classification/imagenet',
    'COCO': 'sh1984:s3://openmmlab/datasets/detection/coco',
    'object365': 'sh1984:s3://openmmlab/datasets/detection/Objects365',
    'openimage': 'sh1984:s3://openmmlab/datasets/detection/coco',
    'cityscapes': 's3://openmmlab/datasets/segmentation/cityscapes',
    'ap10k': 'sh1986:s3://ap10k/ap-10k/'
}
```
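Since these entries mix cluster-specific remote prefixes (`sh1984:`, `s3://`) with what may be local paths on your machine, a quick sanity check of the local entries can catch path typos before training starts. This is a minimal sketch; the helper name is ours, not part of AutoMMLab:

```python
import os

def missing_local_datasets(zoo):
    """Return names of zoo entries that look like local paths but don't exist.

    Remote-style entries (anything containing 's3://') are skipped, since
    they cannot be checked with os.path.exists.
    """
    missing = []
    for name, path in zoo.items():
        if "s3://" in path:
            continue  # remote storage; skip local existence check
        if not os.path.exists(path):
            missing.append(name)
    return missing
```

For example, `missing_local_datasets({'COCO': '/data/coco'})` returns `['COCO']` unless `/data/coco` actually exists on your machine.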
- Download the base model and LoRA weights:
  - Base Model: https://huggingface.co/meta-llama/Llama-2-7b-hf/tree/main
  - LoRA Weights: https://drive.google.com/file/d/136jt458c6rMOHwDwwVS6U4iVrX9cmo1p/view?usp=drive_link
- Set the weight paths in the configuration file (autommlab/configs.py):

```python
PATH_LLAMA2 = 'llama_weights/llama-2-7b-hf'
PATH_LORAS = {
    'ru-llama2': 'weights/llama2_lora_weights/save_dir_reqparse_v2',
    'hpo-llama2-classification': 'weights/llama2_lora_weights/hpo_classification',
    'hpo-llama2-detection': 'weights/llama2_lora_weights/hpo_detection',
    'hpo-llama2-segmentation': 'weights/llama2_lora_weights/hpo_segmentation',
    'hpo-llama2-pose': 'weights/llama2_lora_weights/hpo_pose'
}
```
Please edit 'autommlab/configs.py' to modify the configuration of the demo:

```python
URL_LLAMA = "http://127.0.0.1:10068/llama2"
TRAIN_GPU_NUM = 1
RU_MODEL = 'ru-llama2'
HPO_MODEL = 'hpo-llama2'
HPO_MAX_TRY = 3
TENSORBOARD_PORT = 10066
IP_ADDRESS = 'localhost'
```

Then start the demo:

```shell
export PYTHONPATH=$PYTHONPATH:$(pwd)

# step 1
# If you use RU-LLaMA and HPO-LLaMA, please deploy them first.
CUDA_VISIBLE_DEVICES=0 python autommlab/models/deploy_llama.py

# step 2
CUDA_VISIBLE_DEVICES=1 python autommlab/main.py
```
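Once the LLaMA service is up at `URL_LLAMA`, the other components query it over HTTP. A minimal client sketch using only the standard library; the JSON field names in the payload are illustrative assumptions, not the actual protocol of deploy_llama.py:

```python
import json
from urllib import request as urlrequest

URL_LLAMA = "http://127.0.0.1:10068/llama2"  # matches the config above

def build_payload(prompt, model="ru-llama2"):
    # Assemble the JSON body; field names are illustrative assumptions.
    return {"model": model, "prompt": prompt}

def query_llama(prompt, model="ru-llama2", timeout=30):
    """POST a prompt to the deployed LLaMA service and return its JSON reply."""
    data = json.dumps(build_payload(prompt, model)).encode("utf-8")
    req = urlrequest.Request(URL_LLAMA, data=data,
                             headers={"Content-Type": "application/json"})
    with urlrequest.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))

# Example (requires the service from step 1 to be running):
#   reply = query_llama("Train a model to classify cats and dogs.")
```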
- MMEngine: OpenMMLab foundational library for training deep learning models.
- MMCV: OpenMMLab foundational library for computer vision.
- MMPreTrain: OpenMMLab pre-training toolbox and benchmark.
- MMDetection: OpenMMLab detection toolbox and benchmark.
- MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark.
- MMPose: OpenMMLab pose estimation toolbox and benchmark.
- MMDeploy: OpenMMLab model deployment framework.
The code and data are freely available for non-commercial use, and may be redistributed under these conditions. For commercial inquiries, please contact Mr. Sheng Jin (jinsheng13@foxmail.com); we will send you the detailed agreement.
To cite AutoMMLab in publications, please use the following BibTeX entry:
```bibtex
@inproceedings{yang2025autommlab,
  title={AutoMMLab: Automatically generating deployable models from language instructions for computer vision tasks},
  author={Yang, Zekang and Zeng, Wang and Jin, Sheng and Qian, Chen and Luo, Ping and Liu, Wentao},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  number={21},
  pages={22056--22064},
  year={2025}
}
```
