[ Paper ] [ Project Page ]
Authors: Nannan Yan, Yuhao Li, Yingke Mao, Xiao Yu, Wenhao Guan, Jiawei Hou and Taiping Zeng
Tl;dr: PowerSAM is a real-time semantic segmentation framework for edge devices that addresses the main challenges of power system equipment inspection: labor intensity, cost, and human error. By distilling knowledge from large models into compact backbones and coupling a bounding box prompt generator with the segmentation model, PowerSAM significantly reduces computational complexity while maintaining high segmentation accuracy.
To set up the environment for PowerSAM, follow these steps:
- Clone the repository:
```bash
git clone https://github.com/fudan-birlab/PowerSAM.git
```
- Create and activate a new conda environment:
```bash
conda create -n powersam python=3.8 -y
conda activate powersam
```
- Install PyTorch and related packages:
```bash
pip install torch==2.0.0 torchvision==0.15.1 torchaudio==2.0.1 --index-url https://download.pytorch.org/whl/cu118
# or
conda install -y pytorch==2.0.0 torchvision==0.15.1 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia
```
- Install mmdetection dependencies:
```bash
pip install -U openmim
mim install mmengine==0.10.3
mim install mmcv==2.0.0rc4
mim install mmdet==3.3.0
```
- Install the required Python packages:
```bash
pip install -r requirements.txt
```
- Install PowerSAM:
```bash
pip install -e .
```

To get started with PowerSAM, you can follow the example provided in Getting Started with PowerSAM. This notebook demonstrates how to use the PowerSAM model for segmentation in power system scenarios.
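Before running the notebook, you can quickly confirm that the pinned packages resolved correctly. This is a minimal check, not part of the repository:
```python
import torch, torchvision, mmcv, mmdet, mmengine

# Expected versions per the install commands above: torch 2.0.0,
# torchvision 0.15.1, mmcv 2.0.0rc4, mmdet 3.3.0, mmengine 0.10.3.
print(torch.__version__, torchvision.__version__)
print(mmcv.__version__, mmdet.__version__, mmengine.__version__)
print("CUDA available:", torch.cuda.is_available())
```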
Note: Please download the PowerSAM weights first and place them in the weights/ directory:
```bash
mkdir weights
wget https://github.com/fudan-birlab/PowerSAM/releases/download/v0.1.0/powersam_b.pth -O weights/powersam_b.pth
wget https://github.com/fudan-birlab/PowerSAM/releases/download/v0.1.0/powersam_s.pth -O weights/powersam_s.pth
wget https://github.com/fudan-birlab/PowerSAM/releases/download/v0.1.0/box_prompt_generator_repvit_epoch_300.pth -O weights/box_prompt_generator_repvit_epoch_300.pth
```

First, initialize PowerSAM and the bounding box prompt generator:
```python
import torch

from power_sam import sam_model_registry, SamPredictor
from box_prompt_generator.apis import init_box_prompt_generator, inference_box_prompt_generator

device = 'cuda' if torch.cuda.is_available() else 'cpu'

# Load the small PowerSAM checkpoint and wrap it in a predictor.
sam = sam_model_registry["power_sam"](checkpoint="weights/powersam_s.pth", arch="m0").to(device=device)
predictor = SamPredictor(sam)

# Load the bounding box prompt generator from its config and checkpoint.
bbox_prompt_generator = init_box_prompt_generator(
    '../box_prompt_generator/configs/box_prompt_generator/self_s_repvit_m0.py',
    '../weights/box_prompt_generator_repvit_epoch_300.pth',
    device=device
)
```
Second, follow the steps below to predict masks:
```python
import cv2
import torch

# Load the image (image_path points at your input image) and convert
# it from OpenCV's BGR order to RGB.
image = cv2.imread(image_path)
image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)

# Run the bounding box prompt generator and keep its backbone features.
result, feats = inference_box_prompt_generator(bbox_prompt_generator, image, return_feats=True)
bbox_prompts = result.pred_instances
bboxes = bbox_prompts.bboxes
bbox_labels = bbox_prompts.labels
bbox_scores = bbox_prompts.scores

# Set the image geometry manually instead of calling set_image, since the
# image embedding is reused from the prompt generator's features.
predictor.original_size = image.shape[:2]
predictor.input_size = predictor.transform.get_preprocess_shape(image.shape[0], image.shape[1], predictor.transform.target_length)
predictor.is_image_set = True

# Map the predicted boxes into the model's input coordinate frame.
transformed_boxes = predictor.transform.apply_boxes_torch(bboxes, image.shape[:2])

# Decode one mask per box, resizing the shared features to 64x64 if needed.
masks, _, _ = predictor.predict_torch(
    feats[-1] if feats[-1].shape[-2:] == (64, 64)
    else torch.nn.functional.interpolate(feats[-1], (64, 64), mode="bilinear"),
    point_coords=None,
    point_labels=None,
    boxes=transformed_boxes,
    num_multimask_outputs=1,
)
```
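To inspect the results, you can overlay the predicted boxes and masks on the image. This is a minimal sketch, assuming `masks` is an (N, 1, H, W) boolean tensor at the original image resolution as in SAM-style predictors; verify the exact output shape against your checkpoint:
```python
import numpy as np

overlay = image.copy()
for mask, box in zip(masks, bboxes):
    m = mask[0].cpu().numpy().astype(bool)       # (H, W) boolean mask
    color = np.random.randint(0, 256, size=3)    # random color per instance
    overlay[m] = 0.5 * overlay[m] + 0.5 * color  # alpha-blend the mask region
    x1, y1, x2, y2 = map(int, box.tolist())
    cv2.rectangle(overlay, (x1, y1), (x2, y2), color.tolist(), 2)
cv2.imwrite("prediction.png", cv2.cvtColor(overlay, cv2.COLOR_RGB2BGR))
```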
First, prepare the SAM training and validation datasets in the SA-1B Dataset format, and the bounding box prompt generator datasets in the format expected by MMDetection (see the sketch below).
To train PowerSAM, download the teacher model weights and move them into the weight/ folder.
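For reference, an SA-1B-style dataset pairs each image with a per-image JSON file of RLE-encoded masks. The path below is hypothetical and the structure is abridged from the public SA-1B release, not taken from this repository:
```python
import json

# Hypothetical path: SA-1B stores one JSON annotation file per image.
ann = json.load(open("data/sa_000000/sa_1.json"))

ann["image"]        # {"image_id": ..., "width": ..., "height": ..., "file_name": "sa_1.jpg"}
ann["annotations"]  # list of {"bbox": [x, y, w, h],
                    #          "segmentation": {"size": [h, w], "counts": "<RLE>"},
                    #          "area": ..., "predicted_iou": ..., "stability_score": ...}
```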
Distill the backbone, mask decoder, and bounding box prompt generator:
```bash
bash training_all_stages.sh
```
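For intuition, the backbone distillation stage can be thought of as training the compact student encoder to match the frozen teacher's image embeddings. The sketch below is a hypothetical illustration of that idea, not the actual code behind training_all_stages.sh; `student`, `teacher`, and `image_encoder` are assumed names:
```python
import torch
import torch.nn.functional as F

def distill_step(student, teacher, images, optimizer):
    # The teacher stays frozen; its embeddings serve as targets.
    with torch.no_grad():
        target = teacher.image_encoder(images)
    # The student encoder is trained to reproduce the teacher's features.
    pred = student.image_encoder(images)
    loss = F.mse_loss(pred, target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```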
We would like to acknowledge the following projects and their contributions to the development of our work:
- SAM with Apache License
- SAM2 with Apache License
- SlimSAM with Apache License
- EdgeSAM with S-Lab License 1.0
- MMDetection with Apache License
- YOLOX with Apache License


