Skip to content
#

bbox-analytics

Here are 3 public repositories matching this topic...

This repository contains the Mask Generator project, which leverages the Segment Anything model by Meta AI to automatically generate masks from video frames. It uses the Segment Anything model to segment and process video frames, producing accurate segmentation masks that are then encoded into Run-Length Encoding (RLE) format.

  • Updated Dec 9, 2024
  • Jupyter Notebook

This repository aims to select and track objects using sound control. The architecture is multimodal, featuring two separate Transformer encoder models capable of processing both image and audio features. The system ultimately identifies the most suitable bounding box(es) based on these multimodal inputs.

  • Updated Nov 28, 2025
  • Python

Improve this page

Add a description, image, and links to the bbox-analytics topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the bbox-analytics topic, visit your repo's landing page and select "manage topics."

Learn more