SAM2 Interactive Segmentation Tool

This repository contains an interactive segmentation tool built using Meta's Segment Anything Model 2 (SAM2). It provides a graphical user interface for performing segmentation on images, videos, and live camera feeds.

First, set up the SAM2 modules by following the instructions at https://github.com/facebookresearch/sam2

Then place the .pt checkpoint files in sam2/sam/checkpoints

Results

Web UI

Overlay Image

Transparent Image

Features

Interactive Segmentation: Click on images to add positive/negative points for segmentation
Box Selection: Control + click and drag to create bounding boxes for segmentation
Multi-Object Support: Create and manage multiple segmentation masks simultaneously
Image Segmentation: Process single images with point and box prompts
Video Segmentation: Annotate videos frame-by-frame with ability to propagate annotations across frames
Live Camera Segmentation: Real-time segmentation from camera feed, screen capture, or video files
Export Options: Save segmentation masks in multiple formats (binary, color, overlay, numpy arrays)
Video Export: Create annotated videos with segmentation overlays
Transparent Export: Save transparent masks where only segmented pixels remain and the rest are alpha = 0. Enabled via a "Save Transparent" checkbox in the GUI.
Model Selection: Support for different SAM2 model variants (tiny, small, base+, large)
Frame Extraction Control: Configure percentage of frames to extract from videos
Undo/Redo: Comprehensive history with undo/redo functionality
Keyboard Shortcuts: Various shortcuts for efficient operation

Requirements

Python 3.8+
PyTorch
SAM2 models and checkpoints
OpenCV
Tkinter
Matplotlib
NumPy
PIL/Pillow
Hydra
OmegaConf
mss (for screen capture)
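
Before launching the GUI, it can help to confirm that the packages above are importable. The following is a minimal sketch, not part of this repository: the helper name find_missing is hypothetical, and the import names (for example cv2 for OpenCV and PIL for Pillow) are the usual ones for these packages.

```python
import importlib.util

# Display name -> import name. The two differ in a few cases
# (OpenCV is imported as cv2, Pillow as PIL).
REQUIRED_MODULES = {
    "PyTorch": "torch",
    "OpenCV": "cv2",
    "Tkinter": "tkinter",
    "Matplotlib": "matplotlib",
    "NumPy": "numpy",
    "Pillow": "PIL",
    "Hydra": "hydra",
    "OmegaConf": "omegaconf",
    "mss": "mss",
}


def find_missing(modules):
    """Return the display names whose import name cannot be located."""
    return [
        name
        for name, import_name in modules.items()
        if importlib.util.find_spec(import_name) is None
    ]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

This only checks that each module can be found; it does not verify versions or that the SAM2 checkpoints are in place.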

Directory Structure

SAM/
├── gui.py              # Main GUI application
├── tools.py            # Utility functions for visualization and processing
├── workspace.ipynb     # Jupyter notebook for experimentation
├── sam2/               # SAM2 core modules and models
├── input/              # Input images and videos (subdirectories: img/, vid/)
├── output/             # Output results (subdirectories: img/, vid/)
├── sample_data/        # Sample input data for testing
└── configs/            # Configuration files for different SAM2 models

Setup

Install required dependencies
Download SAM2 model checkpoints to ./sam2/sam/checkpoints/
Run the application:

python gui.py

Usage

Image Mode

Select "Image Mode" from the controls
Open an image file using "File > Open Image"
Click to add positive points (green stars)
Hold Shift to toggle to negative points (red stars)
Use Ctrl+click+drag to create bounding boxes
Press Enter or click "Predict Mask" to generate segmentation
Press "A" to add a new mask or "D" to delete the current mask
Adjust overlay alpha and borders as needed
Save results via "File > Save Masks Image"
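
The point and box prompts described in the steps above can be represented as small NumPy arrays. This is a sketch under assumptions, not the code in gui.py: build_prompts is a hypothetical helper, and the click tuple format (x, y, is_positive) is chosen only for illustration. Label 1 marks a positive point and 0 a negative point, and the box is normalized to (x0, y0, x1, y1) so that a drag in any direction gives the same result.

```python
import numpy as np


def build_prompts(clicks, box=None):
    """Convert GUI-style clicks into prompt arrays.

    clicks: list of (x, y, is_positive) tuples in pixel coordinates.
    box:    optional (x0, y0, x1, y1) drag rectangle, corners in any order.
    """
    # One row per click; reshape keeps the (N, 2) shape even when N == 0.
    point_coords = np.array(
        [[x, y] for x, y, _ in clicks], dtype=np.float32
    ).reshape(-1, 2)
    # 1 = positive (foreground) point, 0 = negative (background) point.
    point_labels = np.array(
        [1 if positive else 0 for _, _, positive in clicks], dtype=np.int32
    )
    box_xyxy = None
    if box is not None:
        x0, y0, x1, y1 = box
        # Sort the corners so the box is always top-left to bottom-right.
        box_xyxy = np.array(
            [min(x0, x1), min(y0, y1), max(x0, x1), max(y0, y1)],
            dtype=np.float32,
        )
    return point_coords, point_labels, box_xyxy
```

With the upstream SAM2ImagePredictor, arrays in this form would typically be passed as predictor.predict(point_coords=..., point_labels=..., box=...) after calling predictor.set_image(image). Whether this tool calls the predictor in exactly that way is not stated here.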

Output formats for saving image masks:
- Binary: Grayscale mask with different values for each segmented object
- Color: Colored mask with different colors for each object
- Overlay: Original image with segmentation overlay
- Numpy: Raw segmentation data in numpy array format
- Transparent: Images that have the background set to transparent
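
To make the differences between these formats concrete, the sketch below builds each one from a list of boolean masks with NumPy. It is an illustration under assumptions rather than the code in tools.py: the function names, the palette layout (row 0 reserved for the background), and the default alpha are all hypothetical.

```python
import numpy as np


def label_map(masks):
    """Binary format: 0 = background, i + 1 = pixels of object i."""
    labels = np.zeros(masks[0].shape, dtype=np.uint8)
    for i, mask in enumerate(masks):
        labels[mask] = i + 1  # later masks overwrite earlier ones on overlap
    return labels


def color_mask(labels, palette):
    """Color format: look up an RGB color for every label value."""
    return palette[labels]


def overlay(image, labels, palette, alpha=0.5):
    """Overlay format: blend object colors into the original image."""
    out = image.astype(np.float32)
    fg = labels > 0
    out[fg] = (1 - alpha) * out[fg] + alpha * palette[labels[fg]]
    return out.astype(np.uint8)


def transparent(image, labels):
    """Transparent format: RGBA image with alpha = 0 outside all masks."""
    alpha = np.where(labels > 0, 255, 0).astype(np.uint8)
    return np.dstack([image, alpha])
```

The Numpy format corresponds to saving the label array itself, for example with np.save. An RGBA array like the one returned by transparent needs a container that supports an alpha channel, such as PNG.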

Video Mode

Select "Video Mode" from the controls
Open a video file using "File > Open Video"
Choose an output directory for extracted frames
Add annotations to the current frame (similar to Image Mode)
Use propagation to extend annotations across the video
Navigate frames with the slider or navigation buttons
Choose the save modes (the Save Transparent mode creates green-screen MP4 files)
Save results via "File > Save Masks Video" (video results will be in video_name/raw_folder/)
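
The frame extraction percentage mentioned under Features determines how many frames are pulled from the video before annotation. One plausible way to pick them is evenly spaced sampling, sketched below. The function name frames_to_extract and the sampling strategy are assumptions; the tool may choose frames differently.

```python
def frames_to_extract(total_frames, percent):
    """Return evenly spaced frame indices covering about `percent` of the video."""
    if total_frames <= 0:
        return []
    if not 0 < percent <= 100:
        raise ValueError("percent must be in the range (0, 100]")
    # Always keep at least one frame, even for very small percentages.
    count = max(1, round(total_frames * percent / 100))
    step = total_frames / count
    return [int(i * step) for i in range(count)]
```

For example, frames_to_extract(10, 50) selects every second frame of a 10-frame clip. Extracting fewer frames reduces disk usage and propagation time at the cost of temporal resolution.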

Live Mode

Select "Live Mode" from the controls
Choose a video source (Camera, Screen, or Video File; currently only Camera is supported)
Add initial annotations
Start the stream and optionally enable live propagation
Adjust delay for processing speed

Keyboard Shortcuts

Left Click: Add positive point
Ctrl + Left Click + Drag: Create bounding box
Enter: Predict mask
A: Add new mask
D: Delete current mask
Shift: Toggle point label (positive/negative)
Ctrl+Z: Undo
Ctrl+Y: Redo
Ctrl+S: Save masks
Space: Play/Pause video (in video mode)
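
The Ctrl+Z and Ctrl+Y shortcuts imply an undo/redo history. A common way to implement one is a pair of stacks, sketched below. The History class is hypothetical and is not taken from gui.py; it only illustrates the technique.

```python
class History:
    """Two-stack undo/redo history for annotation states."""

    def __init__(self):
        self._undo = []
        self._redo = []

    def record(self, state):
        """Store a snapshot taken before a change; a new change clears redo."""
        self._undo.append(state)
        self._redo.clear()

    def undo(self, current):
        """Return the previous state, or `current` if there is nothing to undo."""
        if not self._undo:
            return current
        self._redo.append(current)
        return self._undo.pop()

    def redo(self, current):
        """Return the next state, or `current` if there is nothing to redo."""
        if not self._redo:
            return current
        self._undo.append(current)
        return self._redo.pop()
```

A caller would record a copy of the annotation state before each change, for example history.record(list(points)) before appending a new point, so that later edits do not mutate stored snapshots.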

Model Selection

The application supports multiple SAM2 model variants:

SAM2 Hiera Tiny (fastest, smallest)
SAM2 Hiera Small (good balance of speed and accuracy)
SAM2 Hiera Base+ (better accuracy)
SAM2 Hiera Large (highest accuracy)

Choose your preferred model in "File > Insert Model".

Real Time

https://github.com/Gy920/segment-anything-2-real-time
