Examples
Surgical Video
Select Model Architecture
VLM (Baseline)
ViT (Vision Transformer)
Run Inference