--- comments: true description: Learn about the Vision Transformer (ViT) and segment anything with SAM models. Train and use pre-trained models with Python API. --- # Vision Transformers Vit models currently support Python environment: ```python from ultralytics.vit import SAM # from ultralytics.vit import MODEL_TYPE model = SAM("sam_b.pt") model.info() # display model information model.predict(...) # predict ``` # Segment Anything ## About ## Supported Tasks | Model Type | Pre-trained Weights | Tasks Supported | |------------|---------------------|-----------------------| | sam base | `sam_b.pt` | Instance Segmentation | | sam large | `sam_l.pt` | Instance Segmentation | ## Supported Modes | Mode | Supported | |------------|--------------------| | Inference | :heavy_check_mark: | | Validation | :x: | | Training | :x: |