ultralytics 8.0.53
DDP AMP and Edge TPU fixes (#1362)
Co-authored-by: Richard Aljaste <richardaljasteabramson@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Vuong Kha Sieu <75152429+hotfur@users.noreply.github.com>

docs/modes/benchmark.md (new file, +65 lines)

<img width="1024" src="https://github.com/ultralytics/assets/raw/main/yolov8/banner-integrations.png">

**Benchmark mode** is used to profile the speed and accuracy of various export formats for YOLOv8. The benchmarks provide information on the size of the exported format, its `mAP50-95` metrics (for object detection and segmentation) or `accuracy_top5` metrics (for classification), and the inference time in milliseconds per image across various export formats like ONNX, OpenVINO, TensorRT and others. This information can help users choose the optimal export format for their specific use case based on their requirements for speed and accuracy.

!!! tip "Tip"

    * Export to ONNX or OpenVINO for up to 3x CPU speedup.
    * Export to TensorRT for up to 5x GPU speedup.

## Usage Examples

Run YOLOv8n benchmarks on all supported export formats, including ONNX, TensorRT, etc. See the Arguments section below for a full list of benchmark arguments.

!!! example ""

    === "Python"

        ```python
        from ultralytics.yolo.utils.benchmarks import benchmark

        # Benchmark
        benchmark(model='yolov8n.pt', imgsz=640, half=False, device=0)
        ```

    === "CLI"

        ```bash
        yolo benchmark model=yolov8n.pt imgsz=640 half=False device=0
        ```

## Arguments

Arguments such as `model`, `imgsz`, `half`, `device`, and `hard_fail` provide users with the flexibility to fine-tune the benchmarks to their specific needs and compare the performance of different export formats with ease; see the example after the table below.

| Key         | Value   | Description                                                           |
|-------------|---------|-----------------------------------------------------------------------|
| `model`     | `None`  | path to model file, i.e. yolov8n.pt, yolov8n.yaml                     |
| `imgsz`     | `640`   | image size as scalar or (h, w) list, i.e. (640, 480)                  |
| `half`      | `False` | FP16 quantization                                                     |
| `device`    | `None`  | device to run on, i.e. cuda device=0 or device=0,1,2,3 or device=cpu  |
| `hard_fail` | `False` | do not continue on error (bool), or val floor threshold (float)       |
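
For example, `hard_fail` accepts either a boolean or a float. A minimal sketch of both forms, using the `benchmark` utility from the Usage Examples above (the 0.29 floor is illustrative):

```python
from ultralytics.yolo.utils.benchmarks import benchmark

# Raise an error instead of continuing when an export format fails
benchmark(model='yolov8n.pt', imgsz=640, hard_fail=True)

# Treat any format scoring below 0.29 mAP50-95 as a failure
benchmark(model='yolov8n.pt', imgsz=640, hard_fail=0.29)
```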

## Export Formats

Benchmarks will attempt to run automatically on all possible export formats below.

| Format                                                             | `format` Argument | Model                     | Metadata |
|--------------------------------------------------------------------|-------------------|---------------------------|----------|
| [PyTorch](https://pytorch.org/)                                    | -                 | `yolov8n.pt`              | ✅        |
| [TorchScript](https://pytorch.org/docs/stable/jit.html)            | `torchscript`     | `yolov8n.torchscript`     | ✅        |
| [ONNX](https://onnx.ai/)                                           | `onnx`            | `yolov8n.onnx`            | ✅        |
| [OpenVINO](https://docs.openvino.ai/latest/index.html)             | `openvino`        | `yolov8n_openvino_model/` | ✅        |
| [TensorRT](https://developer.nvidia.com/tensorrt)                  | `engine`          | `yolov8n.engine`          | ✅        |
| [CoreML](https://github.com/apple/coremltools)                     | `coreml`          | `yolov8n.mlmodel`         | ✅        |
| [TF SavedModel](https://www.tensorflow.org/guide/saved_model)      | `saved_model`     | `yolov8n_saved_model/`    | ✅        |
| [TF GraphDef](https://www.tensorflow.org/api_docs/python/tf/Graph) | `pb`              | `yolov8n.pb`              | ❌        |
| [TF Lite](https://www.tensorflow.org/lite)                         | `tflite`          | `yolov8n.tflite`          | ✅        |
| [TF Edge TPU](https://coral.ai/docs/edgetpu/models-intro/)         | `edgetpu`         | `yolov8n_edgetpu.tflite`  | ✅        |
| [TF.js](https://www.tensorflow.org/js)                             | `tfjs`            | `yolov8n_web_model/`      | ✅        |
| [PaddlePaddle](https://github.com/PaddlePaddle)                    | `paddle`          | `yolov8n_paddle_model/`   | ✅        |

docs/modes/export.md (new file, +81 lines)

<img width="1024" src="https://github.com/ultralytics/assets/raw/main/yolov8/banner-integrations.png">

**Export mode** is used for exporting a YOLOv8 model to a format that can be used for deployment. In this mode, the model is converted to a format that can be used by other software applications or hardware devices. This mode is useful when deploying the model to production environments.

!!! tip "Tip"

    * Export to ONNX or OpenVINO for up to 3x CPU speedup.
    * Export to TensorRT for up to 5x GPU speedup.

## Usage Examples

Export a YOLOv8n model to a different format like ONNX or TensorRT. See the Arguments section below for a full list of export arguments.

!!! example ""

    === "Python"

        ```python
        from ultralytics import YOLO

        # Load a model
        model = YOLO("yolov8n.pt")  # load an official model
        model = YOLO("path/to/best.pt")  # load a custom trained model

        # Export the model
        model.export(format="onnx")
        ```

    === "CLI"

        ```bash
        yolo export model=yolov8n.pt format=onnx  # export official model
        yolo export model=path/to/best.pt format=onnx  # export custom trained model
        ```

## Arguments

Export settings for YOLO models refer to the various configurations and options used to save or export the model for use in other environments or platforms. These settings can affect the model's performance, size, and compatibility with different systems. Some common YOLO export settings include the format of the exported model file (e.g. ONNX, TensorFlow SavedModel), the device on which the model will be run (e.g. CPU, GPU), and the presence of additional features such as masks or multiple labels per box. Other factors that may affect the export process include the specific task the model is being used for and the requirements or constraints of the target environment or platform. It is important to carefully consider and configure these settings to ensure that the exported model is optimized for the intended use case and can be used effectively in the target environment; see the example after the table below.

| Key         | Value           | Description                                           |
|-------------|-----------------|-------------------------------------------------------|
| `format`    | `'torchscript'` | format to export to                                   |
| `imgsz`     | `640`           | image size as scalar or (h, w) list, i.e. (640, 480)  |
| `keras`     | `False`         | use Keras for TF SavedModel export                    |
| `optimize`  | `False`         | TorchScript: optimize for mobile                      |
| `half`      | `False`         | FP16 quantization                                     |
| `int8`      | `False`         | INT8 quantization                                     |
| `dynamic`   | `False`         | ONNX/TF/TensorRT: dynamic axes                        |
| `simplify`  | `False`         | ONNX: simplify model                                  |
| `opset`     | `None`          | ONNX: opset version (optional, defaults to latest)    |
| `workspace` | `4`             | TensorRT: workspace size (GB)                         |
| `nms`       | `False`         | CoreML: add NMS                                       |
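
These arguments can be combined in a single `export()` call. A minimal sketch (the argument values are illustrative, not recommendations):

```python
from ultralytics import YOLO

model = YOLO("yolov8n.pt")

# ONNX export with a simplified graph, dynamic axes and an explicit opset
model.export(format="onnx", imgsz=640, dynamic=True, simplify=True, opset=12)

# TensorRT export with FP16 quantization and a 4 GB workspace
model.export(format="engine", half=True, workspace=4)
```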

## Export Formats

Available YOLOv8 export formats are in the table below. You can export to any format using the `format` argument, i.e. `format='onnx'` or `format='engine'`.

| Format                                                             | `format` Argument | Model                     | Metadata |
|--------------------------------------------------------------------|-------------------|---------------------------|----------|
| [PyTorch](https://pytorch.org/)                                    | -                 | `yolov8n.pt`              | ✅        |
| [TorchScript](https://pytorch.org/docs/stable/jit.html)            | `torchscript`     | `yolov8n.torchscript`     | ✅        |
| [ONNX](https://onnx.ai/)                                           | `onnx`            | `yolov8n.onnx`            | ✅        |
| [OpenVINO](https://docs.openvino.ai/latest/index.html)             | `openvino`        | `yolov8n_openvino_model/` | ✅        |
| [TensorRT](https://developer.nvidia.com/tensorrt)                  | `engine`          | `yolov8n.engine`          | ✅        |
| [CoreML](https://github.com/apple/coremltools)                     | `coreml`          | `yolov8n.mlmodel`         | ✅        |
| [TF SavedModel](https://www.tensorflow.org/guide/saved_model)      | `saved_model`     | `yolov8n_saved_model/`    | ✅        |
| [TF GraphDef](https://www.tensorflow.org/api_docs/python/tf/Graph) | `pb`              | `yolov8n.pb`              | ❌        |
| [TF Lite](https://www.tensorflow.org/lite)                         | `tflite`          | `yolov8n.tflite`          | ✅        |
| [TF Edge TPU](https://coral.ai/docs/edgetpu/models-intro/)         | `edgetpu`         | `yolov8n_edgetpu.tflite`  | ✅        |
| [TF.js](https://www.tensorflow.org/js)                             | `tfjs`            | `yolov8n_web_model/`      | ✅        |
| [PaddlePaddle](https://github.com/PaddlePaddle)                    | `paddle`          | `yolov8n_paddle_model/`   | ✅        |

docs/modes/index.md (new file, +62 lines)

# YOLOv8 Modes

<img width="1024" src="https://github.com/ultralytics/assets/raw/main/yolov8/banner-integrations.png">

Ultralytics YOLOv8 supports several **modes** that can be used to perform different tasks. These modes are:

**Train**: For training a YOLOv8 model on a custom dataset.
**Val**: For validating a YOLOv8 model after it has been trained.
**Predict**: For making predictions using a trained YOLOv8 model on new images or videos.
**Export**: For exporting a YOLOv8 model to a format that can be used for deployment.
**Track**: For tracking objects in real time using a YOLOv8 model.
**Benchmark**: For benchmarking the speed and accuracy of YOLOv8 exports (ONNX, TensorRT, etc.).

## [Train](train.md)

Train mode is used for training a YOLOv8 model on a custom dataset. In this mode, the model is trained using the specified dataset and hyperparameters. The training process involves optimizing the model's parameters so that it can accurately predict the classes and locations of objects in an image.

[Train Examples](train.md){ .md-button .md-button--primary}

## [Val](val.md)

Val mode is used for validating a YOLOv8 model after it has been trained. In this mode, the model is evaluated on a validation set to measure its accuracy and generalization performance. This mode can be used to tune the hyperparameters of the model to improve its performance.

[Val Examples](val.md){ .md-button .md-button--primary}

## [Predict](predict.md)

Predict mode is used for making predictions using a trained YOLOv8 model on new images or videos. In this mode, the model is loaded from a checkpoint file, and the user can provide images or videos to perform inference. The model predicts the classes and locations of objects in the input images or videos.

[Predict Examples](predict.md){ .md-button .md-button--primary}

## [Export](export.md)

Export mode is used for exporting a YOLOv8 model to a format that can be used for deployment. In this mode, the model is converted to a format that can be used by other software applications or hardware devices. This mode is useful when deploying the model to production environments.

[Export Examples](export.md){ .md-button .md-button--primary}

## [Track](track.md)

Track mode is used for tracking objects in real time using a YOLOv8 model. In this mode, the model is loaded from a checkpoint file, and the user can provide a live video stream to perform real-time object tracking. This mode is useful for applications such as surveillance systems or self-driving cars.

[Track Examples](track.md){ .md-button .md-button--primary}

## [Benchmark](benchmark.md)

Benchmark mode is used to profile the speed and accuracy of various export formats for YOLOv8. The benchmarks provide information on the size of the exported format, its `mAP50-95` metrics (for object detection and segmentation) or `accuracy_top5` metrics (for classification), and the inference time in milliseconds per image across various export formats like ONNX, OpenVINO, TensorRT and others. This information can help users choose the optimal export format for their specific use case based on their requirements for speed and accuracy.

[Benchmark Examples](benchmark.md){ .md-button .md-button--primary}

docs/modes/predict.md (new file, +180 lines)

<img width="1024" src="https://github.com/ultralytics/assets/raw/main/yolov8/banner-integrations.png">

Inference or prediction of a task returns a list of `Results` objects. Alternatively, in streaming mode, it returns a generator of `Results` objects, which is memory efficient. Streaming mode can be enabled by passing `stream=True` in the predictor's call method.

!!! example "Predict"

    === "Return a List"

        ```python
        from ultralytics import YOLO

        model = YOLO("yolov8n.pt")  # load a pretrained model
        inputs = [img, img]  # list of numpy arrays
        results = model(inputs)  # list of Results objects

        for result in results:
            boxes = result.boxes  # Boxes object for bbox outputs
            masks = result.masks  # Masks object for segmentation masks outputs
            probs = result.probs  # Class probabilities for classification outputs
        ```

    === "Return a Generator"

        ```python
        from ultralytics import YOLO

        model = YOLO("yolov8n.pt")  # load a pretrained model
        inputs = [img, img]  # list of numpy arrays
        results = model(inputs, stream=True)  # generator of Results objects

        for r in results:
            boxes = r.boxes  # Boxes object for bbox outputs
            masks = r.masks  # Masks object for segmentation masks outputs
            probs = r.probs  # Class probabilities for classification outputs
        ```

## Sources

YOLOv8 can run inference on a variety of sources. The table below lists the various sources that can be used as input for YOLOv8, along with the required format and notes. Sources include images, URLs, PIL images, OpenCV, numpy arrays, torch tensors, CSV files, videos, directories, globs, YouTube videos, and streams. The table also indicates whether each source can be used as a stream and the model argument required for that source.

| source     | stream | model(arg)                                 | type           | notes            |
|------------|--------|--------------------------------------------|----------------|------------------|
| image      |        | `'im.jpg'`                                 | `str`, `Path`  |                  |
| URL        |        | `'https://ultralytics.com/images/bus.jpg'` | `str`          |                  |
| screenshot |        | `'screen'`                                 | `str`          |                  |
| PIL        |        | `Image.open('im.jpg')`                     | `PIL.Image`    | HWC, RGB         |
| OpenCV     |        | `cv2.imread('im.jpg')[:,:,::-1]`           | `np.ndarray`   | HWC, BGR to RGB  |
| numpy      |        | `np.zeros((640,1280,3))`                   | `np.ndarray`   | HWC              |
| torch      |        | `torch.zeros(16,3,320,640)`                | `torch.Tensor` | BCHW, RGB        |
| CSV        |        | `'sources.csv'`                            | `str`, `Path`  | RTSP, RTMP, HTTP |
| video      | ✓      | `'vid.mp4'`                                | `str`, `Path`  |                  |
| directory  | ✓      | `'path/'`                                  | `str`, `Path`  |                  |
| glob       | ✓      | `'path/*.jpg'`                             | `str`          | Use `*` operator |
| YouTube    | ✓      | `'https://youtu.be/Zgi9g1ksQHc'`           | `str`          |                  |
| stream     | ✓      | `'rtsp://example.com/media.mp4'`           | `str`          | RTSP, RTMP, HTTP |
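
As a minimal sketch of a few of these sources in use (the paths and URLs are the placeholders from the table above):

```python
from ultralytics import YOLO

model = YOLO('yolov8n.pt')

results = model('im.jpg')                                  # local image
results = model('https://ultralytics.com/images/bus.jpg')  # URL

# Streamable sources yield one Results object per frame
for r in model('vid.mp4', stream=True):
    boxes = r.boxes
```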

## Image Formats

For images, YOLOv8 supports a variety of image formats defined in [yolo/data/utils.py](https://github.com/ultralytics/ultralytics/blob/main/ultralytics/yolo/data/utils.py). The following suffixes are valid for images:

| Image Suffixes | Example Predict Command          | Reference                                                                            |
|----------------|----------------------------------|--------------------------------------------------------------------------------------|
| bmp            | `yolo predict source=image.bmp`  | [Microsoft](https://docs.microsoft.com/en-us/windows/win32/gdi/bitmap-file-format)   |
| dng            | `yolo predict source=image.dng`  | [Adobe](https://helpx.adobe.com/photoshop/using/digital-negative.html)               |
| jpeg           | `yolo predict source=image.jpeg` | [Joint Photographic Experts Group](https://jpeg.org/jpeg/)                           |
| jpg            | `yolo predict source=image.jpg`  | [Joint Photographic Experts Group](https://jpeg.org/jpeg/)                           |
| mpo            | `yolo predict source=image.mpo`  | [CIPA](https://www.cipa.jp/std/documents/e/DC-007-Translation-2018-E.pdf)            |
| png            | `yolo predict source=image.png`  | [Portable Network Graphics](https://www.w3.org/TR/PNG/)                              |
| tif            | `yolo predict source=image.tif`  | [Adobe](https://www.adobe.com/content/dam/acom/en/products/photoshop/pdfs/tiff6.pdf) |
| tiff           | `yolo predict source=image.tiff` | [Adobe](https://www.adobe.com/content/dam/acom/en/products/photoshop/pdfs/tiff6.pdf) |
| webp           | `yolo predict source=image.webp` | [Google Developers](https://developers.google.com/speed/webp)                        |
| pfm            | `yolo predict source=image.pfm`  | [HDR Labs](http://hdrlabs.com/tools/pfrenchy/)                                       |

## Video Formats

For videos, YOLOv8 also supports a variety of video formats defined in [yolo/data/utils.py](https://github.com/ultralytics/ultralytics/blob/main/ultralytics/yolo/data/utils.py). The following suffixes are valid for videos:

| Video Suffixes | Example Predict Command          | Reference                                                                                                       |
|----------------|----------------------------------|------------------------------------------------------------------------------------------------------------------|
| asf            | `yolo predict source=video.asf`  | [Microsoft](https://docs.microsoft.com/en-us/windows/win32/wmformat/asf-file-structure)                           |
| avi            | `yolo predict source=video.avi`  | [Microsoft](https://docs.microsoft.com/en-us/windows/win32/directshow/avi-riff-file-reference)                    |
| gif            | `yolo predict source=video.gif`  | [CompuServe](https://www.w3.org/Graphics/GIF/spec-gif89a.txt)                                                     |
| m4v            | `yolo predict source=video.m4v`  | [Apple](https://developer.apple.com/library/archive/documentation/QuickTime/QTFF/QTFFChap2/qtff2.html)            |
| mkv            | `yolo predict source=video.mkv`  | [Matroska](https://matroska.org/technical/specs/index.html)                                                       |
| mov            | `yolo predict source=video.mov`  | [Apple](https://developer.apple.com/library/archive/documentation/QuickTime/QTFF/QTFFPreface/qtffPreface.html)    |
| mp4            | `yolo predict source=video.mp4`  | [ISO 68939](https://www.iso.org/standard/68939.html)                                                              |
| mpeg           | `yolo predict source=video.mpeg` | [ISO 56021](https://www.iso.org/standard/56021.html)                                                              |
| mpg            | `yolo predict source=video.mpg`  | [ISO 56021](https://www.iso.org/standard/56021.html)                                                              |
| ts             | `yolo predict source=video.ts`   | [MPEG Transport Stream](https://en.wikipedia.org/wiki/MPEG_transport_stream)                                      |
| wmv            | `yolo predict source=video.wmv`  | [Microsoft](https://docs.microsoft.com/en-us/windows/win32/wmformat/wmv-file-structure)                           |
| webm           | `yolo predict source=video.webm` | [Google Developers](https://developers.google.com/media/vp9/getting-started/webm-file-format)                     |

## Working with Results

A `Results` object consists of the following component objects:

- `Results.boxes`: `Boxes` object with properties and methods for manipulating bounding boxes
- `Results.masks`: `Masks` object used to index masks or to get segment coordinates
- `Results.probs`: `torch.Tensor` containing the class probabilities/logits
- `Results.orig_img`: the original image loaded in memory
- `Results.path`: `Path` containing the path to the input image

Each result's components are `torch.Tensor` by default, so you can easily use the following functionality:

```python
results = results.cuda()
results = results.cpu()
results = results.to("cpu")
results = results.numpy()
```

### Boxes

A `Boxes` object can be used to index, manipulate, and convert bounding boxes to different formats. Box format conversion operations are cached, which means they're only calculated once per object, and those values are reused for future calls.

- Indexing a `Boxes` object returns a `Boxes` object:

```python
results = model(inputs)
boxes = results[0].boxes
box = boxes[0]  # returns one box
box.xyxy
```

- Properties and conversions:

```python
boxes.xyxy   # box with xyxy format, (N, 4)
boxes.xywh   # box with xywh format, (N, 4)
boxes.xyxyn  # box with xyxy format but normalized, (N, 4)
boxes.xywhn  # box with xywh format but normalized, (N, 4)
boxes.conf   # confidence score, (N, 1)
boxes.cls    # cls, (N, 1)
boxes.data   # raw bboxes tensor, (N, 6), also available as boxes.boxes
```

### Masks

A `Masks` object can be used to index, manipulate, and convert masks to segments. The segment conversion operation is cached.

```python
results = model(inputs)
masks = results[0].masks  # Masks object
masks.segments  # bounding coordinates of masks, List[segment] * N
masks.data      # raw masks tensor, (N, H, W), also available as masks.masks
```

### probs

The `probs` attribute of the `Results` class is a `Tensor` containing the class probabilities of a classification operation.

```python
results = model(inputs)
results[0].probs  # cls prob, (num_class, )
```

Class reference documentation for the `Results` module and its components can be found [here](../reference/results.md).

## Plotting results

You can use the `plot()` function of the `Results` object to plot results on an image. It plots all components (boxes, masks, classification logits, etc.) found in the results object.

```python
import cv2

res = model(img)
res_plotted = res[0].plot()
cv2.imshow("result", res_plotted)
cv2.waitKey(0)  # hold the window open until a key is pressed
```

!!! example "`plot()` arguments"

    `show_conf (bool)`: Show confidence

    `line_width (Float)`: The line width of boxes. Automatically scaled to img size if not provided

    `font_size (Float)`: The font size of text. Automatically scaled to img size if not provided
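
As a minimal sketch of these arguments in use, continuing the snippet above (the argument values are illustrative):

```python
import cv2

# Hide confidence labels and draw boxes with a fixed 2-pixel line width
res_plotted = res[0].plot(show_conf=False, line_width=2)
cv2.imwrite("result.jpg", res_plotted)  # save the annotated image array to disk
```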
|
96
docs/modes/track.md
Normal file
96
docs/modes/track.md
Normal file
@ -0,0 +1,96 @@
|
||||
<img width="1024" src="https://github.com/ultralytics/assets/raw/main/yolov8/banner-integrations.png">

Object tracking is a task that involves identifying the location and class of objects, then assigning a unique ID to each detection in video streams.

The output of the tracker is the same as detection, with an added object ID.

## Available Trackers

The following tracking algorithms have been implemented and can be enabled by passing `tracker=tracker_type.yaml`:

* [BoT-SORT](https://github.com/NirAharon/BoT-SORT) - `botsort.yaml`
* [ByteTrack](https://github.com/ifzhang/ByteTrack) - `bytetrack.yaml`

The default tracker is BoT-SORT.

## Tracking

Use a trained YOLOv8n/YOLOv8n-seg model to run the tracker on video streams.

!!! example ""

    === "Python"

        ```python
        from ultralytics import YOLO

        # Load a model
        model = YOLO("yolov8n.pt")  # load an official detection model
        model = YOLO("yolov8n-seg.pt")  # load an official segmentation model
        model = YOLO("path/to/best.pt")  # load a custom model

        # Track with the model
        results = model.track(source="https://youtu.be/Zgi9g1ksQHc", show=True)
        results = model.track(source="https://youtu.be/Zgi9g1ksQHc", show=True, tracker="bytetrack.yaml")
        ```

    === "CLI"

        ```bash
        yolo track model=yolov8n.pt source="https://youtu.be/Zgi9g1ksQHc"  # official detection model
        yolo track model=yolov8n-seg.pt source=...   # official segmentation model
        yolo track model=path/to/best.pt source=...  # custom model
        yolo track model=path/to/best.pt tracker="bytetrack.yaml"  # bytetrack tracker
        ```

As shown above, tracking supports both detection and segmentation models; the only thing you need to do is load the corresponding (detection or segmentation) model.

## Configuration

### Tracking

Tracking shares configuration with predict, i.e. `conf`, `iou`, and `show`. For more configuration options, please refer to the [predict page](https://docs.ultralytics.com/cfg/#prediction).

!!! example ""

    === "Python"

        ```python
        from ultralytics import YOLO

        model = YOLO("yolov8n.pt")
        results = model.track(source="https://youtu.be/Zgi9g1ksQHc", conf=0.3, iou=0.5, show=True)
        ```

    === "CLI"

        ```bash
        yolo track model=yolov8n.pt source="https://youtu.be/Zgi9g1ksQHc" conf=0.3 iou=0.5 show
        ```

### Tracker

We also support using a modified tracker config file. Just copy a config file, i.e. `custom_tracker.yaml`, from [ultralytics/tracker/cfg](https://github.com/ultralytics/ultralytics/tree/main/ultralytics/tracker/cfg) and modify any configurations (except the `tracker_type`) you need to.

!!! example ""

    === "Python"

        ```python
        from ultralytics import YOLO

        model = YOLO("yolov8n.pt")
        results = model.track(source="https://youtu.be/Zgi9g1ksQHc", tracker='custom_tracker.yaml')
        ```

    === "CLI"

        ```bash
        yolo track model=yolov8n.pt source="https://youtu.be/Zgi9g1ksQHc" tracker='custom_tracker.yaml'
        ```

Please refer to the [ultralytics/tracker/cfg](https://github.com/ultralytics/ultralytics/tree/main/ultralytics/tracker/cfg) page for the available tracker configuration files.

docs/modes/train.md (new file, +88 lines)

<img width="1024" src="https://github.com/ultralytics/assets/raw/main/yolov8/banner-integrations.png">

**Train mode** is used for training a YOLOv8 model on a custom dataset. In this mode, the model is trained using the specified dataset and hyperparameters. The training process involves optimizing the model's parameters so that it can accurately predict the classes and locations of objects in an image.

!!! tip "Tip"

    * YOLOv8 datasets like COCO, VOC, ImageNet and many others automatically download on first use, i.e. `yolo train data=coco.yaml`

## Usage Examples

Train YOLOv8n on the COCO128 dataset for 100 epochs at image size 640. See the Arguments section below for a full list of training arguments.

!!! example ""

    === "Python"

        ```python
        from ultralytics import YOLO

        # Load a model
        model = YOLO("yolov8n.yaml")  # build a new model from scratch
        model = YOLO("yolov8n.pt")  # load a pretrained model (recommended for training)

        # Train the model
        model.train(data="coco128.yaml", epochs=100, imgsz=640)
        ```

    === "CLI"

        ```bash
        yolo detect train data=coco128.yaml model=yolov8n.pt epochs=100 imgsz=640
        ```

## Arguments

Training settings for YOLO models refer to the various hyperparameters and configurations used to train the model on a dataset. These settings can affect the model's performance, speed, and accuracy. Some common YOLO training settings include the batch size, learning rate, momentum, and weight decay. Other factors that may affect the training process include the choice of optimizer, the choice of loss function, and the size and composition of the training dataset. It is important to carefully tune and experiment with these settings to achieve the best possible performance for a given task; see the example after the table below.

| Key               | Value    | Description                                                                  |
|-------------------|----------|------------------------------------------------------------------------------|
| `model`           | `None`   | path to model file, i.e. yolov8n.pt, yolov8n.yaml                            |
| `data`            | `None`   | path to data file, i.e. coco128.yaml                                         |
| `epochs`          | `100`    | number of epochs to train for                                                |
| `patience`        | `50`     | epochs to wait for no observable improvement for early stopping of training  |
| `batch`           | `16`     | number of images per batch (-1 for AutoBatch)                                |
| `imgsz`           | `640`    | size of input images as integer or w,h                                       |
| `save`            | `True`   | save train checkpoints and predict results                                   |
| `save_period`     | `-1`     | save checkpoint every x epochs (disabled if < 1)                             |
| `cache`           | `False`  | True/ram, disk or False. Use cache for data loading                          |
| `device`          | `None`   | device to run on, i.e. cuda device=0 or device=0,1,2,3 or device=cpu         |
| `workers`         | `8`      | number of worker threads for data loading (per RANK if DDP)                  |
| `project`         | `None`   | project name                                                                 |
| `name`            | `None`   | experiment name                                                              |
| `exist_ok`        | `False`  | whether to overwrite existing experiment                                     |
| `pretrained`      | `False`  | whether to use a pretrained model                                            |
| `optimizer`       | `'SGD'`  | optimizer to use, choices=['SGD', 'Adam', 'AdamW', 'RMSProp']                |
| `verbose`         | `False`  | whether to print verbose output                                              |
| `seed`            | `0`      | random seed for reproducibility                                              |
| `deterministic`   | `True`   | whether to enable deterministic mode                                         |
| `single_cls`      | `False`  | train multi-class data as single-class                                       |
| `image_weights`   | `False`  | use weighted image selection for training                                    |
| `rect`            | `False`  | support rectangular training                                                 |
| `cos_lr`          | `False`  | use cosine learning rate scheduler                                           |
| `close_mosaic`    | `10`     | disable mosaic augmentation for final 10 epochs                              |
| `resume`          | `False`  | resume training from last checkpoint                                         |
| `lr0`             | `0.01`   | initial learning rate (i.e. SGD=1E-2, Adam=1E-3)                             |
| `lrf`             | `0.01`   | final learning rate (lr0 * lrf)                                              |
| `momentum`        | `0.937`  | SGD momentum/Adam beta1                                                      |
| `weight_decay`    | `0.0005` | optimizer weight decay 5e-4                                                  |
| `warmup_epochs`   | `3.0`    | warmup epochs (fractions ok)                                                 |
| `warmup_momentum` | `0.8`    | warmup initial momentum                                                      |
| `warmup_bias_lr`  | `0.1`    | warmup initial bias lr                                                       |
| `box`             | `7.5`    | box loss gain                                                                |
| `cls`             | `0.5`    | cls loss gain (scale with pixels)                                            |
| `dfl`             | `1.5`    | dfl loss gain                                                                |
| `fl_gamma`        | `0.0`    | focal loss gamma (EfficientDet default gamma=1.5)                            |
| `label_smoothing` | `0.0`    | label smoothing (fraction)                                                   |
| `nbs`             | `64`     | nominal batch size                                                           |
| `overlap_mask`    | `True`   | masks should overlap during training (segment train only)                    |
| `mask_ratio`      | `4`      | mask downsample ratio (segment train only)                                   |
| `dropout`         | `0.0`    | use dropout regularization (classify train only)                             |
| `val`             | `True`   | validate/test during training                                                |
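
Any of these arguments can be passed to `train()` or given on the CLI as `key=value` pairs. A minimal sketch overriding a few defaults (the values are illustrative only, not recommendations):

```python
from ultralytics import YOLO

model = YOLO("yolov8n.pt")

# Smaller batch, cosine LR schedule and a fixed seed for reproducibility
model.train(data="coco128.yaml", epochs=50, batch=8, cos_lr=True, seed=0)
```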

docs/modes/val.md (new file, +86 lines)

<img width="1024" src="https://github.com/ultralytics/assets/raw/main/yolov8/banner-integrations.png">

**Val mode** is used for validating a YOLOv8 model after it has been trained. In this mode, the model is evaluated on a validation set to measure its accuracy and generalization performance. This mode can be used to tune the hyperparameters of the model to improve its performance.

!!! tip "Tip"

    * YOLOv8 models automatically remember their training settings, so you can validate a model at the same image size and on the original dataset easily with just `yolo val model=yolov8n.pt` or `model('yolov8n.pt').val()`

## Usage Examples

Validate trained YOLOv8n model accuracy on the COCO128 dataset. No arguments need to be passed, as the `model` retains its training `data` and arguments as model attributes. See the Arguments section below for a full list of validation arguments.

!!! example ""

    === "Python"

        ```python
        from ultralytics import YOLO

        # Load a model
        model = YOLO("yolov8n.pt")  # load an official model
        model = YOLO("path/to/best.pt")  # load a custom model

        # Validate the model
        metrics = model.val()  # no arguments needed, dataset and settings remembered
        metrics.box.map    # mAP50-95
        metrics.box.map50  # mAP50
        metrics.box.map75  # mAP75
        metrics.box.maps   # a list containing mAP50-95 for each category
        ```

    === "CLI"

        ```bash
        yolo detect val model=yolov8n.pt  # val official model
        yolo detect val model=path/to/best.pt  # val custom model
        ```

## Arguments

Validation settings for YOLO models refer to the various hyperparameters and configurations used to evaluate the model's performance on a validation dataset. These settings can affect the model's performance, speed, and accuracy. Some common YOLO validation settings include the batch size, the frequency with which validation is performed during training, and the metrics used to evaluate the model's performance. Other factors that may affect the validation process include the size and composition of the validation dataset and the specific task the model is being used for. It is important to carefully tune and experiment with these settings to ensure that the model is performing well on the validation dataset and to detect and prevent overfitting; see the example after the table below.

| Key           | Value   | Description                                                        |
|---------------|---------|--------------------------------------------------------------------|
| `data`        | `None`  | path to data file, i.e. coco128.yaml                               |
| `imgsz`       | `640`   | image size as scalar or (h, w) list, i.e. (640, 480)               |
| `batch`       | `16`    | number of images per batch (-1 for AutoBatch)                      |
| `save_json`   | `False` | save results to JSON file                                          |
| `save_hybrid` | `False` | save hybrid version of labels (labels + additional predictions)    |
| `conf`        | `0.001` | object confidence threshold for detection                          |
| `iou`         | `0.6`   | intersection over union (IoU) threshold for NMS                    |
| `max_det`     | `300`   | maximum number of detections per image                             |
| `half`        | `True`  | use half precision (FP16)                                          |
| `device`      | `None`  | device to run on, i.e. cuda device=0/1/2/3 or device=cpu           |
| `dnn`         | `False` | use OpenCV DNN for ONNX inference                                  |
| `plots`       | `False` | show plots during training                                         |
| `rect`        | `False` | support rectangular evaluation                                     |
| `split`       | `val`   | dataset split to use for validation, i.e. 'val', 'test' or 'train' |
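
These settings can likewise be overridden at validation time. A minimal sketch (the threshold values are illustrative only):

```python
from ultralytics import YOLO

model = YOLO("yolov8n.pt")

# Validate on the test split with custom thresholds and FP16 disabled
metrics = model.val(split='test', conf=0.25, iou=0.7, half=False)
```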

## Export Formats

Available YOLOv8 export formats are in the table below. You can export to any format using the `format` argument, i.e. `format='onnx'` or `format='engine'`.

| Format                                                             | `format` Argument | Model                     | Metadata |
|--------------------------------------------------------------------|-------------------|---------------------------|----------|
| [PyTorch](https://pytorch.org/)                                    | -                 | `yolov8n.pt`              | ✅        |
| [TorchScript](https://pytorch.org/docs/stable/jit.html)            | `torchscript`     | `yolov8n.torchscript`     | ✅        |
| [ONNX](https://onnx.ai/)                                           | `onnx`            | `yolov8n.onnx`            | ✅        |
| [OpenVINO](https://docs.openvino.ai/latest/index.html)             | `openvino`        | `yolov8n_openvino_model/` | ✅        |
| [TensorRT](https://developer.nvidia.com/tensorrt)                  | `engine`          | `yolov8n.engine`          | ✅        |
| [CoreML](https://github.com/apple/coremltools)                     | `coreml`          | `yolov8n.mlmodel`         | ✅        |
| [TF SavedModel](https://www.tensorflow.org/guide/saved_model)      | `saved_model`     | `yolov8n_saved_model/`    | ✅        |
| [TF GraphDef](https://www.tensorflow.org/api_docs/python/tf/Graph) | `pb`              | `yolov8n.pb`              | ❌        |
| [TF Lite](https://www.tensorflow.org/lite)                         | `tflite`          | `yolov8n.tflite`          | ✅        |
| [TF Edge TPU](https://coral.ai/docs/edgetpu/models-intro/)         | `edgetpu`         | `yolov8n_edgetpu.tflite`  | ✅        |
| [TF.js](https://www.tensorflow.org/js)                             | `tfjs`            | `yolov8n_web_model/`      | ✅        |
| [PaddlePaddle](https://github.com/PaddlePaddle)                    | `paddle`          | `yolov8n_paddle_model/`   | ✅        |