Custom video data support #1722 & #1721 #1996

Open · wants to merge 17 commits into main
2 changes: 1 addition & 1 deletion docs/source/markdown/guides/how_to/data/custom_data.md
@@ -1,4 +1,4 @@
# Custom Data
# Custom Data (Images)

This tutorial will show you how to train anomalib models on your custom
data. More specifically, we will show you how to use the [Folder](../../reference/data/image/folder.md)
347 changes: 347 additions & 0 deletions docs/source/markdown/guides/how_to/data/custom_data_videos.md
@@ -0,0 +1,347 @@
# Custom Data (Videos)

This tutorial will show you how to train anomalib models on your custom
data. More specifically, we will show you how to use the [FolderVideo](../../reference/data/video/folder_video.md)
dataset to train anomalib models on your custom data.

```{warning}
This tutorial assumes that you have already installed anomalib.
If not, please refer to the installation section.
```

```{note}
We will use the three datasets in our [MicroDatasets](https://github.com/openvinotoolkit/anomalib/releases/download/microvideodatasets/microdatasets.zip) archive to show the folder structure expected by the [FolderVideo](../../reference/data/video/folder_video.md)
dataset, but you can use any dataset you want.
```

```{note}
The folder structure should be:

- path/to/dataset/ <-- root
  - folder_containing_normal_videos/ <-- normal_dir
  - folder_containing_abnormal_videos/ <-- test_dir
  - folder_containing_masks_for_abnormal_videos/ <-- mask_dir

If the videos are stored frame by frame, every video should be inside a separate folder in test_dir:

- test_dir/
  - vid_00/
    - frame_00.png
    - frame_01.png
    - frame_02.png
    - ...
  - vid_01/
    - frame_00.png
    - frame_01.png
    - frame_02.png
    - ...
```

We will split this section into two tasks: Classification and Segmentation.

## Classification Dataset

In certain use cases, ground-truth masks for the abnormal frames of a video may not be
available. In such cases, we could use the classification task to train a model
that will be able to detect the abnormal frames in the test set.

We will further split this section into two cases:

- Classification with normal and abnormal videos.
- Classification with only normal videos.

### With Normal and Abnormal Videos

We could use the [FolderVideo](../../reference/data/video/folder_video.md) datamodule to train
a model on this dataset. We could run the following Python code to create the
custom datamodule:

:::::{dropdown} Code Syntax
:icon: code

::::{tab-set}
:::{tab-item} API
:sync: label-1

```{literalinclude} ../../../../snippets/data/video/folder_video/classification/default.txt
:language: python
```
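
For illustration, a minimal sketch of such a snippet might look like the following. The import path follows the module reference added in this PR, and the directory arguments mirror the folder roles described at the top of this guide; treat the exact argument names as assumptions rather than the final API.

```python
# A minimal sketch, not the final API: FolderVideo is assumed to live in
# anomalib.data.video.folder_video, and the directory arguments mirror the
# folder roles (root, normal_dir, test_dir) described earlier.
from anomalib import TaskType
from anomalib.data.video.folder_video import FolderVideo

datamodule = FolderVideo(
    root="./datasets/microdataset",  # dataset root
    normal_dir="normal",             # folder containing normal videos
    test_dir="abnormal",             # folder containing abnormal videos
    task=TaskType.CLASSIFICATION,    # equivalent to task="classification"
)
datamodule.setup()  # create the train/val/test/predict splits
```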

```{note}
As can be seen above, we only need to set the ``task`` argument to ``classification``. We could have also used ``TaskType.CLASSIFICATION`` instead of ``classification``.
```

The [FolderVideo](../../reference/data/video/folder_video.md) datamodule will create training, validation, test and
prediction datasets and dataloaders for us. We can access the datasets
and dataloaders by following the same approach as in the segmentation
task.

When we check the samples from the dataloaders, we will see that the
`mask` key is not present in the samples. This is because we do not need
the masks for the classification task.

```{literalinclude} ../../../../snippets/data/video/folder_video/classification/dataloader_values.txt
:language: python
```
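
As a quick illustration, inspecting a batch might look like the sketch below; the exact set of keys depends on the dataset implementation.

```python
# A sketch of inspecting one training batch. For the classification task,
# the batch contains no "mask" entry.
batch = next(iter(datamodule.train_dataloader()))
print(batch.keys())          # no "mask" key for classification
print(batch["image"].shape)  # (batch size, frames per clip, channels, H, W)
```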

Training the model is as simple as running the following command:

```{literalinclude} ../../../../snippets/train/api/classification/model_and_engine_video.txt
:language: python
```

where we train an AiVad model on this custom dataset with default model parameters.
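
For reference, a minimal sketch of that training code might look like this:

```python
# A minimal training sketch: AiVad with default parameters, and the task
# passed explicitly because Engine defaults to SEGMENTATION.
from anomalib import TaskType
from anomalib.engine import Engine
from anomalib.models import AiVad

model = AiVad()
engine = Engine(task=TaskType.CLASSIFICATION)
engine.fit(model=model, datamodule=datamodule)
```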

:::

:::{tab-item} CLI
:sync: label-2

Here is the dataset config to create the same custom datamodule:

```{literalinclude} ../../../../snippets/config/data/video/folder/classification/cli/default.yaml
:language: yaml
```

Assume that we have saved the above config file as `classification.yaml`. We could run the following CLI command to train an AiVad model on the above dataset:

```bash
anomalib train --data classification.yaml --model anomalib.models.AiVad --task CLASSIFICATION
```

```{note}
As can be seen above, we also need to set the ``task`` argument to ``CLASSIFICATION`` to explicitly tell anomalib that we want to train a classification model. This is because the default `task` is `SEGMENTATION` within `Engine`.
```

:::
::::
:::::

### With Only Normal Videos

There are certain cases where we only have normal videos in our dataset but
would like to train a classification model.

This could be done in two ways:

- Train the model and skip the validation and test steps, as we do not have
  abnormal videos to validate and test the model on.
- Use the synthetic anomaly generation feature to create abnormal videos/masks from
  normal videos, and perform the validation and test steps.

For now we will focus on the second approach.

#### With Validation and Testing via Synthetic Anomalies

If we want to check the performance of the model, we will need to have abnormal
videos/masks to validate and test the model on. During the validation stage,
these anomalous videos are used to normalize the anomaly scores and find the
best threshold that separates normal and abnormal videos.

Anomalib provides synthetic anomaly generation capabilities to create abnormal
frames from normal frames so we could check the performance. We could use the
[FolderVideo](../../reference/data/video/folder_video.md) datamodule to train a model on
this dataset.

:::::{dropdown} Code Syntax
:icon: code

::::{tab-set}
:::{tab-item} API
:sync: label-1

```{literalinclude} ../../../../snippets/data/video/folder_video/classification/normal_and_synthetic.txt
:language: python
```
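
A minimal sketch of this setup might look like the following; `TestSplitMode` is anomalib's standard enum for this option, while the `FolderVideo` argument names remain assumptions.

```python
# A sketch: only normal videos are provided, and synthetic anomalies are
# generated for the test split (argument names are assumptions).
from anomalib.data.utils import TestSplitMode
from anomalib.data.video.folder_video import FolderVideo

datamodule = FolderVideo(
    root="./datasets/microdataset",
    normal_dir="normal",                      # only normal videos available
    task="classification",
    test_split_mode=TestSplitMode.SYNTHETIC,  # create synthetic abnormal data
)
```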

Once the datamodule is set up, the rest of the process is the same as in the previous classification example.

```{literalinclude} ../../../../snippets/train/api/classification/model_and_engine_video.txt
:language: python
```

where we train an AiVad model on this custom dataset with default model parameters.

:::

:::{tab-item} CLI
:sync: label-2

Here is the dataset config to create the same custom datamodule with only normal videos. We only need to change the `test_split_mode` argument to `SYNTHETIC` to generate synthetic anomalies.

```{literalinclude} ../../../../snippets/config/data/video/folder/classification/cli/normal_and_synthetic.yaml
:language: yaml
```

Assume that we have saved the above config file as `normal.yaml`. We could run the following CLI command to train an AiVad model on the above dataset:

```bash
anomalib train --data normal.yaml --model anomalib.models.AiVad --task CLASSIFICATION
```

```{note}
As in the previous classification example, we again need to set the ``task`` argument to ``CLASSIFICATION`` to explicitly tell anomalib that we want to train a classification model. This is because the default `task` is `SEGMENTATION` within `Engine`.
```

:::
::::
:::::

## Segmentation Dataset

Assume that we have a dataset in which the training set contains only
normal videos, and the test set contains both normal and abnormal
videos. We also have masks for the abnormal video frames in the test set. We
want to train an anomaly segmentation model that will be able to detect the
abnormal regions in the test set.

### With Normal and Abnormal Videos

We could use the [FolderVideo](../../reference/data/video/folder_video.md) datamodule to load the MicroDatasets videos in a format that is readable by anomalib's models.

:::::{dropdown} Code Syntax
::::{tab-set}
:::{tab-item} API
:sync: label-1

We could run the following Python code to create the custom datamodule:

```{literalinclude} ../../../../snippets/data/video/folder_video/segmentation/default.txt
:language: python
```
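
For illustration, a minimal sketch of the segmentation setup might look like this; the argument names mirror the folder roles described at the top of this guide and are assumptions.

```python
# A minimal segmentation sketch: masks for the abnormal test videos are
# supplied through mask_dir (argument names are assumptions).
from anomalib import TaskType
from anomalib.data.video.folder_video import FolderVideo

datamodule = FolderVideo(
    root="./datasets/microdataset",
    normal_dir="normal",
    test_dir="abnormal",
    mask_dir="masks",            # ground-truth masks for abnormal videos
    task=TaskType.SEGMENTATION,
)
datamodule.setup()
```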

The [FolderVideo](../../reference/data/video/folder_video.md) datamodule will create training, validation, test and
prediction datasets and dataloaders for us. We can access the datasets
and dataloaders using the following attributes:

```{literalinclude} ../../../../snippets/data/video/folder_video/segmentation/datamodule_attributes.txt
:language: python
```
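
These follow the standard Lightning-style datamodule interface, for example:

```python
# A sketch of the standard Lightning-style accessors on the datamodule.
train_loader = datamodule.train_dataloader()
val_loader = datamodule.val_dataloader()
test_loader = datamodule.test_dataloader()
predict_loader = datamodule.predict_dataloader()
```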

To check what individual samples from dataloaders look like, we can run
the following command:

```{literalinclude} ../../../../snippets/data/video/folder_video/segmentation/dataloader_values.txt
:language: python
```
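
For example, pulling one batch from each dataloader might look like:

```python
# A sketch of pulling one batch from the train and test dataloaders.
train_data = next(iter(datamodule.train_dataloader()))
test_data = next(iter(datamodule.test_dataloader()))
print(train_data.keys())  # includes "image", and "mask" for segmentation
```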

We could check the shape of the videos and masks using the following
commands:

```python
# image is a clip: (batch size, frames per clip, channels, height, width)
print(train_data["image"].shape)
# torch.Size([2, 2, 3, 360, 640])

# mask: (batch size, height, width)
print(train_data["mask"].shape)
# torch.Size([2, 360, 640])
```

Training the model is as simple as running the following command:

```{literalinclude} ../../../../snippets/train/api/segmentation/model_and_engine_video.txt
:language: python
```

where we train an AiVad model on this custom dataset with default model parameters.
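
A minimal sketch of training and then evaluating the model might look like this:

```python
# A minimal sketch: Engine's task defaults to SEGMENTATION, so no task
# argument is needed here.
from anomalib.engine import Engine
from anomalib.models import AiVad

model = AiVad()
engine = Engine()
engine.fit(model=model, datamodule=datamodule)
engine.test(model=model, datamodule=datamodule)
```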
:::

:::{tab-item} CLI
:sync: label-2
Here is the dataset config to create the same custom datamodule:

```{literalinclude} ../../../../snippets/config/data/video/folder/segmentation/cli/default.yaml
:language: yaml
```

Assume that we have saved the above config file as `segmentation.yaml`.
We could run the following CLI command to train an AiVad model on the above dataset:

```bash
anomalib train --data segmentation.yaml --model anomalib.models.AiVad
```

:::

::::
:::::

This example demonstrates how to create a segmentation dataset with
normal and abnormal videos. We could expand this example to create a
segmentation dataset with only normal videos.

### With Only Normal Videos

There are certain cases where we only have normal videos in our dataset
but would like to train a segmentation model. This could be done in two ways:

- Train the model and skip the validation and test steps, as
we do not have abnormal videos to validate and test the model on, or
- Use the synthetic anomaly generation feature to create abnormal
videos from normal videos, and perform the validation and test steps.

For now we will focus on the second approach.

#### With Validation and Testing via Synthetic Anomalies

We could use the synthetic anomaly generation feature again to create abnormal
videos/frames from normal videos. We could then use the
[FolderVideo](../../reference/data/video/folder_video.md) datamodule to train a model on
this dataset. Here is the python code to create the custom datamodule:

:::::{dropdown} Code Syntax
:icon: code

::::{tab-set}
:::{tab-item} API
:sync: label-1

We could run the following python code to create the custom datamodule:

```{literalinclude} ../../../../snippets/data/video/folder_video/segmentation/normal_and_synthetic.txt
:language: python
```

As can be seen from the code above, we only need to set the
`test_split_mode` argument to `SYNTHETIC`. The [FolderVideo](../../reference/data/video/folder_video.md) datamodule will create training, validation, test and prediction datasets and
dataloaders for us.
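
A minimal sketch of the change relative to the earlier segmentation example:

```python
# A sketch: identical to the earlier segmentation setup except for
# test_split_mode (argument names are assumptions).
from anomalib.data.utils import TestSplitMode
from anomalib.data.video.folder_video import FolderVideo

datamodule = FolderVideo(
    root="./datasets/microdataset",
    normal_dir="normal",                      # only normal videos available
    task="segmentation",
    test_split_mode=TestSplitMode.SYNTHETIC,
)
```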

To check what individual samples from dataloaders look like, we can run
the following command:

```{literalinclude} ../../../../snippets/data/video/folder_video/segmentation/dataloader_values.txt
:language: python
```

We could check the shape of the videos and masks using the following
commands:

```python
print(train_data["image"].shape)
# torch.Size([2, 2, 3, 360, 640])

print(train_data["mask"].shape)
# torch.Size([2, 360, 640])
```

:::

:::{tab-item} CLI
:sync: label-2

Here is the dataset config to create the same custom datamodule with only normal
videos. We only need to change the `test_split_mode` argument to `SYNTHETIC` to
generate synthetic anomalies.

```{literalinclude} ../../../../snippets/config/data/video/folder/segmentation/cli/normal_and_synthetic.yaml
:language: yaml
```

Assume that we have saved the above config file as `synthetic.yaml`. We could
run the following CLI command to train an AiVad model on the above dataset:

```bash
anomalib train --data synthetic.yaml --model anomalib.models.AiVad
```

:::
::::
:::::
12 changes: 10 additions & 2 deletions docs/source/markdown/guides/how_to/data/index.md
@@ -6,11 +6,18 @@ This section contains tutorials on how to fully utilize the data components of a
:margin: 1 1 0 0
:gutter: 1

:::{grid-item-card} {octicon}`database` Train on Custom Data.
:::{grid-item-card} {octicon}`database` Train on Custom Data (Images).
:link: ./custom_data
:link-type: doc

Learn more about how to use `Folder` dataset to train anomalib models on your custom data.
Learn more about how to use the `Folder` dataset to train anomalib models on your custom data (images).
:::

:::{grid-item-card} {octicon}`database` Train on Custom Data (Videos).
:link: ./custom_data_videos
:link-type: doc

Learn more about how to use the `FolderVideo` dataset to train anomalib models on your custom data (videos).
:::

:::{grid-item-card} {octicon}`table` Input tiling
@@ -27,5 +34,6 @@ Learn more about how to use the tiler for input tiling.
:hidden:

./custom_data
./custom_data_videos
./input_tiling
```
@@ -0,0 +1,7 @@
# FolderVideo Data

```{eval-rst}
.. automodule:: anomalib.data.video.folder_video
:members:
:show-inheritance:
```