YOLOV8 port to keras-hub #1899

oarriaga · 2024-10-01T18:44:49Z

This PR ports YOLOV8 from keras-cv to keras-hub (#176). All necessary YOLOV8 functions are now found inside keras-hub:

Add CIOU loss.
Add missing masking functionality in the bounding_boxes module.
Add multibackend non maximum supression layer.
Add label encoder.
Build basic abstract object detector task class.
Add YOLOV8 backbone and detector.

Missing steps include:

Upload previous presets to Kaggle.
Remove skipping tests with presets.
Add colab with basic functionality.
Add weight transfer script from keras-cv to keras-hub.
Add training script.

divyashreepathihalli

Thanks for the PR @oarriaga left some initial comments

keras_hub/src/layers/modeling/non_max_suppression.py

divyashreepathihalli · 2024-10-02T04:29:30Z

keras_hub/src/models/yolo_v8/yolo_v8_detector.py

+        label_encoder=None,
+        prediction_decoder=None,
+        **kwargs,
+    ):


restructure the code to define all teh layers first, functional model next and config last with these comments
=== Layers ===
.
.
=== Functional model ===
.
.
=== Config ===
.
.
example : https://github.com/keras-team/keras-hub/blob/master/keras_hub/src/models/bert/bert_backbone.py#L92

Hi Divya, the current model applies multiple blocks of layers. That will imply that we would need to initialize many layers in the constructor. Moreover, the connections between those layers are not so straightforward as in Bert. What would you suggest? Shall we still initialize all layers in the layer block and connect them in the functional block?

divyashreepathihalli · 2024-10-02T04:30:34Z

keras_hub/src/models/yolo_v8/yolo_v8_detector.py

+    def predict_step(self, *args):
+        outputs = super().predict_step(*args)
+        if isinstance(outputs, tuple):
+            return self.decode_predictions(outputs[0], args[-1]), outputs[1]


will model.fit work ?

divyashreepathihalli · 2024-10-02T04:32:12Z

keras_hub/src/models/yolo_v8/yolo_v8_detector_test.py

@@ -0,0 +1,318 @@
+import os
+


add generic task test - self.run_task_test - example : https://github.com/keras-team/keras-hub/blob/master/keras_hub/src/models/bert/bert_text_classifier_test.py#L41

you will need to add preprocessor flow - example follow resnet - https://github.com/keras-team/keras-hub/tree/master/keras_hub/src/models/resnet

divyashreepathihalli · 2024-10-02T04:33:34Z

keras_hub/src/models/yolo_v8/yolo_v8_detector_test.py

+
+    @pytest.mark.large  # Saving is slow, so mark these large.
+    def test_saved_model(self):
+        model = keras_hub.models.YOLOV8Detector(


use self.run_model_saving_test - example : https://github.com/keras-team/keras-hub/blob/master/keras_hub/src/models/resnet/resnet_image_classifier_test.py#L62C9-L62C35

divyashreepathihalli · 2024-10-02T04:34:04Z

keras_hub/src/models/yolo_v8/yolo_v8_detector_test.py

+
+    # TODO(tirthasheshpatel): Support updating prediction decoder in Keras Core.
+    @pytest.mark.skip(reason="Missing presets")
+    @pytest.mark.tf_keras_only


no tf_keras_only in KerasHub

…only implementation

fchollet

Thanks for the updates! What's the current progress of the PR? Is it nearly ready to merge?

fchollet · 2024-11-08T13:43:02Z

keras_hub/src/bounding_box/mask_invalid_detections.py

+        raise ValueError(
+            "`bounding_box.mask_invalid_detections()` requires inputs to be "
+            "Dense tensors. Please call "
+            "`bounding_box.to_dense(bounding_boxes)` before passing your boxes "


Where is to_dense actually located?

In keras-cv to_dense was located outside of the training loop when building a tf.data pipeline.

the bounding box code is now moved to keras repo

fchollet · 2024-11-08T13:43:55Z

keras_hub/src/models/yolo_v8/yolo_v8_detector.py

+
+    Example:
+    ```python
+    images = tf.ones(shape=(1, 512, 512, 3))


Please make sure code examples don't have any TF references; use keras.ops or np and the like.

fchollet · 2024-11-08T13:44:12Z

keras_hub/src/models/yolo_v8/yolo_v8_detector.py

+        classification_loss='binary_crossentropy',
+        box_loss='ciou',
+        optimizer=tf.optimizers.SGD(global_clipnorm=10.0),
+        jit_compile=False,


No compilation support?

One can set the flag to True; however, the latest training runs with keras-cv were not converging when padded boxes were added. I am going through the loss function to find out what could be the issue.

does it work with compilation now?

fchollet · 2024-11-08T13:44:34Z

keras_hub/src/models/yolo_v8/yolo_v8_label_encoder.py

+    """
+    Encodes ground truth boxes to target boxes and class labels for training a
+    YOLOV8 model. This is an implementation of the Task-aligned sample
+    assignment scheme proposed in https://arxiv.org/abs/2108.07755.


Please use markdown-formatted links.

fchollet · 2024-11-08T13:45:13Z

keras_hub/src/models/yolo_v8/yolo_v8_label_encoder.py

+        """Computes target boxes and classes for anchors.
+
+        Args:
+            scores: a Float Tensor of shape (batch_size, num_anchors,


Please use backticks around code keywords, such as shape tuples. All docstrings are rendered as markdown.

fchollet · 2024-11-08T13:45:51Z

keras_hub/src/models/yolo_v8/yolo_v8_label_encoder.py

@@ -0,0 +1,257 @@
+import keras
+import tensorflow as tf


Long-term, we cannot depend on TF, please remove this import

fchollet · 2024-11-08T13:47:06Z

keras_hub/src/models/yolo_v8/yolo_v8_label_encoder.py

+                    truth box. Anchors that didn't match with a ground truth
+                    box should be excluded from both class and box losses.
+        """
+        if isinstance(gt_bboxes, tf.RaggedTensor):


To avoid importing TF you can use

def is_tensorflow_ragged(value): if hasattr(value, "__class__"): return ( value.__class__.__name__ == "RaggedTensor" and "tensorflow.python." in str(value.__class__.__module__) ) return False

oarriaga · 2024-11-08T14:28:16Z

Hi, thank you! I think it’s nearly ready. Right now, I’m validating the model’s expected convergence with PASCAL, which has turned out to be more challenging than anticipated. After that, the only remaining step will be to go through your comments and incorporate any additional input from Divya.

…s presets

…der and preprocessor

divyashreepathihalli · 2024-11-25T21:54:51Z

@oarriaga can you please add a demo notebook to the PR to verify outputs. What is the inference time of this implementation versus the original implementation?

divyashreepathihalli · 2024-10-23T23:58:37Z

keras_hub/src/models/yolo_v8/non_max_suppression.py

+            boxes, iou_threshold, output_size, tile_arg, tile_size
+        )
+
+    selected_boxes, _, output_size, _ = ops.while_loop(


can this be vectorized?

this is a potential performance bottleneck. This part of the code needs to be vectorized.

This works with all backends and I used the same one for retinanet but for now if we can include as this is detachable from model easily and not involved in trained and only while predictions.

We have to check if its bottleneck compared to torch and then we can make some changes as they use loop based approach rather than ops.while_loop.

We have to check the model convertions to onxx and then to tensorrt as well because later in the progression of model those are the important aspects we may have to look into.

Reference: https://github.com/pytorch/vision/blob/acbfd8d94d10f989f4540252e92e8855c19f7ff7/torchvision/models/detection/retinanet.py#L518

Hi @divyashreepathihalli from what I understand NMS seems to be programmed sequentially because of the necessity of carrying a state of unprocessed boxes. From what I have seen, NMS seems to be usually implemented using a while / for loop since boxes are greedily chosen and removed. I don't know if vectorizing this would decrease the computation time given the sequential nature of NMS. Moreover, pytorch's NMS implementation is done in c++ using a double for loop across boxes. Do let me know how you would like me to proceed.

divyashreepathihalli · 2024-11-25T21:55:36Z

keras_hub/src/bounding_box/mask_invalid_detections.py

+        raise ValueError(
+            "`bounding_box.mask_invalid_detections()` requires inputs to be "
+            "Dense tensors. Please call "
+            "`bounding_box.to_dense(bounding_boxes)` before passing your boxes "


the bounding box code is now moved to keras repo

divyashreepathihalli · 2024-11-25T21:59:10Z

keras_hub/src/models/yolo_v8/yolo_v8_detector.py

+        classification_loss='binary_crossentropy',
+        box_loss='ciou',
+        optimizer=tf.optimizers.SGD(global_clipnorm=10.0),
+        jit_compile=False,


does it work with compilation now?

divyashreepathihalli · 2024-11-25T22:01:16Z

keras_hub/src/models/yolo_v8/yolo_v8_detector_test.py

+        return xs, {"boxes": ys, "classes": y_classes}
+
+
+class YOLOV8DetectorTest(TestCase):


add generic task test - self.run_task_test

divyashreepathihalli · 2024-11-25T22:06:28Z

keras_hub/src/bounding_box/mask_invalid_detections.py

+            boxes.
+    Returns:
+        bounding boxes with proper masking of the boxes according to
+        `num_detections`. This allows proper interop with non-max supression.


NIT: suppression

divyashreepathihalli · 2024-11-25T22:06:55Z

keras_hub/src/models/yolo_v8/non_max_suppression_test.py

+from keras_hub.src.tests.test_case import TestCase
+
+
+class NonMaxSupressionTest(TestCase):


NIT : Suppression

divyashreepathihalli · 2024-11-25T22:07:21Z

keras_hub/src/models/yolo_v8/non_max_suppression.py

+
+    Returns:
+      iou_suppressed: a tensor of shape [batch_size, num_boxes_with_padding].
+      iou_diff: a scalar tensor representing whether any box is supressed in


suppressed update spelling everywhere

divyashreepathihalli · 2024-11-25T22:09:31Z

keras_hub/src/models/yolo_v8/yolo_v8_label_encoder.py

+            alignment score of an anchor box. This is the beta parameter in
+            equation 9 of https://arxiv.org/pdf/2108.07755.pdf.
+        epsilon: float, a small number used for numerical stability in division
+            (to avoid diving by zero), and used as a threshold to eliminate very


dividing by zero

divyashreepathihalli · 2024-11-25T22:16:45Z

keras_hub/src/models/yolo_v8/ciou_loss.py

+        bounding_box_format: a case-insensitive string (for example, "xyxy").
+            Each bounding box is defined by these 4 values. For detailed
+            information on the supported formats, see the [KerasCV bounding box
+            documentation](https://keras.io/api/keras_cv/bounding_box/formats/).


add link to keras instead

Hi @divyashreepathihalli I was unable to find in the keras documentation the available box formats. I am linking to the keras source code. We can update this once the documentation is available.

divyashreepathihalli · 2024-11-25T22:19:24Z

keras_hub/src/models/yolo_v8/ciou_loss.py

+        low=0,
+        high=10)
+    loss = keras_hub.src.models.yolo_v8.ciou_loss.CIoULoss("xyxy")
+    loss(y_true, y_pred).numpy()


the y_true and y_pred have different shapes - would this not result in a n error?

divyashreepathihalli · 2024-11-25T22:20:43Z

keras_hub/src/models/yolo_v8/ciou_loss.py

+        self.bounding_box_format = bounding_box_format
+
+    def call(self, y_true, y_pred):
+        y_pred = ops.convert_to_tensor(y_pred)


checking if y_pred is a tensor and the dtype before converting could improve efficiency

divyashreepathihalli · 2024-11-25T22:21:09Z

keras_hub/src/models/yolo_v8/ciou_loss.py

+                f"y_true={y_true.shape[-2]} and number of boxes in "
+                f"y_pred={y_pred.shape[-2]}."
+            )
+


raise error for unsupported bbox format

divyashreepathihalli · 2024-11-25T22:22:58Z

keras_hub/src/models/yolo_v8/ciou_loss_test.py

+            [4, 5, 5, 6],
+            [2, 1, 3, 3],
+        ]
+        expected_loss = 1.03202


how was this value calculated?

This is the original value provided by the KerasCV tests.

divyashreepathihalli · 2024-11-25T22:26:49Z

keras_hub/src/models/yolo_v8/non_max_suppression.py

+            boxes, iou_threshold, output_size, tile_arg, tile_size
+        )
+
+    selected_boxes, _, output_size, _ = ops.while_loop(


this is a potential performance bottleneck. This part of the code needs to be vectorized.

divyashreepathihalli · 2024-11-25T22:28:15Z

keras_hub/src/models/yolo_v8/non_max_suppression.py

+    Ported from https://github.com/tensorflow/tensorflow/blob/v2.12.0/tensorflow/python/ops/image_ops_impl.py#L5368-L5458
+
+    Args:
+      boxes: a tensor of rank 2 or higher with a shape of [..., num_boxes, 4].


shouldn't the shape always be [batch_size, num_boxes, 4]?

divyashreepathihalli · 2024-11-25T22:31:34Z

keras_hub/src/models/yolo_v8/non_max_suppression.py

+    boxes = ops.pad(ops.cast(boxes, "float32"), [[0, 0], [0, pad], [0, 0]])
+    scores = ops.pad(ops.cast(scores, "float32"), [[0, 0], [0, pad]])
+    num_boxes_after_padding = num_boxes + pad
+    num_iterations = num_boxes_after_padding // tile_size


add a check here is verify the num_boxes_after_padding does not exceed max_output_size +pad

divyashreepathihalli · 2024-11-26T00:44:36Z

keras_hub/src/models/yolo_v8/yolo_v8_backbone.py

+    return x
+
+
+def build_block(x, block_arg, channels, depth, block_depth, activation):


the function names here are not readable as we have one build_block and another build_blocks. maybe rename this to yolo_block or whatever you think is suitable and rename build_blocks to stackwise_blocks
Also build might lead to more confusion.

divyashreepathihalli · 2024-11-26T01:14:42Z

keras_hub/src/models/yolo_v8/yolo_v8_detector.py

+        label_encoder: Optional. A `YOLOV8LabelEncoder` that is
+            responsible for transforming input boxes into trainable labels for
+            YOLOV8Detector. If not provided, a default is provided.
+        prediction_decoder: Optional. A `keras.layers.Layer` that is


document default values in docstring. here and everywhere.

divyashreepathihalli · 2024-11-26T01:16:35Z

keras_hub/src/models/yolo_v8/yolo_v8_label_encoder.py

+        # Only anchors which are inside of relevant GT boxes are considered
+        # for assignment.
+        # This is a boolean tensor of shape (B, num_gt_boxes, num_anchors)
+        matching_anchors_in_gt_boxes = is_anchor_center_within_box(


rename all gt_ to ground_truth

divyashreepathihalli · 2024-11-26T01:19:44Z

keras_hub/src/bounding_box/mask_invalid_detections_test.py

@@ -0,0 +1,80 @@
+import numpy as np
+import pytest
+import tensorflow as tf


no tf code in Keras_hub

oarriaga added 8 commits October 1, 2024 19:43

Add regression loss for object detectors

80e4589

Add missing mask function for invalid detections

d155f28

Add multibackend non maximum supression layer

5388655

Add abstract object detector task class

c8ffdd1

Add YOLOV8 backbone and detector with keras-hub only imports

0c32ef7

Add previous backbone and detector presets as template

f1be3d5

Update API with new functions for YOLOV8

6d2df25

Add new API modules for YOLOV8 following previous keras-cv structure

13ca589

divyashreepathihalli reviewed Oct 2, 2024

View reviewed changes

oarriaga added 17 commits October 2, 2024 09:26

Move NMS layer to model directory

9756681

Remove backend gating for non-max supression layer and use keras ops …

3bb7ec7

…only implementation

Add missing args to docstrings

ac3ab62

Remove unnecessary linter exception

27b03c5

Add better docstrings of internal nms functions arguments

524a280

Rename single idx variables to more readable name

96ea865

Add generic layer test to not trainable NMS layer

1f46bbd

Move CIOU loss inside YOLOV8 model directory

0883fcf

Add changes from automatic refactorer

1d0ff43

Fix docstring for new loss location

79e433f

Rename YOLOV8 object detector model

d6e264f

Change docstring to default argument type naming

3a16688

Add standard keras-hub model build separation

36091a4

Add convertion script for YOLOV8 backbone models

aa8db7a

Update conversion script to work for all YOLOV8 backbones

f083c83

Merge branch 'keras-team:master' into master

00f335a

Refactor layers and models

64a58be

fchollet reviewed Nov 8, 2024

View reviewed changes

oarriaga added 2 commits November 9, 2024 10:38

Change docstring to include only keras ops

05af893

Remove tensorflow import to check for ragged tensors

67097b1

oarriaga added 16 commits November 15, 2024 09:14

Remove image rescaling from backbone

832d850

Add preprocessor field and change shape to use backticks

184211d

Fix import name for decode and encode functions

db7e123

Add default YOLOV8 image preprocessor

853ae2b

Add default YOLOV8 object detector preprocessor

4341d05

Extend checkpoint conversion to include backbones and object detector…

7e5e28d

…s presets

Change base class to ImageObjectDetector class

ac4519d

Remove object detector base class

5a9f327

Remove unnecessary comments

499a263

Remove from API previous base ObjectDetector Task class

843a05c

Add changes from automatic formatter

3175b52

Fix serialization to include backbone, label encoder, prediction deco…

3138a46

…der and preprocessor

Add preset reload for testing numerics

6fa961c

Update preset versions and register detector presets

906889e

Fix unit tests

b3af94b

Add changes from automatic formatter

da91ea1

divyashreepathihalli reviewed Nov 25, 2024

View reviewed changes

divyashreepathihalli requested changes Nov 26, 2024

View reviewed changes

divyashreepathihalli requested a review from sineeli November 26, 2024 01:29

oarriaga added 8 commits November 30, 2024 11:27

Rename block function names

7fc29d4

Fix typo with shape loss docstring

a88b246

Fix suppression spelling typos

925db4c

Fix docstring dividing typo

408f1f7

Change link to keras box formats

714733d

Remove tensorflow import from test file

9430d33

Add passing run task test

b1c12c1

Fix docstring tensor shape

6bf964a

		return xs, {"boxes": ys, "classes": y_classes}


		class YOLOV8DetectorTest(TestCase):

		from keras_hub.src.tests.test_case import TestCase


		class NonMaxSupressionTest(TestCase):

		return x


		def build_block(x, block_arg, channels, depth, block_depth, activation):

YOLOV8 port to keras-hub #1899

Are you sure you want to change the base?

YOLOV8 port to keras-hub #1899

Conversation

oarriaga commented Oct 1, 2024

divyashreepathihalli left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fchollet left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oarriaga Nov 8, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oarriaga Nov 8, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oarriaga commented Nov 8, 2024

divyashreepathihalli commented Nov 25, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sineeli Nov 27, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oarriaga Nov 30, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

oarriaga Nov 8, 2024 •

edited

Loading

oarriaga Nov 8, 2024 •

edited

Loading

sineeli Nov 27, 2024 •

edited

Loading

oarriaga Nov 30, 2024 •

edited

Loading