Create RetinaNet object detection guide #2069

Open
Wants to merge 11 commits into base: master
Conversation

@sachinprasadhs (Collaborator) commented Mar 24, 2025

Summary:
This pull request adds comprehensive documentation on how to use an object detection model from Keras Hub, with RetinaNet as the example.

This tutorial includes:

  • Loading and preprocessing a training dataset
  • Demonstrating inference using pretrained weights (see the sketch after this list)
  • Fine-tuning a pretrained object detector
  • Custom training of an object detector
  • Summary
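
To make the inference step concrete, here is a minimal sketch of what loading a pretrained RetinaNet from Keras Hub and running prediction can look like. The class and preset names below are assumptions based on recent Keras Hub releases and may differ from what the guide ultimately uses.

import numpy as np
import keras_hub

# Assumed preset name; check keras_hub.models.RetinaNetObjectDetector.presets
# in your installed version for the names that are actually available.
detector = keras_hub.models.RetinaNetObjectDetector.from_preset(
    "retinanet_resnet50_fpn_coco"
)

# A single dummy RGB image; the guide loads real images instead.
images = np.random.uniform(0, 255, size=(1, 640, 640, 3)).astype("float32")

# predict() typically returns a dict with boxes, labels, and confidence scores.
predictions = detector.predict(images)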

@divyashreepathihalli (Collaborator) left a comment

Thanks for the PR @sachinprasadhs. Since this is a guide, we should have more detailed explanations that would help users understand our framework better. Let's add more of that.

Thanks!

@mattdangerw (Member) left a comment

Thanks! All minor comments. Also, heads up: I added keras-team/keras-hub#2219 for the COCO id-to-name mappings. Whenever it is released we can use it.

@divyashreepathihalli (Collaborator) commented

/gemini review

@gemini-code-assist (bot) left a comment

Code Review

This pull request introduces a guide for object detection with RetinaNet using Keras Hub. I have identified a critical issue in the data loading logic, along with suggestions to improve code quality and clarity.

    cache_dir=data_dir,
    extract=True,
)
data_dir = os.path.join(get_data, extracted_dir)

Severity: high

The keras.utils.get_file function returns the path to the downloaded file, not the directory where files are extracted. Use os.path.dirname to get the directory of the downloaded file.

Suggested change:
- data_dir = os.path.join(get_data, extracted_dir)
+ data_dir = os.path.join(os.path.dirname(get_data), extracted_dir)
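
For readers following along, a hedged sketch of the fix in context is shown below; VOC_2007_URL mirrors the constant used in the guide, and the exact return value of keras.utils.get_file with extract=True can differ between Keras versions, so it is worth confirming against the version the guide pins.

import os
import keras

# Download and extract the archive; per the review comment, get_file is assumed
# to return the path of the downloaded archive file here.
archive_path = keras.utils.get_file(origin=VOC_2007_URL, extract=True)

# The extracted contents live next to the archive, so take its parent directory
# and join it with the expected VOCdevkit subfolder.
data_dir = os.path.join(os.path.dirname(archive_path), "VOCdevkit", "VOC2007")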

Comment on lines +349 to +354
year="2007",
split="trainval",
data_dir="./",
voc_url=VOC_2007_URL,
):
extracted_dir = os.path.join("VOCdevkit", f"VOC{year}")

Severity: medium

The image_file_paths variable is defined but not used. Remove it to avoid confusion.

Suggested change:
    year="2007",
    split="trainval",
    data_dir="./",
    voc_url=VOC_2007_URL,
):
    extracted_dir = os.path.join("VOCdevkit", f"VOC{year}")
    # Parallel process all the images.
    annotation_file_paths = tf.io.gfile.glob(
        os.path.join(data_dir, "Annotations", "*.xml")
    )

Comment on lines +524 to +547
def decode_tfds(record):
    """Decodes a standard TFDS object detection record.

    Args:
        record: A dictionary representing a single TFDS record.

    Returns:
        A dictionary with "images" and "bounding_boxes".
    """
    image = record["image"]
    image_shape = tf.shape(image)
    height, width = image_shape[0], image_shape[1]
    boxes = keras.utils.bounding_boxes.convert_format(
        record["objects"]["bbox"],
        source="rel_yxyx",
        target=bbox_format,
        height=height,
        width=width,
    )
    labels = record["objects"]["label"]

    bounding_boxes = {"boxes": boxes, "labels": labels}

    return {"images": image, "bounding_boxes": bounding_boxes}

Severity: medium

The function decode_tfds is defined but never used. Remove it to improve maintainability.
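
If the guide chooses to keep decode_tfds instead of deleting it, the typical wiring would be to map it over the TFDS split before resizing; the sketch below uses hypothetical dataset and variable names.

import tensorflow as tf
import tensorflow_datasets as tfds

# Hypothetical dataset/split; the guide's actual variables may differ.
train_ds = tfds.load("voc/2007", split="train")
train_ds = train_ds.map(decode_tfds, num_parallel_calls=tf.data.AUTOTUNE)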

Comment on lines +550 to +565
def preprocess_tfds(ds):
    """Preprocesses a TFDS dataset for object detection.

    Args:
        ds: The TFDS dataset.
        resizing: A resizing function.
        max_box_layer: A max box processing function.
        batch_size: The batch size.

    Returns:
        A preprocessed TFDS dataset.
    """
    ds = ds.map(resizing, num_parallel_calls=tf.data.AUTOTUNE)
    ds = ds.map(max_box_layer, num_parallel_calls=tf.data.AUTOTUNE)
    ds = ds.batch(batch_size, drop_remainder=True)
    return ds

Severity: medium

The resizing and max_box_layer functions and the batch_size variable are used from the global scope but are documented as arguments. Explicitly pass them as arguments to increase reusability.

Suggested change:

Current:

def preprocess_tfds(ds):
    """Preprocesses a TFDS dataset for object detection.

    Args:
        ds: The TFDS dataset.
        resizing: A resizing function.
        max_box_layer: A max box processing function.
        batch_size: The batch size.

    Returns:
        A preprocessed TFDS dataset.
    """
    ds = ds.map(resizing, num_parallel_calls=tf.data.AUTOTUNE)
    ds = ds.map(max_box_layer, num_parallel_calls=tf.data.AUTOTUNE)
    ds = ds.batch(batch_size, drop_remainder=True)
    return ds

Suggested:

def preprocess_tfds(ds, resizing, max_box_layer, batch_size):
    """Preprocesses a TFDS dataset for object detection.

    Args:
        ds: The TFDS dataset.
        resizing: A resizing function.
        max_box_layer: A max box processing function.
        batch_size: The batch size.

    Returns:
        A preprocessed TFDS dataset.
    """
    ds = ds.map(resizing, num_parallel_calls=tf.data.AUTOTUNE)
    ds = ds.map(max_box_layer, num_parallel_calls=tf.data.AUTOTUNE)
    ds = ds.batch(batch_size, drop_remainder=True)
    return ds
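
As a usage note for the suggested signature, callers would then pass the preprocessing pieces explicitly; the variable names below mirror the globals from the guide and are assumptions.

# resizing, max_box_layer, and batch_size are defined earlier in the guide;
# passing them explicitly keeps preprocess_tfds reusable across datasets.
train_ds = preprocess_tfds(train_ds, resizing, max_box_layer, batch_size)
eval_ds = preprocess_tfds(eval_ds, resizing, max_box_layer, batch_size)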

        update_freq=1,
    ),
    keras.callbacks.ModelCheckpoint(
        ckpt_path + "/{epoch:04d}-{val_loss:.2f}.weights.h5",

Severity: medium

Use os.path.join for constructing file paths to ensure platform independence.

Suggested change:
- ckpt_path + "/{epoch:04d}-{val_loss:.2f}.weights.h5",
+ os.path.join(ckpt_path, "{epoch:04d}-{val_loss:.2f}.weights.h5"),
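
For illustration, the callback with the suggested path construction would look roughly like this; the save_weights_only and monitor settings are assumptions about what the guide configures.

import os
import keras

callbacks = [
    keras.callbacks.ModelCheckpoint(
        os.path.join(ckpt_path, "{epoch:04d}-{val_loss:.2f}.weights.h5"),
        save_weights_only=True,  # needed for a ".weights.h5" filepath in Keras 3
        monitor="val_loss",
    ),
]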

Comment on lines +77 to +81
keras-nlp 0.19.0 requires keras-hub==0.19.0, but you have keras-hub 0.20.0 which is incompatible.

WARNING: All log messages before absl::InitializeLog() is called are written to STDERR
E0000 00:00:1746815719.896182 8973 cuda_dnn.cc:8310] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
E0000 00:00:1746815719.902635 8973 cuda_blas.cc:1418] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered

Severity: medium

The output block contains a package incompatibility warning and some CUDA/cuDNN errors. Either resolve the dependency conflict, explain why these messages appear and whether they can be safely ignored, or remove this output block from the guide to avoid confusion.
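
One possible way to address this, sketched under the assumption that a compatible release pair exists at publish time, is to upgrade the conflicting packages and silence TensorFlow's C++ log chatter before it is imported:

# Keep keras-nlp and keras-hub on matching releases so pip stops warning;
# the exact versions to pin depend on what is current when the guide is built.
# !pip install -q --upgrade keras-hub keras-nlp

import os

# This may hide the cuDNN/cuBLAS "factory already registered" messages, since
# it raises the minimum C++ log level before tensorflow is imported.
os.environ["TF_CPP_MIN_LOG_LEVEL"] = "3"

import tensorflow as tf  # noqa: E402  (import after setting the env var)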
