avatarDavid Cochard

Summary

M2Det is an advanced object detection model that outperforms YOLOv3 and RetinaDet, utilizing a Multi-Level Feature Pyramid Network (MLFPN) with a Thinned U-shape Module (TUM) for improved accuracy and featuring integration with the ailia SDK for easy implementation in AI applications.

Abstract

The M2Det model represents a significant advancement in object detection, offering superior performance over previous models like YOLOv3 and RetinaDet. It leverages a specialized backbone for object detection, known as MLFPN, which incorporates TUM to enhance hierarchical feature extraction. This design allows M2Det to detect objects from 80 categories in the COCO dataset with high precision. The model is accessible through the ailia SDK, which facilitates the development of AI applications across various platforms with GPU acceleration. Users can choose between VGG-16 and ResNet-101 as the backbone for M2Det, balancing speed and accuracy according to their needs. Additionally, ax Inc., the developer of ailia SDK, offers comprehensive services, including consulting, model creation, and AI application development.

Opinions

  • M2Det is recognized for its high accuracy in object detection, surpassing the performance of contemporaneous models.
  • The use of MLFPN and TUM in M2Det is seen as an innovative approach to feature extraction, contributing to its superior detection capabilities.
  • The integration of M2Det with the ailia SDK is considered beneficial for developers, providing a streamlined process for implementing the model in AI applications.
  • The choice between VGG-16 and ResNet-101 as backbones for M2Det allows for flexibility in balancing computational speed and detection accuracy.
  • ax Inc. is portrayed as a competent entity in the field of AI, offering a range of services to support the deployment of AI solutions.

M2Det : Highly Accurate Object Detection Model

This is an introduction to「M2Det」, a machine learning model that can be used with ailia SDK. You can easily use this model to create AI applications using ailia SDK as well as many other ready-to-use ailia MODELS.

Overview

M2Det is a highly accurate object detection model proposed in November 2018. It can detect bounding boxes of objects from the 80 categories in COCO.

Conventional object detection uses an image classification backbone (Mobilenet, VGG, ResNet, etc.) to compute bounding boxes. For example, in SSD, after extracting features with the image classification backbone, the bounding box is calculated by adding a similar backbone in the later stage.

The same was true for ReinaDet, which uses Feature Pyramids, and although it uses more hierarchical features than SSD, the backbone of object detection was based on image classification.

In M2Det, after extracting features with the image classification backbone, a specialized backbone is used for object detection to achieve higher accuracy.

(Source:https://arxiv.org/abs/1811.04533

This backbone specialized for object detection is called MLFPN (Multi-Level Feature Pyramid Network), which uses TUM (Thinned U-shape Module) in a hierarchical manner.

(Source:https://arxiv.org/abs/1811.04533

TUM has the following structure.

(Source:https://arxiv.org/abs/1811.04533

Performance of M2Det

M2Det performs better than YOLOv3 and RetinaDet.

(Source:https://arxiv.org/abs/1811.04533

VGG-16 and ResNet-101 are used as backbone for M2Det. VGG-16 is faster and ResNet-101 is more accurate.

(Source:https://arxiv.org/abs/1811.04533

Usage

The ailia SDK sample below allows you to use M2Det, with a VGG-16 based M2Det that takes 3x512x512 as input.

You can use the following command to perform object detection using M2Det on a web camera.

$ python3 m2det.py -v 0

Related topics

ax Inc. has developed ailia SDK, which enables cross-platform, GPU-based rapid inference.

ax Inc. provides a wide range of services from consulting and model creation, to the development of AI-based applications and SDKs. Feel free to contact us for any inquiry.

Ailia Models
Machine Learning
Deep Learning
Object Detection
Yolo
Recommended from ReadMedium