You are required to read and agree to the below before accessing a full-text version of an article in the IDE article repository.

The full-text document you are about to access is subject to national and international copyright laws. In most cases (but not necessarily all) the consequence is that personal use is allowed given that the copyright owner is duly acknowledged and respected. All other use (typically) require an explicit permission (often in writing) by the copyright owner.

For the reports in this repository we specifically note that

  • the use of articles under IEEE copyright is governed by the IEEE copyright policy (available at http://www.ieee.org/web/publications/rights/copyrightpolicy.html)
  • the use of articles under ACM copyright is governed by the ACM copyright policy (available at http://www.acm.org/pubs/copyright_policy/)
  • technical reports and other articles issued by M‰lardalen University is free for personal use. For other use, the explicit consent of the authors is required
  • in other cases, please contact the copyright owner for detailed information

By accepting I agree to acknowledge and respect the rights of the copyright owner of the document I am about to access.

If you are in doubt, feel free to contact webmaster@ide.mdh.se

DenseDisp: Resource-Aware Disparity Map Estimation by Compressing Siamese Neural Architecture

Fulltext:


Authors:

Mohammad Loni, Ali Zoljodi, Amin Majd , Masoud Daneshtalab, Mikael Sjödin, Ben Juurlink , Reza Akbari

Publication Type:

Conference/Workshop Paper

Venue:

IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE (WCCI) 2020

Publisher:

IEEE

DOI:

https://doi.org/10.1109/CEC48606.2020.9185611


Abstract

Stereo vision cameras are flexible sensors due to providing heterogeneous information such as color, luminance, disparity map (depth), and shape of the objects. Today, Convolutional Neural Networks (CNNs) present the highest accuracy for the disparity map estimation [1]. However, CNNs require considerable computing capacity to process billions of floating-point operations in a real-time fashion. Besides, commercial stereo cameras produce huge size images (e.g., 10 Megapixels [2]), which impose a new computational cost to the system. The problem will be pronounced if we target resource-limited hardware for the implementation. In this paper, we propose DenseDisp, an automatic framework that designs a Siamese neural architecture for disparity map estimation in a reasonable time. DenseDisp leverages a meta-heuristic multi-objective exploration to discover hardware-friendly architectures by considering accuracy and network FLOPS as the optimization objectives. We explore the design space with four different fitness functions to improve the accuracy-FLOPS trade-off and convergency time of the DenseDisp. According to the experimental results, DenseDisp provides up to 39.1x compression rate while losing around 5% accuracy compared to the state-of-the-art results.

Bibtex

@inproceedings{Loni5813,
author = {Mohammad Loni and Ali Zoljodi and Amin Majd and Masoud Daneshtalab and Mikael Sj{\"o}din and Ben Juurlink and Reza Akbari},
title = {DenseDisp: Resource-Aware Disparity Map Estimation by Compressing Siamese Neural Architecture},
month = {July},
year = {2020},
booktitle = {IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE (WCCI) 2020},
publisher = {IEEE},
url = {http://www.es.mdu.se/publications/5813-}
}