Computing Reviews
Today's Issue Hot Topics Search Browse Recommended My Account Log In
Review Help
Search
Dense 3D-convolutional neural network for person re-identification in videos
Liu J., Zha Z., Chen X., Wang Z., Zhang Y. ACM Transactions on Multimedia Computing, Communications, and Applications15 (1s):1-19,2019.Type:Article
Date Reviewed: Apr 19 2019

It is well known that the current types of neural networks perform quite well in identifying faces in (still) images. But what about re-identifying moving pedestrians in non-overlapping video sequences taken from different cameras?

The paper’s novel approach increases the accuracy of re-identification (here: rank-1 recognition rate and mean average precision) by 40.8 percent and 4.2 percent, respectively, compared to the best alternative video-based and image-based algorithms. This is essentially achieved by the following two innovative design choices for their 56-layer (generalized) convolutional neural network with 3.2 million training parameters.

First, the authors compose the whole network of four blocks of so-called dense 3D blocks where each layer in a block is connected in a feed-forward fashion to all(!) subsequent layers in the block (as opposed to just the immediately following layer). This has been chosen to “enlarge the receptive ... neurons in both spatial and temporal dimensions,” enabling the network to discriminate “short-term and long-term motion patterns.”

Second, they extend the classical identification loss function with a second term aiming to minimize center loss, that is, the algorithm tries to maintain the “centers” of training samples from each class and to use this center point for feature embedding.

As is evident from the decidedly technical description above, the paper is geared toward neural network specialists, as even advanced concepts are used without any explanation in the text. Complemented by highly illustrative graphics and even pseudocode for the overall algorithm, the authors rightfully concentrate on providing all the details necessary for (re)building their nontrivial neural network. And while I have not checked this meticulously, I am confident that they are very close to achieving this goal.

Reviewer:  Christoph F. Strnadl Review #: CR146538 (1907-0285)
Bookmark and Share
  Editor Recommended
Featured Reviewer
 
 
Object Recognition (I.4.8 ... )
 
 
Computer Vision (I.5.4 ... )
 
 
Neural Nets (I.5.1 ... )
 
Would you recommend this review?
yes
no
Other reviews under "Object Recognition": Date
On view likelihood and stability
Weinshall D., Werman M. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(2): 97-108, 1997. Type: Article
Feb 1 1999
Junctions
Parida L., Geiger D., Hummel R. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(7): 687-698, 1998. Type: Article
Apr 1 1999
Topological recognition of polyhedral objects from multiple views
Laurentini A. Artificial Intelligence 127(1): 31-55, 2001. Type: Article
Apr 17 2002
more...

E-Mail This Printer-Friendly
Send Your Comments
Contact Us
Reproduction in whole or in part without permission is prohibited.   Copyright 1999-2024 ThinkLoud®
Terms of Use
| Privacy Policy