Deep Insight Detection

DeeperVision and
DeepInsight Solutions
Junjie Yan*, Naiyan Wang*, Yinan Yu, Linjiao Zhao, Stan Z. Li, Dit-Yan Yeung
* denotes equal contribution
DeeperVision Classification
• Deeper networks always help
[Figure: Top-1 accuracy (single center test); y-axis ticks 59 to 67, x-axis ticks 5 to 9]
DeeperVision Classification
• Nesterov-based optimization
• With a large momentum, Nesterov-based optimization smooths out the
optimization path.
• It improves top-1 accuracy by 0.8%.
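As a minimal sketch of the Nesterov update mentioned above: the gradient is evaluated at a look-ahead point rather than at the current weights. The function name `nesterov_sgd`, the toy quadratic objective, and the learning rate and momentum values are illustrative assumptions, not the settings used in the talk.

```python
import numpy as np

def nesterov_sgd(grad_fn, w0, lr=0.01, momentum=0.95, steps=200):
    """Minimise a function with Nesterov-accelerated gradient descent.

    All hyperparameters are illustrative assumptions, not the talk's values.
    """
    w = np.asarray(w0, dtype=float).copy()
    v = np.zeros_like(w)
    for _ in range(steps):
        # Key difference from classical momentum: the gradient is taken
        # at the look-ahead point w + momentum * v, not at w itself.
        g = grad_fn(w + momentum * v)
        v = momentum * v - lr * g
        w = w + v
    return w

# Toy ill-conditioned quadratic: f(w) = 0.5 * (w[0]**2 + 25 * w[1]**2)
scales = np.array([1.0, 25.0])
w_final = nesterov_sgd(lambda w: scales * w, w0=[5.0, 5.0])
```

On this toy problem the iterate ends close to the minimum at the origin; a real network would replace `grad_fn` with a minibatch gradient.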
DeeperVision Classification
• More findings…
• Abstract the data more gradually (stride, kernel size, etc.)
• More complicated data augmentations
• Spatial Pyramid Pooling (SPP)
• Our final results
• Single net: top-5 error 10.5%
• Ensemble of 5 nets: top-5 error 9.5%
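Spatial Pyramid Pooling, listed among the findings above, can be sketched as follows: a feature map of any spatial size is max-pooled over grids of several resolutions and the results are concatenated into a fixed-length vector. The function name `spatial_pyramid_pool` and the pyramid levels (1, 2, 4) are assumptions for illustration; the talk does not state which levels were used.

```python
import numpy as np

def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a (C, H, W) feature map over n x n grids and concatenate.

    The pyramid levels are an illustrative assumption.
    """
    c, h, w = feature_map.shape
    pooled = []
    for n in levels:
        # Bin edges cover the whole map even when H or W is not divisible by n;
        # neighbouring bins may overlap by one row or column.
        ys = np.linspace(0, h, n + 1)
        xs = np.linspace(0, w, n + 1)
        for i in range(n):
            y0, y1 = int(np.floor(ys[i])), int(np.ceil(ys[i + 1]))
            for j in range(n):
                x0, x1 = int(np.floor(xs[j])), int(np.ceil(xs[j + 1]))
                pooled.append(feature_map[:, y0:y1, x0:x1].max(axis=(1, 2)))
    return np.concatenate(pooled)

rng = np.random.default_rng(0)
vec_a = spatial_pyramid_pool(rng.normal(size=(8, 13, 13)))
vec_b = spatial_pyramid_pool(rng.normal(size=(8, 10, 17)))
```

Both inputs yield vectors of length 8 × (1 + 4 + 16) = 168, which is what lets fully connected layers follow convolutional features of varying spatial size.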
Deep Insight Detection
Region proposal + CNN feature extraction
• Selective Search + Structural Edge [1] for region proposal.
• 7/8/9 convolution layers + SPM + 2 fully connected layers.
• Deeper models need more tuning iterations.
• A better (deeper) classification CNN always helps detection.
[1] C. Lawrence Zitnick and Piotr Dollár. Edge Boxes: Locating Object Proposals from Edges. ECCV 2014.
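One way to combine proposals from two generators, as in the first bullet above, is to take their union and drop near-duplicates. The sketch below is an assumption about how such a merge could work, not the authors' procedure: the helper names `iou` and `merge_proposals`, the (x1, y1, x2, y2) box format, and the 0.9 duplicate threshold are all hypothetical.

```python
import numpy as np

def iou(box, boxes):
    """IoU between one box and an (N, 4) array of boxes, as (x1, y1, x2, y2)."""
    x1 = np.maximum(box[0], boxes[:, 0])
    y1 = np.maximum(box[1], boxes[:, 1])
    x2 = np.minimum(box[2], boxes[:, 2])
    y2 = np.minimum(box[3], boxes[:, 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    area = (box[2] - box[0]) * (box[3] - box[1])
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    return inter / (area + areas - inter)

def merge_proposals(boxes_a, boxes_b, dup_iou=0.9):
    """Union of two proposal sets; drop boxes in B that nearly duplicate one in A.

    The threshold dup_iou is a hypothetical choice.
    """
    keep = [b for b in boxes_b if iou(b, boxes_a).max() < dup_iou]
    return np.vstack([boxes_a] + keep) if keep else boxes_a.copy()

# Stand-ins for the outputs of two proposal generators
boxes_a = np.array([[0.0, 0.0, 10.0, 10.0], [20.0, 20.0, 40.0, 40.0]])
boxes_b = np.array([[0.0, 0.0, 10.0, 10.5], [50.0, 50.0, 60.0, 60.0]])
merged = merge_proposals(boxes_a, boxes_b)
```

Each merged box would then be cropped from the image and passed to the CNN for feature extraction.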
Diagnosis Experiments (on 2013-val2)
Configuration                 mAP
Original RCNN                 31.4
+ 9conv + SPM                 36.6
+ more iterations             39.2
+ Structural Edge Proposal    40.1
+ 7/8/9 Conv Ensemble         40.7
+ CLS Context                 42.0
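To read the diagnosis numbers above as per-step gains, the short calculation below pairs each configuration with its val2 mAP and takes successive differences. It assumes each row is cumulative, that is, each "+" row adds one change to the configuration before it; the variable names are illustrative.

```python
# (configuration, mAP on 2013-val2), in the order given on the slide
steps = [
    ("Original RCNN", 31.4),
    ("+ 9conv + SPM", 36.6),
    ("+ more iterations", 39.2),
    ("+ Structural Edge Proposal", 40.1),
    ("+ 7/8/9 Conv Ensemble", 40.7),
    ("+ CLS Context", 42.0),
]

# Gain contributed by each added component, assuming rows are cumulative
gains = [
    (name, round(cur - prev, 1))
    for (_, prev), (name, cur) in zip(steps, steps[1:])
]
total_gain = round(steps[-1][1] - steps[0][1], 1)

for name, gain in gains:
    print(f"{name:<28} +{gain:.1f} mAP")
print(f"{'Total over original RCNN':<28} +{total_gain:.1f} mAP")
```

Under that assumption, the deeper network with SPM accounts for about half of the total improvement.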
Our Final Result
• We have the best single model (40.2 mAP vs. 38.0 mAP for
GoogLeNet).
• We used a non-optimal ensemble method when submitting our result. A
better ensemble method reached 42.0 mAP on val2 after the
competition.
• We keep improving…
Advertisement
• Junjie and I are looking for postdoc and job positions 
• Junjie Yan: http://www.cbsr.ia.ac.cn/users/jjyan/main.htm
• Naiyan Wang: http://winsty.net