计算机研究生专业英语课程设计
合集下载
相关主题
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
It clearly shows that features at the aligned facial points are much more effective to separate those difficult windows.While effective, the face alignment based post classifier is brute force and too slow for a standard cascade detector when a high recall is desired. In our experiment, a low threshold is set to achieve 99% recall of the OpenCV detector and each image outputs 3000 windows on average. Applying the post classifier for those windows takes a few seconds.
Alignment Helps Detection: a Post Classifier
We used the Viola-Jones detector in OpenCV with alow threshold to ensure a high recall. The detector output a lot of image windows, of which many are false positives. We split all the images into two parts and used the positive and negative output windows in the first part to train a linear SVM classifier, and test e d all the output windows in the second part. We call it a post classifier as it operates on the output of a cascade detector. All windows are resized into 96*96 pixels. We compare three types of features, without and with alignment.
大家好
Joint Cascade Face Detection and Alignment
European Conference on Computer Vision (ECCV), 2014 DongChen,Shaoqing Ren
University of Science and Technology of China
So we presented a face detector that established the new state-of-the-art in terms of both accuracy and speed.Our approach benefited from the recent advances in cascade face alignment and used simple pixel differences as feature which bring advantages on the efficiency. Our detector took only 28.6 milliseconds for a VGA image, more than 1000 times faster than Zhu's[32]. It also achieved the best accuracy on the challenging datasets, significantly out performed all existing academia solutions.
Abstract
T h i s p a p e r present e d a new state-of-the-art approach for face detection. The key idea w a s to combine face alignment with detection, observing that aligned face shapes provided better features for face classification. To make this combination more effective, t h e n e w approach learned the two tasks jointly in the same cascade framework, by exploiting recent advances in face alignment. Such joint learning greatly enhance d the capability of cascade detection and still retain e d its realtime performance. Extensive experiments show that t his approach achieved the best accuracy on challenging datasets, where all existing solutions are either inaccurate or too slow.
Cascade Alignment:
Recent work proposes a pose regression framework that combines pose indexed features with boosted regression. The framework has been shown highly effective for face alignment.
1. we divide the window into 6*6 non-overlapping cells and extract a SIFT descriptor in each cell.
2. we use a fixed mean face shape with 27 facial points and extract a SIFT descriptor centered on each point.
To better exploit the alignment information, we propose a unified framework for cascade face detection and alignment.
Cascade Detection:
Without loss of generality, the classification
3. we align the 27 facial points using the alignment algorithm in [21] and extract a SIFT descriptor centered on each point.
([21]:Face Alignment at 3000 FPS via Regressing)
In the cascade regression, it is progressively updated through T stages as
S t S t 1 R t(x ,S t 1 ),t 1 ,...,T (2)
Each Rt is a regression function. It adds an increment to the estimated shape from the previous stage St−1.
current stage, as where index i iterates over all the
training samples.
A Unified Framework:
A Unified Framework:
A key innovation : the features w e r e defined relative to St−1, so called pose/shape indexed features.We propose d to
^
Fra Baidu bibliotek
R targm in R
||(Si(S ti 1R (xi,S ti 1))||2,
(3)
It is learnt to minimize the shape residual error
between the ground truth shape Sˆand estimated shape in the
score in the cascade detection cNan be written as:
f N C i ( x)
i 1
(1)
T he cascade detection is very fast because most negative image windows are rejected after evaluating only a few weak classifiers.
apply such features in detection as well, by making the learning of weak classifier Ci(x) also dependent on the face
shape.
We divided the N weak classifiers into the T stages.
Introduction
The seminal f ace detection work of Viola and Jones has established the two foundation principles : 1)boosted cascade structure; 2)simple features. Most realtime face detectors in academia and industry nowadays are based on them and work well for near-frontal faces under normal conditions but less effective for non-frontal faces or under more wild conditions (lighting, expression, occlusion).Zhu et al.[32]used a mixture of deformable part models to capture viewpoints and expressions.Shen et al.[24]proposed the first exemplarbased face detector and exploited advanced image retrieval techniques. Both methods are better than Viola-Jones detectors on wild and challenging datasets[32,9]. H o w e v e r , t h e y a r e a l l q u i t e s l o w d u e to t h e h ig h c o m p l e x i t y , i t t o o k a few seconds and this m a d e such detectors unpractical for many scenarios.