Head pose estimation network based on simple, handmade CNN architecture. Angle regression layers are convolutions + ReLU + batch norm + fully connected with one output.
Biwi Kinect Head Pose Database
Metric | Value |
---|---|
Supported ranges | YAW [-90,90], PITCH [-70,70], ROLL [-70,70] |
GFlops | 0.105 |
MParams | 1.911 |
Source framework | Caffe* |
Angle | Mean ± standard deviation of absolute error |
---|---|
yaw | 5.4 ± 4.4 |
pitch | 5.5 ± 5.3 |
roll | 4.6 ± 5.6 |
- name: "data" , shape: [1x3x60x60] - An input image in [1xCxHxW] format. Expected color order is BGR.
Output layer names in Inference Engine format:
- name: "angle_y_fc", shape: [1, 1] - Estimated yaw (in degrees).
- name: "angle_p_fc", shape: [1, 1] - Estimated pitch (in degrees).
- name: "angle_r_fc", shape: [1, 1] - Estimated roll (in degrees).
Output layer names in Caffe* format:
- name: "fc_y", shape: [1, 1] - Estimated yaw (in degrees).
- name: "fc_p", shape: [1, 1] - Estimated pitch (in degrees).
- name: "fc_r", shape: [1, 1] - Estimated roll (in degrees).
Each output contains one float value that represents value in Tait-Bryan angles (yaw, pitсh or roll).
[*] Other names and brands may be claimed as the property of others.