(Translated by https://www.hiragana.jp/)
Vanilla CNN for Facial Landmark Regression - Wolfram Neural Net Repository

Vanilla CNN for Facial Landmark Regression

Determine the locations of the eyes, nose and mouth from a facial image

Released in 2015, this net is a regressor for locating five facial landmarks from facial images: eyes, nose and mouth corners. The net output is to be interpreted as the positions of {EyeLeft, EyeRight, Nose, MouthLeft, MouthRight}, where values are rescaled to the input image size so that the bottom-left corner is identified by {0, 0} and the top-right corner by {1, 1}.

Number of layers: 24 | Parameter count: 111,050 | Trained size: 485 KB |

Training Set Information

Performance

Examples

Resource retrieval

Get the pre-trained net:

In[1]:=
NetModel["Vanilla CNN for Facial Landmark Regression"]
Out[1]=

Basic usage

Get a facial image and the net:

In[2]:=
(* Evaluate this cell to get the example input *) CloudGet["https://www.wolframcloud.com/obj/3053cf62-c347-4351-8a91-8917991b0c1f"]
In[3]:=
net = NetModel["Vanilla CNN for Facial Landmark Regression"];

Get the locations of the eyes, nose and mouth corners:

In[4]:=
landmarks = net[img]
Out[4]=

Show the prediction:

In[5]:=
HighlightImage[img, {PointSize[0.04], landmarks}, DataRange -> {{0, 1}, {0, 1}}]
Out[5]=

Requirements

Wolfram Language 11.2 (September 2017) or above

Resource History

Reference

  • Y. Wu, T. Hassner, K. Kim, G. Medioni, P. Natarajan, "Facial Landmark Detection with Tweaked Convolutional Neural Networks," arXiv:1511.04031 (2015)
  • Available from: https://github.com/ishay2b/VanillaCNN
  • Rights: Unrestricted use