Classification of Monkey Species using Deep Learning with MATLAB
MATLABHELPERS demonstrate classification of monkey species using deep learning in MATLAB. In this video we attempt to build a classifier that can identify the species of monkeys from its physical measurements. Six facial characteristics of monkeys are considered: frontallip, face length, facer width and face depth. The problem on hand is to identify the specie of mokey given the observed values for each of these 6 physical characteristics. The six physical characteristics will act as inputs to a neural network and the specie of monkey will be the target. Given an input, which constitutes the six observed values for the physical characteristics of a monkey, the neural network is expected to identify the specie of the monkey.
Abstract
Video recording is now ubiquitous in the study of animal behavior, but its analysis on a large scale is prohibited by the time and resources needed to manually process large volumes of data. We present a deep convolutional neural network (CNN) approach that provides a fully automated pipeline for face detection, tracking, and recognition of wild chimpanzees from long-term video records. In a 14-year dataset yielding 10 million face images from 23 individuals over 50 hours of footage, we obtained an overall accuracy of 92.5% for identity recognition and 96.2% for sex recognition. Using the identified faces, we generated co-occurrence matrices to trace changes in the social network structure of an aging population. The tools we developed enable easy processing and annotation of video datasets, including those from other species. Such automated analysis unveils the future potential of large-scale longitudinal video archives to address fundamental questions in behavior and conservation.
Deep learning
The field of machine learning uses algorithms that enable computer systems to solve tasks without being hand programmed, relying instead on learning from examples. With increasing computational power and the availability of large datasets, “deep learning” techniques have been developed that have brought breakthroughs in a number of different fields, including speech recognition, natural language processing, and computer vision. Deep learning involves training computational models composed of multiple processing layers that learn representations of data with many levels of abstraction, enabling the performance of complex tasks. A particularly effective technique in computer vision is the training of deep convolutional neural network (henceforth CNN) architectures to perform fine-grained recognition of different categories of objects and animals, including automated image and video processing techniques for facial recognition in humans, outperforming humans in both speed and accuracy.
Advances in the field of computer vision have led to the realization among wildlife scientists of the potential of automated computational methods to monitor wildlife. In particular, the emerging field of animal biometrics has adopted computer vision models for pattern recognition to identify numerous species through phenotypic appearance. Similar methods, developed for individual recognition of human faces, have been applied to nonhuman primates, including lemurs, macaques, gorillas, and chimpanzees. A substantial hurdle is developing models that are robust enough to perform on highly challenging datasets, such as those with low resolution and poor visibility. Since most of these previous face recognition methods applied to primates are limited by the size of training datasets, they are mostly shallow methods (small in the number of trainable parameters, hence not using deep learning), using cropped images of frontal faces or datasets from the controlled conditions of animals in captivity. While these methods have made valuable contributions, they are not robust to the inevitable variation in lighting conditions, image quality, pose, occlusions, and motion blur that characterize “wild” footage.