Accelerating computer vision-based human identification through the integration of deep learning-based age estimation from 2 to 89 years

Heinrich, Andreas

doi:10.1038/s41598-024-54877-1

Download PDF

Article
Open access
Published: 20 February 2024

Accelerating computer vision-based human identification through the integration of deep learning-based age estimation from 2 to 89 years

Andreas Heinrich¹

Scientific Reports volume 14, Article number: 4195 (2024) Cite this article

563 Accesses
Metrics details

Subjects

Abstract

Computer Vision (CV)-based human identification using orthopantomograms (OPGs) has the potential to identify unknown deceased individuals by comparing postmortem OPGs with a comprehensive antemortem CV database. However, the growing size of the CV database leads to longer processing times. This study aims to develop a standardized and reliable Convolutional Neural Network (CNN) for age estimation using OPGs and integrate it into the CV-based human identification process. The CNN was trained on 50,000 OPGs, each labeled with ages ranging from 2 to 89 years. Testing included three postmortem OPGs, 10,779 antemortem OPGs, and an additional set of 70 OPGs within the context of CV-based human identification. Integrating the CNN for age estimation into CV-based human identification process resulted in a substantial reduction of up to 96% in processing time for a CV database containing 105,251 entries. Age estimation accuracy varied between postmortem and antemortem OPGs, with a mean absolute error (MAE) of 2.76 ± 2.67 years and 3.26 ± 3.06 years across all ages, as well as 3.69 ± 3.14 years for an additional 70 OPGs. In conclusion, the incorporation of a CNN for age estimation in the CV-based human identification process significantly reduces processing time while delivering reliable results.

Diagnostic accuracy of brain age prediction in a memory clinic population and comparison with clinically available volumetric measures

Article Open access 11 September 2023

Personal identification with orthopantomography using simple convolutional neural networks: a preliminary study

Article Open access 11 August 2020

Deep learning-based age estimation from chest X-rays indicates cardiovascular prognosis

Article Open access 09 December 2022

Introduction

The international criminal police organization (INTERPOL) designates fingerprints, deoxyribonucleic acid (DNA) profiling, and forensic odontology as primary identification methods due to their scientific reliability^1,2. INTERPOL has established standardized codes for dental and oral features in postmortem and antemortem images/documents used in identification examinations, enabling the verification of data concordance. The search for suitable antemortem reference materials becomes challenging and time-consuming in the absence of limiting clues to potential identity. A novel Computer Vision (CV)-based method for identifying unknown deceased individuals was introduced in the last few years^3,4. This method involves extracting CV features from pre-processed images, such as orthopantomograms (OPGs), which are then stored in an antemortem CV feature database (CV database). Automated human identification is achieved by comparing postmortem CV features with an extensive CV database (refer to Fig. 1 and for more details the reference https://www.nature.com/articles/s41598-020-60817-6)³. The CV-based human identification is fundamentally not a legally secure method for identification. Its task is rather to locate suitable reference materials and/or obtain clues to the searched individual (refer to supplementary material Fig. S1). By narrowing down the possible identity from thousands of potential identities, a legally secure and forensic identification is subsequently facilitated with appropriate reference materials by a highly qualified professional. This study is part of a project that aims to establish the application of CV-based human identification with very large CV databases.

A CV database can be automatically filled using medical image databases. With the increasing size of a CV database, the signal processing time increases, thus extending the time it takes to identify a individual. Additional filters, such as estimated age of the unknown individual, can help avoid unnecessary queries in the database. Age estimation is a highly complex process that demands expert knowledge. Convolutional Neural Networks (CNNs) have displayed significant promise in objective and automated age estimation, as evidenced by multiple studies focused on OPGs^5,6,7,8,9,10. However, there is currently limited experience in CNN-based age estimation for postmortem individuals, and results for adults age groups often fall short of expectations. Furthermore, existing CNNs for age estimation with OPGs have typically been trained, validated, and tested on relatively small datasets, ranging from 304 to 10,400 OPGs for training, 76 to 2,600 OPGs for validation, and 95 to 2,000 OPGs for testing^{5,6,8,9,10,11}. Therefore, age estimation using CNN and OPGs is frequently implemented based on transfer learning.

Forensic age estimation is based on the developmental stages of certain anatomical structures, such as hand and wrist bones¹², medial clavicular epiphysis^13,14, and third molars¹⁵. In Germany, the Study Group on Forensic Age Diagnostics (AGFAD) of the German Forensic Medicine Society (DGRM) comprises leading experts in the field. They recommend utilizing a maximum diversity of age indicators for age estimations in the living^{15,16,17,18,19}. While age estimation in the living is not the primary focus of this study, the cited works illustrate the social, legal, and scientific sensitivity of age estimation as a forensic issue. It should be emphasized that the goal of a fully automated approach for age estimation, as presented in this study, cannot replace forensic age estimation. The approach is intended solely as an auxiliary tool for intelligent searches in CV databases. Therefore, it is not obligated to achieve the same level of accuracy as a forensic science expertise provided by forensic scientists in court. Moreover, the CNN cannot "explain to the judge how it yielded the estimate", which may be a crucial consideration in court.

The primary objective of this study was to develop a standardized and robust CNN for age estimation using OPGs and incorporate it into the method of human identification through a CV-database. To achieve this, a larger dataset comprising 40,000/10,000 OPGs for training/validation and 10,779 OPGs for CNN testing will be leveraged, covering a wider age range from 2 to 89 years. Furthermore, the CNN's applicability for age estimation in cases involving postmortem OPGs will be assessed, and within the context of CV-based human identification for an additional set of 70 independent OPGs divided into 7 age categories.

Results

CV-based human identification with CNN for age estimation

Integrating a CNN for age estimation into CV-based human identification with a CV database of 105,251 entries reduced signal processing time by up to 96% (refer to Table 1). The signal processing time was 9.34 ± 9.48 min per individual for ± 5-year search range, 18.64 ± 16.84 min per individual for ± 10-year search range, 57.66 ± 39.41 min per individual for ± 15-year search range, and 246.00 ± 5.47 min per individual without age filtering. Applying the age filter (100% corresponding to the full database with 105,251 entries), only 1–17% (± 5 years), 3–33% (± 10 years) and 5–46% (± 15 years) of CV database rows needed evaluation, based on the individual age. The number of considered rows varies with age and search radius. The CNN was tested with 70 OPGs (10 per age group) that were not part of its training or testing.

Table 1 CNN-based age estimation of 70 individuals (not used for training, validation or testing) was applied to reduce the signal processing time for a CV-based human identification process.

Full size table

The results showed a mean absolute error (MAE) of 3.69 ± 3.14 years and a mean signed error (MSE) of 1.70 ± 4.56 years compared to the actual age (refer to Table 1). In the case of CV-based human identification with the best result being the searched individual (rank 1), 48 out of 70 (69%) were identified within a ± 5-year search range, 68 out of 70 (97%) within a ± 10-year search range, and all 70 out of 70 (100%) within a ± 15-year search range.

Evaluation the CNN for age estimation

The CNN for age estimation was successful when applied to postmortem OPGs, demonstrating a MAE of 2.76 ± 2.67 years and a MSE of 1.10 ± 4.09 years relative to the actual age. Furthermore, in the antemortem test dataset comprising over 10,779 OPGs, the MAE was 3.26 ± 3.06 years, and the MSE was 0.16 ± 4.47 years across all age groups relative to the actual age (refer to Table 2). The study's findings indicated the attainment of reliable age estimations, with MAE ranging from 1.48 years (95% CI 1.14, 1.82) in the age group of 2–9 years to 6.02 years (95% CI 5.54, 6.50) in the age group of 80–89 years.

Table 2 Result of applying the CNN for age estimation to a test dataset that was not used for either training or validation for 10,779 antemortem OPGs and three postmortem OPGs.

Full size table

The age estimation success rates (age prediction was correct) within various error ranges across all data were as follows: 24.23% within ± 1 year, 43.71% within ± 2 years, 58.91% within ± 3 years, 70.03% within ± 4 years, 78.58% within ± 5 years, 96.17% within ± 10 years, 99.29% within ± 15 years, and 99.93% within ± 20 years. The success rate was highest for the youngest age range and decreased with increasing age. The scatter plot (refer to Fig. 2) and the boxplots (refer to Fig. 3) illustrate a correlation between decreasing prediction accuracy and increasing age, particularly evident from the age of 70 onward. In these cases, individuals were often predicted to be younger than their actual age. The results for each age year are presented in supplementary material Table S1.

In Fig. 4 and in supplementary material Fig. S2 without Gradient-weighted Class Activation Mapping (Grad-CAM), examples of OPGs with predicted and actual ages are presented. The absolute error in age estimation exceeded 20 years for only eight out of 10,779 OPGs (0.07%), corresponding to actual ages of 2, 3, 25, 79, 82, 85, 86, and 89 years. Three examples are shown in Fig. 4M–O.

The CNN's performance was assessed with an R² value of 0.95 and a root mean squared error (RMSE) of 4.40 years. When applying the CNN, there was no statistical significance (P > 0.5) observed with respect to sex or whether the individual (not the test OPG) was already known during the CNN training. The CNN was trained for 2216 epochs, resulting in a MAE of 2.80 years and a loss of 12.63 years squared for the training set. In validation, the CNN achieved a MAE of 3.42 years and a loss of 22.06 years squared. The test dataset of 10,779 OPGs have a MAE of 3.26 years and a loss of 19.97 years squared. Using OPGs with different resolutions for the CNN does not reduce age prediction accuracy (refer to supplementary material Fig. S3). In fact, the trained CNN remains robust across various resolutions, ensuring accuracy for its intended purposes.

Further analyses

With the same number of training epochs, a larger dataset results in reduced loss and MAE (refer to supplementary material Table S2). When applying the test dataset with 10,779 OPGs, the MAE for the best CNN within 100 epochs is 7.00 ± 5.88 years (5,000 OPGs), 5.66 ± 4.90 years (20,000 OPGs), 4.63 ± 4.11 years (35,000 OPGs), and 4.22 ± 3.86 years (50,000 OPGs).

Transfer learning with InceptionV3, incorporating 50,000 OPGs, yielded a MAE of 4.85 ± 4.29 years (loss 41.87 years squared) for the best CNN on the test dataset, which included 10,779 OPGs.

Discussion

The process of CV-based human identification for an unknown deceased individual can be significantly expedited when additionally using a CNN to estimate the individual's age. The deep learning-based age estimation has shown reliability for individuals between the ages of 2 and 89 years. For the majority of all tested 10,852 OPGs, a ± 5-year tolerance range is sufficient for expedited identification, as it covers the actual age of the searched individual. However, individuals aged 70 and above are increasingly estimated as younger by the CNN than they actually are, which is why the search radius should be expanded to younger ages. In general, the signal processing time is also influenced by the age distribution of the CV database and the computing technology used.

In cases involving unknown deceased individuals, the acquisition of postmortem images may be a necessary step. These images can be compared to antemortem images to narrow down the possible group of individuals. However, manual comparisons by forensics experts are time-consuming and demand high concentration. Furthermore, experts are limited in the maximum number of possible comparisons they can make, whereas computers can efficiently handle millions of data queries. Automated methods for comparing these images with databases can expedite the identification process, assisting forensic pathologists without replacing them. In essence, CV-based human identification aims to narrow down potential identities, making it easier for qualified professionals to retrieve appropriate reference materials for forensic identification. A CV database offers advantages in preserving patient data privacy (CV features do not allow for image reconstruction) and enabling faster identification with reduced database storage requirements. CV features can be extracted from various images, and the choice of OPGs is practical due to their widespread use, including among younger people. This technology has the potential to be as effective as fingerprint or DNA identification methods.

Automated filtering methods are essential for efficient CV-based human identification with an extensive CV database, as matching with millions of entries can be time-consuming. This study focused on automated age estimation using a CNN, as it can be conducted objectively, independently, and significantly faster than an expert's estimation. Furthermore, there is the potential for additional filtering methods like sex estimation²⁰, dental segmentation²¹, and geographical region-based filtering, to avoid unnecessary database queries. The longer it takes to identify an unknown deceased individual, the harder it is to find possible clues to the perpetrator. To enhance the identification probability, it is advisable to store CV features for each individual's OPG in the CV database.

Age estimation holds significant importance in various fields, including forensic science, law enforcement and virtopsy, where it provides an objective assessment of an individual's age when such information is lacking^{3,6,22,23,24,25,26,27}. In expert-based forensic age estimation, teeth are crucial. Various dental factors, such as crown formation, mineralization, root growth, eruption sequence, pulp size, loss, calcification, mobility, and microstructure, are considered^8,9. Wisdom teeth are reliable for estimating ages between 10 and 24 years⁷. However, forensic age estimation in adults is challenging due to limited reliable methods. Dental pulp, alveolar bone loss and third molars are important in adult forensic age estimation, but it's more uncertain due to variability^9,28,29. The accepted forensic threshold for dental age estimation method in adults from diverse population groups is a standard deviation below ± 10 years²⁸. The inherent variability in dental maturity among 95% of the population is approximately ± 1.5–2 years, meaning that it would be theoretically impossible to achieve a higher level of precision³⁰. Studies have reported average error rates of around ± 1–2 years for children and approximately ± 5 to ± 14 years for adults³⁰.

Manual methods like Demirjian's, Willems', Cameriere's, Nolla's, Smith's, Haavikko's, and Chaillet's, developed for children and subadults, determine chronological age from dental features observed in X-rays^31,32. Subjective judgment and a wide margin of error remain problematic⁹, as seen in a review³² with contrasting results for the same population. Demirjian's method, widely accepted for forensic age estimation in children, has been extensively applied in various populations³² based on developmental stages and characteristics of teeth. Studies²⁸ on adults using OPGs demonstrated varying standard errors of estimation (SEE) per tooth or group of tooth. For example, the Kvaal et al. method showed SEE ranging from 3.63 to 11.60 years, mean SEE across all publications being 8.59 ± 1.56 years based on a maximum of 200 OPGs. The Cameriere et al. method exhibited SEE ranging from 3.27 to 6.38 years, mean SEE across all publications being 5.13 ± 1.64 years based on a maximum of 606 OPGs. Kvaal et al. method measures pulp, tooth, and root length, as well as root and pulp width at three levels. Cameriere et al. method analyzes pulp and tooth area. These adult-oriented methods are complex and require expertise in measurement techniques.

Age estimation using a CNN introduces a novel approach by incorporating additional features, beyond teeth and tooth roots, for estimation without the need for preprocessing the OPG. Consequently, a relatively low resolution of 256 × 256 pixels proves adequate for achieving excellent results with a CNN. Figure 4's Grad-CAM analysis revealed that the CNN leverages not only dental features but also a broader set of features across the entire head region for age estimation. Therefore, teeth are not necessarily required for age estimation using a CNN, as demonstrated in Fig. 4J,K. Furthermore, cropping OPGs to focus solely on teeth and tooth roots has been found to be detrimental to age estimation accuracy with a CNN—especially at lower resolutions.

A CNN is tailored for a specific task. In this study, a single CNN for age estimation was applied across a wide age range to integrate seamlessly into a CV-based human identification process. The estimated age is utilized exclusively within the automated process and is not presented as an output. The reason is that the use of artificial intelligence (AI) to expedite tasks typically handled by highly skilled professionals necessitates careful consideration. The single CNN integrates age-related changes in subadults (mineralization, eruption) with those in adults (attrition, root resorption, etc.), potentially leading to increased measurement uncertainty. Overestimated or underestimated predictions in the context of CV-based human identification only result in longer signal processing times, with no legal implications. Exploring the use of two separate CNNs for subadults and adults may reduce uncertainties, enhancing our understanding of system behavior. However, these considerations were not the primary focus of this study. In essence, inaccurate age predictions have significant implications in legal and investigative contexts and should only be conducted by highly skilled professionals. For example, overestimating age can misclassify minors as adults, influencing legal decisions and potentially violating rights. Conversely, underestimating age may misidentify adults as minors, jeopardizing legal protection.

In recent literature, transfer learning approaches^6,8,9,10,11 have been primarily employed, with a relatively small number of OPGs used for training and testing the CNN. The overview in Table 3 comprises studies concerning age estimation through CNN and OPGs across a wide age range, but please note that the list is not exhaustive due to the diversity within the literature. The age ranges investigated ranged from 2 to 90 years, with two studies^6,9 using almost exclusively OPGs of individuals under 30 years old. Ko et al.⁸ utilized the algorithm Darknet-19 for classifying dental age, achieving accuracy rates ranging from 59 to 84% and 49 to 96%-based on the acceptable deviation ranges of ± 5 and ± 10 years, respectively. Atas et al.¹⁰ tested transfer learning for six different CNNs. The best-performing one was InceptionV3, which was then modified to improve accuracy and speed. Kim et al.⁹ used a custom CNN for extracting image patches and transfer learning (ResNet-152) for predicting age group of the first molar, resulting in a classification of the age into 5 different groups. Vila-Blanco et al.⁶ proposed two transfer learning methods for predicting chronological age, with the best results achieved using DASNet. The datasets used consisted of 84% of individuals aged below 30 years. Alkaabi et al.¹¹ evaluates a custom CNN and various transfer learning methods.

Table 3 Overview of literature on age estimation using CNN methods with OPGs across a wide age range.

Full size table

All of these studies^6,8,9,10,11 have in common their use of a low resolution, typically up to 256 × 256 pixels, which has consistently yielded favorable results. As a result, this study also employed a 256 × 256 pixel resolution. Furthermore, these studies utilize transfer learning, advantageous for achieving rapid and satisfactory results with limited datasets. Ad-hoc CNNs may outperform transfer learning when trained specifically for a particular solution^33,34. Transfer learning, often relying on pretraining with non-medical images, carries the risk of inheriting undesired patterns or biases, potentially diminishing accuracy in addressing specific issues. In this study, the ad-hoc CNN outperformed transfer learning, possibly due to the diversity of OPGs. Notably, no OPG filtering was performed, except for excluding improperly stored images and those from anonymous individuals. While meticulous filtering (e.g., excluding images with large implants, good image quality, no artifacts) can improve results. However, deviating from this specific group might result in less optimal outcomes. Nevertheless, the objective of this study necessitates a diverse range of OPGs to avoid compromising potential CV-based human identification through unreliable age filtering. The emphasis is not on achieving the most accurate age estimation but rather on obtaining a reliable estimation, even with less optimal OPGs. This approach aims to enhance result robustness, potentially even for postmortem OPGs.

The fluctuating number of training samples per age year is sufficiently balanced and has an adequate quantity, excluding the edges of the age range (refer to supplementary material Fig. S4). Additionally, the minor surplus of male OPGs does not significantly impact the CNN's performance in this study (P > 0.5). The developed CNN exhibits robust generalizability due to the diversity of OPGs, allowing effective learning of various patterns. The decision to omit data balancing was made because the methods commonly employed to address imbalanced data in classification may not be the most suitable solution for regression, and further research is needed³⁵. Treating different ages as separate groups in regression isn't the most effective because it doesn't make the most of similarities between close ages³⁵. Increasing the number of OPGs has had a positive impact on the outcome within the same number of trained epochs. However, there may be a saturation point where additional datasets do not significantly enhance the CNN's effectiveness. Currently, this saturation point has not been reached with the proposed CNN, even with 50,000 OPGs. The decision to use the 2216-epoch CNN was-based on its performance evaluation on a separate validation dataset, which demonstrated a balance between improved performance and the risk of overfitting. To enhance the robustness of the CNN, data augmentation techniques and different resolutions of the OPGs were employed during training.

This study highlights how using a CNN for age estimation can speed up the process of automated CV-based human identification. Further acceleration can be achieved by sorting the database entries for comparison with the postmortem OPG based on the absolute difference from the estimated age. This approach has the potential to successfully identify more than half (52.15%) of the OPGs tested in this study with a search radius of less than 2.5 years. Otherwise, the search radius will automatically expand until the entire database has been queried. In the case of a significant match (where the matching points are significantly higher than typically found between different individuals), the search can be concluded prematurely, saving additional measurement time.

This study has several limitations. The CNN, trained on antemortem OPGs, was tested on only three postmortem OPGs, emphasizing the need for additional investigations to ensure robust evidence due to the limited sample size. Another limitation is the utilization of datasets solely from a single hospital. This research explores the synergy between CV-based human identification and deep learning-based age estimation, revealing advantages for the former. While it does not claim to identify the best CNN for age estimation, it demonstrates the reliable handling of a broad range of OPGs, significantly reducing the duration of CV-based human identification. The study does not constitute research in the context of forensic age estimation or forensic human identification. No forensic expert knowledge is incorporated during the training of the CNN.

In conclusion, this study has shown that a robust CNN for age estimation using OPGs across a wide age range of 2 to 89 years can substantially reduce the signal processing time for automated CV-based human identification. This enables the effective handling of much larger CV databases and can be used in conjunction with other time-saving methods. Using more datasets improves the performance of a CNN for age estimation. The CNN has not yet reached the maximum benefit from an increasing number of training data, even with 50,000 OPGs. An individual CNN tailored for a specific task and trained from scratch can outperform transfer learning, especially when ample datasets are available.

Methods

All methods used in the study have been approved by the local ethics committee of the Jena University Hospital (registration number 2019-1505-MV) and were carried out in accordance with relevant guidelines and regulations. As this was a retrospective analysis of routine work, written informed consent was waived by the ethics committee.

CV-based human identification

Preprocessing OPGs and CV feature extraction

The original CV-based human identification method used Matlab and the CV algorithm Speeded Up Robust Features (SURF)^3,4. To enhance efficiency, the method was reimplemented in C + + with an embedded SQLite database. The new implementation maintains the same preprocessing steps for OPGs before CV feature extraction (refer to Fig. 1 A for the original OPG). First, the color depth was normalized to 8 bits and the margins were cropped to a final size of 180 mm × 100 mm. Then, eight Sobel filter masks (0°–315°, step size 45°) were used to enhance the edges in the OPG, and the edge strength was modulated with a parameter of 1.8. The maxima of the pixels from all eight Sobel gradient images were used to create a single image with emphasized edges in all directions. Afterwards, an averaging filter with a size of 6 was applied to reduce image noise (refer to Fig. 1 B for the ultimate preprocessed OPG). Finally, the CV algorithm AKAZE was employed with the following parameters: Octaves = 8, Layers = 5, Diffusivity = PM_G2, and Descriptor type = KAZE_UPRIGHT, for extracting CV features (refer to Fig. 1C for an example of extracted CV features).

Matching CV features with CV database

The matching process generates index pairs that indicate matches between the CV features of the unknown individual and a database entry, resulting in unique matching points. The RANdom SAmple Consensus (RANSAC) algorithm was employed for robustness by removing outliers. The number of remaining matching points indicated identification success. Matching was performed twice, first with the order "unknown individual—database entry" and then with the order "database entry—unknown individual". The average matching points was calculated from these two runs. In this study, CV-based human identification was considered successful if the result with the highest number of matching points corresponded to the searched individual (rank 1).

Evaluation CV-based human identification with integrated CNN for age estimation

The CNN for age estimation was integrated into the CV-based human identification software. A random OPG sample of 70 individuals (not used for CNN development process), divided into 7 age categories (10 individuals per category), was used for CV-based human identification with and without CNN-based age estimation. CV features from 105,251 antemortem OPGs of 56,008 individuals were extracted and stored in the CV database. The signal processing time for the CV-based human identification process and the success rate (best match = the searched individual, rank 1) were measured for age estimation error tolerances of ± 5, ± 10, and ± 15 years. The evaluation has been performed on a standard computer (Intel® i5-8500, 6 × 3 GHz CPU, 32 GB 2133 MHz RAM).

Development of the CNN for age estimation

Datasets

For the development and testing of the CNN for age estimation, 60,779 antemortem OPGs and three postmortem OPGs (actual ages 32, 51 and 61 years) were used (refer to Fig. 5), acquired consecutively from 33,045 individuals between 2006 and 2019. A fixation system was utilized for postmortem OPG acquisition, incorporating a mobile bedstead with a sterilizable wooden plate, a metal rail with a slide-on fixation structure, and adjustable elements for securing the body in a seated position, along with provisions for head and teeth support (further details and an illustration can be found in the reference³). OPGs that were improperly stored or obtained from anonymous individuals were excluded from the study. The OPGs had a color depth between 8 and 16 bits with different resolutions (in particular, 18,859 OPGs with 2948 × 1540 pixels, 10,731 OPGs with 2440 × 1280 pixels, 7267 OPGs with 2948 × 1552 pixels, 6476 OPGs with 1688 × 875 pixels and 2944 OPGs with 2648 × 1280 pixels). All OPGs were scaled to a size of 256 × 256 pixels.

CNN development

Several CNN architectures and hyperparameter configurations, including variations in batch size, learning rate, and dropouts, were systematically evaluated. The study focuses on the most promising CNN selected from these experiments. A dropout rate of 0.25, a batch size of 64, and a learning rate of 0.0001 were chosen as the optimal hyperparameters for the final CNN. The CNN was implemented in python using Keras library.

The final CNN architecture (https://github.com/AG-TraFo/CNN-age-estimation) consists of four blocks of three pairs of convolutional layers and batch normalization each followed by max pooling and dropout layers to extract image features and to avoid overfitting (refer to supplementary material Fig. S5). The number of filters in each blocks of convolutional layers ranges from 32, 64, 128 to 256, with a fixed filter size of 3 × 3. The flattened output from the convolutional layers is fed into a fully connected layer containing 1024 units, followed by batch normalization and dropout layers. For regression tasks, the final layer consists of a single output unit with linear activation. The CNN was compiled using the Adam optimizer and the mean squared error loss function. The MAE was computed as a metric during training.

CNN training and evaluation

The CNN was trained for 2500 epochs using a dataset of 50,000 OPGs with corresponding age labels from 2 to 89 years. The data was split into a training set (females/males 18,071/21,929 OPGs) and a validation set (females/males 4,531/5,469 OPGs) with a 80/20 ratio. After every 10 epochs, the training and validation data are thoroughly shuffled separately. In order to increase the diversity and number of training examples, data augmentation was applied using the ImageDataGenerator class in Keras. The datagen object was configured to perform several image transformations during training, including 0–1° rotation, 0–10% width and 0–20% height shifting, 0–20% zooming, and horizontal flipping. The fill_mode parameter was set to nearest, which specifies the strategy used to fill in any newly created pixels during the transformation.

For the age estimation evaluation, the CNN trained for 2216 epochs was used as it produced the most promising results. The CNN was applied to three postmortem OPGs and 10,779 OPGs (refer to Fig. 5). The results were statistically analyzed to investigate deviations and success rates in age categories. In addition, the influence of sex and whether an individual was already known in the training of the CNN were examined using a t-test with significance level of 0.05.

Further analyses

To assess the impact of numbers of training OPGs, the CNN underwent training for 100 epochs using datasets of 5,000, 20,000, 35,000, and 50,000 OPGs, respectively. In all sub-datasets, OPGs ranging from 2 to 89 years were included. The best CNN from each training set was then evaluated using the 10,779 OPGs in the test dataset.

Transfer learning with InceptionV3 was performed using 50,000 OPGs. The process involved a batch size of 64, a learning rate of 0.0001, and early stopping with patience set to 10. ImageNet weights were utilized during this phase, with the weights frozen. Additionally, the best CNN from this phase underwent further fine-tuning, during which the weights were unfrozen, using a learning rate of 0.00001.

Data availability

The datasets analyzed during the current study are not publicly available due to privacy concerns related to the large amount of patient images. However, these datasets are available from the corresponding author on reasonable request, subject to appropriate ethical and legal considerations. The final CNN architecture is accessible at: https://github.com/AG-TraFo/CNN-age-estimation.

References

INTERPOL. Disaster victim identification guide 2018. (Diakses).
Rötzscher, K. Forensische Zahnmedizin: Forensische Odonto-Stomatologie (Springer-Verlag, 2013).
Google Scholar
Heinrich, A., Güttler, F. V., Schenkl, S., Wagner, R. & Teichgräber, U. K. M. Automatic human identification based on dental X-ray radiographs using computer vision. Sci. Rep. 10, 3801. https://doi.org/10.1038/s41598-020-60817-6 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Heinrich, A. et al. Forensic odontology: Automatic identification of persons comparing antemortem and postmortem panoramic radiographs using computer vision. RoFo Fortschritte auf dem Gebiete der Rontgenstrahlen und der Nuklearmedizin 190, 1152–1158 (2018).
Article PubMed Google Scholar
Aliyev, R., Arslanoglu, E., Yasa, Y. & Oktay, A. B. Age Estimation from Pediatric Panoramic Dental Images with CNNs and LightGBM. 2022 Medical Technologies Congress (TIPTEKNO), 1–4 (2022).
Vila-Blanco, N., Carreira, M. J., Varas-Quintana, P., Balsa-Castro, C. & Tomas, I. Deep neural networks for chronological age estimation from OPG images. IEEE Trans. Med. Imaging 39, 2374–2384 (2020).
Article PubMed Google Scholar
Cular, L. et al. Dental age estimation from panoramic X-ray images using statistical models. In: Proc. 10th International Symposium on Image and Signal Processing and Analysis, 25–30 (2017).
Ko, J. et al. Dental panoramic radiography in age estimation for dental care using Dark-Net 19. J. Magn. 27, 485–491 (2022).
Article Google Scholar
Kim, S., Lee, Y.-H., Noh, Y.-K., Park, F. C. & Auh, Q. Age-group determination of living individuals using first molar images based on artificial intelligence. Sci. Rep. 11, 1–11 (2021).
ADS Google Scholar
Atas, I., Ozdemir, C., Atas, M. & Dogan, Y. Forensic dental age estimation using modified deep learning neural network. Preprint at http://arXiv.org/quant-ph/2208.09799 (2022).
Alkaabi, S., Yussof, S. & Al-Mulla, S. Evaluation of convolutional neural network based on dental images for age estimation. 2019 International Conference on Electrical and Computing Technologies and Applications (ICECTA), 1–5 (2019).
Pyle, S. I. & Greulich, W. W. Radiographic Atlas of Skeletal Development of the Hand and Wrist (Stanford University Press, 1959).
Google Scholar
Kreitner, K.-F., Schweden, F., Schild, H., Riepert, T. & Nafe, B. RöFo-Fortschritte auf dem Gebiet der Röntgenstrahlen und der bildgebenden Verfahren. (Georg Thieme Verlag Stuttgart, New York). 481–486.
Wittschieber, D., Schulz, R., Pfeiffer, H., Schmeling, A. & Schmidt, S. Systematic procedure for identifying the five main ossification stages of the medial clavicular epiphysis using computed tomography: A practical proposal for forensic age diagnostics. Int. J. Legal Med. 131, 217–224 (2017).
Article PubMed Google Scholar
Schmeling, A. et al. Empfehlungen für die Altersdiagnostik bei Lebenden im Strafverfahren. Anthropologischer Anzeiger, 87–91 (2001).
Schmeling, A. et al. Aktualisierte Empfehlungen der Arbeitsgemeinschaft für Forensische Altersdiagnostik für Altersschätzungen bei Lebenden im Strafverfahren. Rechtsmedizin 18, 451–453 (2008).
Article Google Scholar
Schmeling, A. et al. Criteria for age estimation in living individuals. Int. J. Legal Med. 122, 457–460 (2008).
Article CAS PubMed Google Scholar
Lockemann, U., Fuhrmann, A., Püschel, K., Schmeling, A. & Geserick, G. Arbeitsgemeinschaft für Forensische Altersdiagnostik der Deutschen Gesellschaft für Rechtsmedizin: Empfehlungen für die Altersdiagnostik bei Jugendlichen und jungen Erwachsenen außerhalb des Strafverfahrens. Rechtsmedizin 14, 123–126 (2004).
Article Google Scholar
Schmeling, A., Dettmeyer, R., Rudolf, E., Vieth, V. & Geserick, G. Forensische altersdiagnostik: Methoden, aussagesicherheit, rechtsfragen. Dtsch Arztebl Int. 113, 44–50 (2016).
PubMed PubMed Central Google Scholar
Ciconelle, A. C. M. et al. Deep learning for sex determination: Analyzing over 200,000 panoramic radiographs. J. Forens. Sci. https://doi.org/10.1111/1556-4029.15376
Engler, M. et al. Automatic classification and segmentation of dental panoramic radiographs using a mask regional convolutional neural network. Eur. Congress of Radiology-ECR (2022).
Kurniawan, A. et al. The applicable dental age estimation methods for children and adolescents in Indonesia. Int. J. Dentistry (2022).
Hueck, U. et al. Forensic postmortem computed tomography in suspected unnatural adult deaths. Eur. J. Radiol. 132, 109297 (2020).
Article CAS PubMed Google Scholar
Hubig, M. et al. Fully automatic CT-histogram-based fat estimation in dead bodies. Int. J. Legal Med. https://doi.org/10.1007/s00414-017-1757-5 (2018).
Article PubMed Google Scholar
Schenkl, S. et al. Automatic CT-based finite element model generation for temperature-based death time estimation: feasibility study and sensitivity analysis. Int. J. Legal Med. 131, 699–712 (2017).
Article PubMed Google Scholar
Heinrich, A., Schenkl, S., Buckreus, D., Güttler, F. V. & Teichgräber, U. K. M. CT-based thermometry with virtual monoenergetic images by dual-energy of fat, muscle and bone using FBP, iterative and deep learning–based reconstruction. Eur. Radiol. 32, 424–431. https://doi.org/10.1007/s00330-021-08206-z (2022).
Article CAS PubMed Google Scholar
Schenkl, S. et al. Quality measures for fully automatic CT histogram-based fat estimation on a corpse sample. Sci. Rep. 12, 20147 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Marroquin, T. et al. Age estimation in adults by dental imaging assessment systematic review. Forens. Sci. Int. 275, 203–211 (2017).
Article CAS Google Scholar
Panchbhai, A. Dental radiographic indicators, a key to age estimation. Dentomaxillofacial Radiol. 40, 199–212 (2011).
Article CAS Google Scholar
Black, S., Aggrawal, A. & Payne-James, J. Age Estimation in the Living: The Practitioner’s Guide (Wiley, 2011).
Google Scholar
Wallraff, S., Vesal, S., Syben, C., Lutz, R. & Maier, A. Bildverarbeitung für die Medizin 2021: Proceedings, German Workshop on Medical Image Computing, Regensburg, March 7–9, 2021 186–191 (Springer, 2021).
Book Google Scholar
De Donno, A., Angrisani, C., Mele, F., Introna, F. & Santoro, V. Dental age estimation: Demirjian’s versus the other methods in different populations A literature review. Med. Sci. Law 61, 125–129 (2021).
Article PubMed Google Scholar
Combe, L., Durande, M., Delanoë-Ayari, H. & Cochet-Escartin, O. Small hand-designed convolutional neural networks outperform transfer learning in automated cell shape detection in confluent tissues. Plos One 18, e0281931 (2023).
Article CAS PubMed PubMed Central Google Scholar
Palakodati, S. S. S., Chirra, V. R. R., Yakobu, D. & Bulla, S. Fresh and rotten fruits classification using CNN and transfer learning. Rev. d’Intelligence Artif. 34, 617–622 (2020).
Article Google Scholar
Yang, Y., Zha, K., Chen, Y., Wang, H. & Katabi, D. Delving into deep imbalanced regression. Int. Conf. Mach. Learn., 11842–11851 (2021).

Download references

Acknowledgements

Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—514572362.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Radiology, Jena University Hospital – Friedrich Schiller University, Am Klinikum 1, 07747, Jena, Germany
Andreas Heinrich

Authors

Andreas Heinrich
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.H. developed the method. A.H. wrote and reviewed the manuscript.

Corresponding author

Correspondence to Andreas Heinrich.

Ethics declarations

Competing interests

The author declares no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Heinrich, A. Accelerating computer vision-based human identification through the integration of deep learning-based age estimation from 2 to 89 years. Sci Rep 14, 4195 (2024). https://doi.org/10.1038/s41598-024-54877-1

Download citation

Received: 20 October 2023
Accepted: 17 February 2024
Published: 20 February 2024
DOI: https://doi.org/10.1038/s41598-024-54877-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Diagnostic accuracy of brain age prediction in a memory clinic population and comparison with clinically available volumetric measures

Personal identification with orthopantomography using simple convolutional neural networks: a preliminary study

Deep learning-based age estimation from chest X-rays indicates cardiovascular prognosis

Introduction

Results

CV-based human identification with CNN for age estimation

Evaluation the CNN for age estimation

Further analyses

Discussion

Methods

CV-based human identification

Preprocessing OPGs and CV feature extraction

Matching CV features with CV database

Evaluation CV-based human identification with integrated CNN for age estimation

Development of the CNN for age estimation

Datasets

CNN development

CNN training and evaluation

Further analyses

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Figures.

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links