Reply: Limitations in the creation of an automatic diagnosis tool for dysgraphia

Asselborn, Thibault; Gargot, Thomas; Kidziński, Łukasz; Johal, Wafa; Cohen, David; Jolly, Caroline; Dillenbourg, Pierre

doi:10.1038/s41746-019-0115-z

Download PDF

Correspondence
Open access
Published: 09 May 2019

Reply: Limitations in the creation of an automatic diagnosis tool for dysgraphia

npj Digital Medicine volume 2, Article number: 37 (2019) Cite this article

826 Accesses
2 Citations
1 Altmetric
Metrics details

Subjects

We thank Labyt et al. for their comments about methodological limitations of our paper.¹ We carefully revisited the paper and the data, taking into account the critical perspective of the authors. We attempted to further explain raisen issues. However, if things remain unclear, we are ready to perform more analysis and provide more details.

Concerning the tablet calibration: Ideally the same tablet could have been used for the data collection of the two datasets. As it was not the case, we quantified their difference in pressure acquisition. Following the advice of the tablet manufacturer (Wacom Co., Ltd), a pressure calibration was performed on the two tablets. A construction was made to position the pen vertically on the surface of the tablets with minimal friction. Fifteen different weights (called X) from 0 g (pen without load) to 400 g (saturation of both tablets)) were used as an input while the values returned by each tablets (called Y) were logged. We then extracted the relation X/Y for the two tablets which ended up being very similar (the Spearman correlation shows a correlation of 0.9915 (p = 5.32e-12), mean square error of 0.6). A 4th degree polynomial fit was created to model the function describing the X/Y relation of the first tablet and used on the input of the second in order to rectify its output. After this correction, the Spearman correlation was found to be 99.998% (p = 1.81e-21) and the mean squared error was 5.1e-3.

Concerning the in the in-air-time feature: As authors correctly spotted, we didn’t include correction to take care of the irregularities highlighted by Labyt et al. in the acquisition of the data of the D dataset. Note however, that this feature was not found to be important (on the basis of the Gini importance given by the random forest) as can be seen in Table 1. After applying the correction, we retrained our model and saw that the performance of our model is not affected since the specificity and sensibility remain the same.

Table 1 The most important features found by the Random Forest model as well as the In Air Time Ratio feature, using Gini importance as a metric

Full size table

Concerning the handwriting evolution according to the age: We are obviously aware that handwriting evolves with age. In the same way, handwriting evolves differently according to the gender of the child or if he/she is left-handed or right-handed. To account for these variabilities, we included these features (age, gender, and laterality) in the model. The model should then be more robust to these demographic features.

Concerning the recruitment limitations: The people in charge of giving the test to the children were also different, as well as the conditions in which the children passed the test. The protocol was very simple and standardized: children just had to write the BHK text on a blank sheet of paper located on top of a digital tablet for 5 min. In this situation, we believe it is fair to assume that the variability of examiners is negligible.

In addition, Labyt et al. argue that our D dataset may not be representative of the general dysgraphic population leading the model to be biased toward more severe cases of dysgraphia.

If we wanted a D dataset representative of the general dysgraphic population and thus free of bias, the dysgraphic children should have been recruited from the same schools where the tests were conducted for the TD dataset. Since only 5% of children present dysgraphia in the general population, we would have needed to recruit more than a thousand children to reach the number of dysgraphic children we have in our study. It is for this reason that we recruited the D dataset in special centers for this study.

In our future work, we aim to collect more data from schools and therapy centers in order to predict with higher accuracy for the whole spectrum of dysgraphic children.

Concerning the following issue: some of the TD children might be dysgraphic, our results are thus too good: We are also aware of this problem, and that is why we wrote the following sentence in the discussion section to show the current limitation of our model: Note that the inter-rater correlation in BHK is 0.89. Since our algorithm outperforms this value, we conclude that the algorithm learned to mimic the rater. These findings suggest that adding data from other raters should not only reduce bias, but also allow us to surpass the accuracy of each individual rater.

Besides, as mentioned in the paper, we are in the process of having all the 296 tests of TD children as well as the 56 dysgraphic children from the D dataset by three rater experts to be the most objective possible. This new annotated data will help us explore the performance of the model in its current stage. Secondly, it will allow us to retrain our model with strictly controlled D and TD datasets that will help the model to better detect dysgraphia. Following the argument of Labyt et al. that dysgraphic children from school present less severe dysgaphia compared to the ones from the Reference Center for Language and Learning Disorders of the Grenoble Hospital (Centre Referent des Troubles du Langage et des Apprentissages, CRTLA, Centre-Hospitalier-Universitaire Grenoble), adding handwriting of dysgraphic children recruited from school in our D dataset will help the model to reduce its bias towards severe dysgraphia.

We acknowledge that the presence of false positive in our classification of TD children could have been more developed in the discussion but we were constrained by the page-limit and chose to discuss other points.

Our new study on the annotated data will give more insights on this particular point.

To conclude, we refute the term of errors, since Labyt et al. are not showing errors in our paper but are questioning some of the approaches that we decided to follow. We believe that our work will be improved with the proper annotation of the D and TD dataset. Finally, we think that like any innovative new claim in medicine, it would be necessary to replicate the study. That is why we would like to see other scientists work on the same topic in order to gain better knowledge on graphomotor disabilities.

Data availability

As it is possible to identify participants from the data, ethical requirements do not permit us to share participant data from this study.

Code availability

Code is available from the corresponding author upon reasonable request.

Reference

Asselborn, T. et al. Automated human-level diagnosis of dysgraphia using a consumer tablet. npj Digital Med. 1, 42 (2018).
Article Google Scholar

Download references

Author information

Authors and Affiliations

CHILI lab, EPFL, Route Cantonale, 1015, Lausanne, Switzerland
Thibault Asselborn, Wafa Johal & Pierre Dillenbourg
Psychiatrie de l’Enfant et de l’Adolescent, Pitié Salpêtriére - Charles Foix, Assistance Publique Hôpitaux de Paris, 47/83 boulevard de l’Hôpital, 75013, Paris, France
Thomas Gargot & David Cohen
ISIR, Sorbonne Université, 4 Place Jussieu, 75005, Paris, France
Thomas Gargot
CHART Laboratory - EA 4004, TIM, Paris 8 University, 93526, Saint Denis, France
Thomas Gargot
Department of Bioengineering, Stanford University, 443 Via Ortega, Stanford, CA, 94305, USA
Łukasz Kidziński
LSRO Lab, EPFL, Route Cantonale, 1015, Lausanne, Switzerland
Wafa Johal
LPNC, Univ. Grenoble Alpes, F-38040, Grenoble - CNRS, LPNC UMR 5105, F-38040, Grenoble, France
Caroline Jolly

Authors

Thibault Asselborn
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Gargot
View author publications
You can also search for this author in PubMed Google Scholar
Łukasz Kidziński
View author publications
You can also search for this author in PubMed Google Scholar
Wafa Johal
View author publications
You can also search for this author in PubMed Google Scholar
David Cohen
View author publications
You can also search for this author in PubMed Google Scholar
Caroline Jolly
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Dillenbourg
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Guarantor: T.A. had full access to the study data and takes full responsibility for the integrity of the final decision to submit the manuscript. Data collection: C.J. Drafting of the manuscript: T.A. and T.G. Critical revision of the manuscript: All. Supervision: P.D.

Corresponding author

Correspondence to Thibault Asselborn.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Asselborn, T., Gargot, T., Kidziński, Ł. et al. Reply: Limitations in the creation of an automatic diagnosis tool for dysgraphia. npj Digit. Med. 2, 37 (2019). https://doi.org/10.1038/s41746-019-0115-z

Download citation

Received: 29 November 2018
Accepted: 18 April 2019
Published: 09 May 2019
DOI: https://doi.org/10.1038/s41746-019-0115-z

Reply: Limitations in the creation of an automatic diagnosis tool for dysgraphia

Subjects

Data availability

Code availability

Reference

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Search

Quick links

Subjects

Data availability

Code availability

Reference

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links