Predicting attitudes toward ambiguity using natural language processing on free descriptions for open-ended question measurements

Hitsuwari, Jimpei; Okano, Hirohito; Nomura, Michio

doi:10.1038/s41598-024-59118-z

Download PDF

Article
Open access
Published: 09 April 2024

Predicting attitudes toward ambiguity using natural language processing on free descriptions for open-ended question measurements

Jimpei Hitsuwari^1,2,
Hirohito Okano¹ &
Michio Nomura¹

Scientific Reports volume 14, Article number: 8276 (2024) Cite this article

767 Accesses
4 Altmetric
Metrics details

Subjects

Abstract

Individual traits and reactions to ambiguity differ and are conceptualized in terms of an individual’s attitudes toward ambiguity or ambiguity tolerance. The development of natural language processing technology has made it possible to measure mental states and reactions through open-ended questions, rather than predefined numerical rating scales, which have traditionally been the dominant method in psychological research. This study presented three ambiguity-related situations and responses collected online from 591 participants in an open-ended format. After the analysis with bidirectional encoder representations from transformers, correlations were calculated using scores from the numerical evaluation by conventional questionnaire, and a significant moderate positive correlation was found. Therefore, this study found that attitudes toward ambiguity can be measured using an open-ended response method of reporting everyday life states. It is a novel methodology that can be expanded to other scales in psychology and can potentially be used in educational and clinical situations where participants can be asked to respond with minimal burden.

Testing theory of mind in large language models and humans

Article Open access 20 May 2024

Augmenting large language models with chemistry tools

Article Open access 08 May 2024

Identity and inequality misperceptions, demographic determinants and efficacy of corrective measures

Article Open access 29 May 2024

Introduction

The ambiguous situations faced in the volatility, uncertainty, complexity, and ambiguity (VUCA) era are diverse, with individual differences in attitudes toward these ambiguous situations¹. To measure individual differences, Lauriola et al.² developed the Multidimensional Attitude toward Ambiguity Scale (MAAS) based on the Ambiguity Tolerance Scale, which measures individuals’ tolerance degree toward ambiguous situations. This scale has been validated for construct validity and internal reliability². The MAAS is utilized globally, with Japanese³ and Swedish versions⁴ also being developed. It has been used in numerous behavioral experiments and psychological surveys^5,6.

However, responding to a predefined numerical rating scale is not necessarily the optimal method to capture complex mental states and personality traits (People do not usually answer or express their states and emotions on a yes or no or 1–7 point scale, and most often use natural language.; for review, see⁷). Considering the recent popularity of ChatGPT, the development of large language models has made it possible to measure psychological states based on natural language, which was quite challenging in the past. For example, in Kjell et al.’s study, participants had to answer the question, “Overall, in your life, are you satisfied or not?”⁸. They examined the correlation between the values calculated by bidirectional encoder representations from transformers (BERT), a large language model, and the scores of the Satisfaction with Life Scale (SWLS)⁹, which has been conventionally used to measure life satisfaction. The BERT regression model transforms the participant’s free text into a multidimensional vector and uses that vector representation to predict the individual’s questionnaire score. The results indicated r = 0.74, implying that life satisfaction can be accurately measured using open-ended responses. In another study¹⁰, BERT was used to predict the Big Five personality traits based on user comments and posts comprising fiction (e.g., short stories) in a novel-writing community on Reddit (a bulletin board social site). The results indicated an average performance of r = 0.33, suggesting that personality can be predicted using free text. The present study asked participants to respond to open-ended questions in three situations (see below in the Method section) involving ambiguity (from the MAAS subscale), and the obtained texts were analyzed. The study aimed to determine the extent to which the survey methods consisting of free-text and natural language processing (NLP) predicted ambiguity tolerance in comparison to conventional numerical scores. Additionally, this study examined whether the texts answered from the respective MAAS subscales could discriminate between the respective subscales answered with numerical values.

Methods

This study was approved by the Ethics Committee of the Graduate School of Education at Kyoto University (CPE-571) and conducted in accordance with relevant guidelines and regulations. We obtained informed consent from the study participants before their participation.

Participants

A total of 600 native English speakers of British nationality (the language used in most of the previous studies⁸ is English, so we targeted British nationals referring to those NLP studies) were recruited using an online survey platform Prolific (https://www.prolific.com/). Nine were excluded because of duplicate IP addresses, extremely short response times (less than 255 s), and attention-checking errors, resulting in 591 participants (M_age = 43.35, SD = 14.44, 325 males, 255 females, 11 others) for the final analyses. A question for the attention check (For this question, select “5. I mildly agree”) was added to MSTAT II (detailed in Procedure section) to exclude participants who selected anything other than the required answer. They were paid￡0.6 as a reward for their participation.

Procedure

The participants provided open-ended responses to three ambiguous situations. The three situations correspond to the three factors of the MAAS: “How do you typically react when you are uncertain about the responsibilities of a job? (Discomfort with Ambiguity; DA),” “How do you typically react when ambiguous words like ‘probably,’ ‘approximately,’ or ‘perhaps’ are used? (Absolutism; AB),” and “How do you typically react when you are in situations which can be interpreted in more than one way? (Need for Complexity and Novelty; NC)” The responses were required to have at least 100 characters (approximately 20 words), and at least 45 s had to pass before answering the next question. Subsequently, participants responded to a questionnaire containing the MAAS and the Multiple Stimulus Types Ambiguity Tolerance Scale-II (MSTAT-II)¹¹. The MSTAT-II is a general measure of ambiguity tolerance and was employed to determine whether it could predict this scale score from the three situations created from the MAAS (usually, in MAAS, the average of each subscale score is calculated but not the overall score). Finally, respondents’ demographic data (sex, age, nationality, and education) were collected. Descriptive statistics from the MAAS and MSTAT-II and examples of open-ended responses obtained from the three texts are presented in Table 1.

Table 1 MAAS and MSTAT descriptive statistics and examples of free-text responses from the three situations.

Full size table

Analysis

The model for predicting the questionnaire scores was developed by fine-tuning the pre-trained BERT-base-cased model (https://huggingface.co/bert-base-cased). Closed models like ChatGPT raise scientific reproducibility and ethical concerns, as the precise architecture and training data are not disclosed, and updates are made without revealing the differences⁷. Therefore, for this study, a more open model, BERT, was used. Regarding hyperparameter selection during fine-tuning and final model evaluation, five-fold nested cross-validation (nested CV) was used. The nested CV has a low bias in estimation accuracy¹² and is particularly effective for machine learning on small samples¹³. It allows obtaining an estimate of the model’s predictive accuracy, independent of the data used to build the model (see Supplementary Material for more information).

Results

The correlation coefficients between the BERT-predicted and true values of the questionnaire scores when using free-text responses to the three open-ended questions were calculated (Table 2 presents the medians; see Supplementary Table 1 for the minimum and maximum values). Results indicated that text NC (r = 0.38, p < 0.001) and the text combining all three texts (r = 0.41, p < 0.001) moderately predicted the MSTAT-II scores, which measure general ambiguity tolerance. Additionally, texts from the DA (r = 0.28, p = 0.002), AB (r = 0.23, p = 0.01), and NC (r = 0.19, p = 0.04) were weakly correlated with their respective MAAS subscale scores.

Table 2 Median correlation coefficient between each text and each questionnaire score.

Full size table

Discussion

The findings of this study are novel as they indicate that even free text can predict psychological states and traits^8,10 with regard to ambiguity.

Three questions were asked in this study; however, only one question from NC, “How do you typically react when you are in situations which can be interpreted in more than one way?” was moderately predictive. This question is more general than the other two questions and applies to various situations. This suggests that refining situation settings and how questions are asked may allow attitudes toward ambiguity to be measurable, even with only one open-ended response. The DA, AB, and NC texts showed weak but significant correlations with their respective scores. Future studies should consider making it possible to discriminate between subscales, for example, by devising how the questions are asked.

This survey method consisting of free-text and NLP will allow for the measuring of an individual’s personality in a more ecologically valid form; that is, an open-ended response method when expressing emotions and states in everyday life^8,10,14,15. In Kjell et al.’s study⁸, questions aimed to examine overall life satisfaction, such as “Overall, in your life, are you satisfied or not?”; however, in this study, the question was constructed by specifying the situation and asking the respondent to imagine the situation, where “it can be interpreted in more than one way.” This allows the use of open-ended surveys that measure not only abstract concepts, such as life satisfaction, but also other personality traits and psychological states that are more specific.

While moderate correlation coefficients were observed, aligning with previous studies¹⁰, there is scope for further improvement in correlation by employing alternative language models (e.g., RoBERTa), a topic of interest for future studies. Consistent with previous studies, the results of this study are limited to English-language data. However, given the translation of the scale into various languages, efforts will be made to globally predict its scores in open-ended surveys in the future study. Both the MAAS and MSTAT-II used in this study were self-reported, and future research can attempt to predict a behavior (e.g., decision-making in ambiguous situations) based on participants’ open-ended responses and BERT scores.

In conclusion, this study successfully predicted attitudes toward ambiguity by NLP of open-ended responses using BERT. Through the utilization of these technologies, complex human minds can be measured in a way that is natural to the participants, with little concern that the content of the questionnaire items will influence participants’ cognitions. Academically, as the scale is translated into other languages, attempts can be made to predict its scores in open-ended surveys globally to increase its accuracy and discrimination to apply it to social surveys, education, clinical situations, among other spheres.

Data availability

All data and script are available online (https://osf.io/jza53/?view_only=dbdc4b4c82f94410aed7e5ccbb22a98d).

References

Furnham, A. & Ribchester, T. Tolerance of ambiguity: A review of the concept, its measurement and applications. Curr. Psychol. 14, 179–199. https://doi.org/10.1007/BF02686907 (1995).
Article Google Scholar
Lauriola, M., Foschi, R., Mosca, O. & Weller, J. Attitude toward ambiguity: Empirically robust factors in self-report personality scales. Assessment 23(3), 353–373. https://doi.org/10.1177/1073191115577188 (2016).
Article PubMed Google Scholar
Hitsuwari, J. & Nomura, M. Developing and validating a Japanese version of the multidimensional attitude toward ambiguity scale (MAAS). Psychology 12, 477–497. https://doi.org/10.4236/psych.2021.124030 (2021).
Article Google Scholar
Forsberg, E., Nilsson, A. & Jørgensen, Ø. Moral dichotomization at the heart of prejudice: The role of moral foundations and intolerance of ambiguity in generalized prejudice. Soc. Psychol. Personal. Sci. 10, 1002–1010. https://doi.org/10.1177/1948550618817347 (2019).
Article Google Scholar
Hitsuwari, J. & Nomura, M. Ambiguity tolerance can improve through poetry appreciation and creation. J. Creat. Behav. 57(2), 178–185. https://doi.org/10.1002/jocb.574 (2023).
Article Google Scholar
Spinelli, C., Ibrahim, M. & Khoury, B. Cultivating ambiguity tolerance through mindfulness: An induction randomized controlled trial. Curr. Psychol. https://doi.org/10.1007/s12144-021-02597-4 (2022).
Article Google Scholar
Kjell, O. N., Kjell, K. & Schwartz, H. A. Beyond rating scales: With targeted evaluation, language models are poised for psychological assessment. Psychiatry Res. 333, 115667. https://doi.org/10.1016/j.psychres.2023.115667 (2023).
Article PubMed Google Scholar
Kjell, O. N., Sikström, S., Kjell, K. & Schwartz, H. A. Natural language analyzed with AI-based transformers predict traditional subjective well-being measures approaching the theoretical upper limits in accuracy. Sci. Rep. 12(1), 3918. https://doi.org/10.1038/s41598-022-07520-w (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Diener, E. Subjective well-being. Psychol. Bull. 95(3), 542–575. https://doi.org/10.1037/0033-2909.95.3.542 (1984).
Article CAS PubMed Google Scholar
Simchon, A., Sutton, A., Edwards, M. & Lewandowsky, S. Online reading habits can reveal personality traits: Towards detecting psychological microtargeting. PNAS Nexus 2(6), 1–9. https://doi.org/10.1093/pnasnexus/pgad191 (2023).
Article Google Scholar
McLain, D. L. Evidence of the properties of an ambiguity tolerance measure: The multiple stimulus types ambiguity tolerance scale-II (MSTAT-II). Psychol. Rep. 105, 975–988. https://doi.org/10.2466/PR0.105.3.975-988 (2009).
Article PubMed Google Scholar
Varma, S. & Simon, R. Bias in error estimation when using cross-validation for model selection. BMC Bioinform. 7(1), 1–8. https://doi.org/10.1186/1471-2105-7-91 (2006).
Article CAS Google Scholar
Vabalas, A., Gowen, E., Poliakoff, E. & Casson, A. J. Machine learning algorithm validation with a limited sample size. PLOS One 14(11), e0224365. https://doi.org/10.1371/journal.pone.0224365 (2019).
Article CAS PubMed PubMed Central Google Scholar
Okano, H., Kawahara, D. & Nomura, M. Language model BERT can estimate trait self-compassion from people's free texts with high accuracy. Open Science Framework. https://doi.org/10.31234/osf.io/9zfh7 (2024).
Sikström, S., Pålsson Höök, A. & Kjell, O. Precise language responses versus easy rating scales: Comparing respondents’ views with clinicians’ belief of the respondent’s views. PLOS One 18(2), e0267995. https://doi.org/10.1371/journal.pone.0267995 (2023).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by Grant-in-Aid for JSPS Fellows Grant Number 22KJ1813, Leave a Nest Grant Incube Prize, and 3rd academist Prize. We also thank Katarina Woodman (Kyoto University) for the English proofreading of the questionnaire. Finally, this manuscript was reviewed in English by Editage (http://www.editage.com/) with the support of Global Education Office, Graduate School of Education, Kyoto University.

Author information

Authors and Affiliations

Graduate School of Education, Kyoto University, Kyoto, Japan
Jimpei Hitsuwari, Hirohito Okano & Michio Nomura
Japan Society for the Promotion of Science, Tokyo, Japan
Jimpei Hitsuwari

Authors

Jimpei Hitsuwari
View author publications
You can also search for this author in PubMed Google Scholar
Hirohito Okano
View author publications
You can also search for this author in PubMed Google Scholar
Michio Nomura
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.H.: Conceptualization, Methodology, Software, Writing—Original Draft, Visualization, Funding. H.O.: Methodology, Formal analysis, Visualization, Writing—Review & Editing. M.N.: Writing—Review & Editing, Supervision.

Corresponding author

Correspondence to Michio Nomura.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hitsuwari, J., Okano, H. & Nomura, M. Predicting attitudes toward ambiguity using natural language processing on free descriptions for open-ended question measurements. Sci Rep 14, 8276 (2024). https://doi.org/10.1038/s41598-024-59118-z

Download citation

Received: 14 February 2024
Accepted: 08 April 2024
Published: 09 April 2024
DOI: https://doi.org/10.1038/s41598-024-59118-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.