Abstract
Individual traits and reactions to ambiguity differ and are conceptualized in terms of an individual’s attitudes toward ambiguity or ambiguity tolerance. The development of natural language processing technology has made it possible to measure mental states and reactions through open-ended questions, rather than predefined numerical rating scales, which have traditionally been the dominant method in psychological research. This study presented three ambiguity-related situations and responses collected online from 591 participants in an open-ended format. After the analysis with bidirectional encoder representations from transformers, correlations were calculated using scores from the numerical evaluation by conventional questionnaire, and a significant moderate positive correlation was found. Therefore, this study found that attitudes toward ambiguity can be measured using an open-ended response method of reporting everyday life states. It is a novel methodology that can be expanded to other scales in psychology and can potentially be used in educational and clinical situations where participants can be asked to respond with minimal burden.
Similar content being viewed by others
Introduction
The ambiguous situations faced in the volatility, uncertainty, complexity, and ambiguity (VUCA) era are diverse, with individual differences in attitudes toward these ambiguous situations1. To measure individual differences, Lauriola et al.2 developed the Multidimensional Attitude toward Ambiguity Scale (MAAS) based on the Ambiguity Tolerance Scale, which measures individuals’ tolerance degree toward ambiguous situations. This scale has been validated for construct validity and internal reliability2. The MAAS is utilized globally, with Japanese3 and Swedish versions4 also being developed. It has been used in numerous behavioral experiments and psychological surveys5,6.
However, responding to a predefined numerical rating scale is not necessarily the optimal method to capture complex mental states and personality traits (People do not usually answer or express their states and emotions on a yes or no or 1–7 point scale, and most often use natural language.; for review, see7). Considering the recent popularity of ChatGPT, the development of large language models has made it possible to measure psychological states based on natural language, which was quite challenging in the past. For example, in Kjell et al.’s study, participants had to answer the question, “Overall, in your life, are you satisfied or not?”8. They examined the correlation between the values calculated by bidirectional encoder representations from transformers (BERT), a large language model, and the scores of the Satisfaction with Life Scale (SWLS)9, which has been conventionally used to measure life satisfaction. The BERT regression model transforms the participant’s free text into a multidimensional vector and uses that vector representation to predict the individual’s questionnaire score. The results indicated r = 0.74, implying that life satisfaction can be accurately measured using open-ended responses. In another study10, BERT was used to predict the Big Five personality traits based on user comments and posts comprising fiction (e.g., short stories) in a novel-writing community on Reddit (a bulletin board social site). The results indicated an average performance of r = 0.33, suggesting that personality can be predicted using free text. The present study asked participants to respond to open-ended questions in three situations (see below in the Method section) involving ambiguity (from the MAAS subscale), and the obtained texts were analyzed. The study aimed to determine the extent to which the survey methods consisting of free-text and natural language processing (NLP) predicted ambiguity tolerance in comparison to conventional numerical scores. Additionally, this study examined whether the texts answered from the respective MAAS subscales could discriminate between the respective subscales answered with numerical values.
Methods
This study was approved by the Ethics Committee of the Graduate School of Education at Kyoto University (CPE-571) and conducted in accordance with relevant guidelines and regulations. We obtained informed consent from the study participants before their participation.
Participants
A total of 600 native English speakers of British nationality (the language used in most of the previous studies8 is English, so we targeted British nationals referring to those NLP studies) were recruited using an online survey platform Prolific (https://www.prolific.com/). Nine were excluded because of duplicate IP addresses, extremely short response times (less than 255 s), and attention-checking errors, resulting in 591 participants (Mage = 43.35, SD = 14.44, 325 males, 255 females, 11 others) for the final analyses. A question for the attention check (For this question, select “5. I mildly agree”) was added to MSTAT II (detailed in Procedure section) to exclude participants who selected anything other than the required answer. They were paid£0.6 as a reward for their participation.
Procedure
The participants provided open-ended responses to three ambiguous situations. The three situations correspond to the three factors of the MAAS: “How do you typically react when you are uncertain about the responsibilities of a job? (Discomfort with Ambiguity; DA),” “How do you typically react when ambiguous words like ‘probably,’ ‘approximately,’ or ‘perhaps’ are used? (Absolutism; AB),” and “How do you typically react when you are in situations which can be interpreted in more than one way? (Need for Complexity and Novelty; NC)” The responses were required to have at least 100 characters (approximately 20 words), and at least 45 s had to pass before answering the next question. Subsequently, participants responded to a questionnaire containing the MAAS and the Multiple Stimulus Types Ambiguity Tolerance Scale-II (MSTAT-II)11. The MSTAT-II is a general measure of ambiguity tolerance and was employed to determine whether it could predict this scale score from the three situations created from the MAAS (usually, in MAAS, the average of each subscale score is calculated but not the overall score). Finally, respondents’ demographic data (sex, age, nationality, and education) were collected. Descriptive statistics from the MAAS and MSTAT-II and examples of open-ended responses obtained from the three texts are presented in Table 1.
Analysis
The model for predicting the questionnaire scores was developed by fine-tuning the pre-trained BERT-base-cased model (https://huggingface.co/bert-base-cased). Closed models like ChatGPT raise scientific reproducibility and ethical concerns, as the precise architecture and training data are not disclosed, and updates are made without revealing the differences7. Therefore, for this study, a more open model, BERT, was used. Regarding hyperparameter selection during fine-tuning and final model evaluation, five-fold nested cross-validation (nested CV) was used. The nested CV has a low bias in estimation accuracy12 and is particularly effective for machine learning on small samples13. It allows obtaining an estimate of the model’s predictive accuracy, independent of the data used to build the model (see Supplementary Material for more information).
Results
The correlation coefficients between the BERT-predicted and true values of the questionnaire scores when using free-text responses to the three open-ended questions were calculated (Table 2 presents the medians; see Supplementary Table 1 for the minimum and maximum values). Results indicated that text NC (r = 0.38, p < 0.001) and the text combining all three texts (r = 0.41, p < 0.001) moderately predicted the MSTAT-II scores, which measure general ambiguity tolerance. Additionally, texts from the DA (r = 0.28, p = 0.002), AB (r = 0.23, p = 0.01), and NC (r = 0.19, p = 0.04) were weakly correlated with their respective MAAS subscale scores.
Discussion
The findings of this study are novel as they indicate that even free text can predict psychological states and traits8,10 with regard to ambiguity.
Three questions were asked in this study; however, only one question from NC, “How do you typically react when you are in situations which can be interpreted in more than one way?” was moderately predictive. This question is more general than the other two questions and applies to various situations. This suggests that refining situation settings and how questions are asked may allow attitudes toward ambiguity to be measurable, even with only one open-ended response. The DA, AB, and NC texts showed weak but significant correlations with their respective scores. Future studies should consider making it possible to discriminate between subscales, for example, by devising how the questions are asked.
This survey method consisting of free-text and NLP will allow for the measuring of an individual’s personality in a more ecologically valid form; that is, an open-ended response method when expressing emotions and states in everyday life8,10,14,15. In Kjell et al.’s study8, questions aimed to examine overall life satisfaction, such as “Overall, in your life, are you satisfied or not?”; however, in this study, the question was constructed by specifying the situation and asking the respondent to imagine the situation, where “it can be interpreted in more than one way.” This allows the use of open-ended surveys that measure not only abstract concepts, such as life satisfaction, but also other personality traits and psychological states that are more specific.
While moderate correlation coefficients were observed, aligning with previous studies10, there is scope for further improvement in correlation by employing alternative language models (e.g., RoBERTa), a topic of interest for future studies. Consistent with previous studies, the results of this study are limited to English-language data. However, given the translation of the scale into various languages, efforts will be made to globally predict its scores in open-ended surveys in the future study. Both the MAAS and MSTAT-II used in this study were self-reported, and future research can attempt to predict a behavior (e.g., decision-making in ambiguous situations) based on participants’ open-ended responses and BERT scores.
In conclusion, this study successfully predicted attitudes toward ambiguity by NLP of open-ended responses using BERT. Through the utilization of these technologies, complex human minds can be measured in a way that is natural to the participants, with little concern that the content of the questionnaire items will influence participants’ cognitions. Academically, as the scale is translated into other languages, attempts can be made to predict its scores in open-ended surveys globally to increase its accuracy and discrimination to apply it to social surveys, education, clinical situations, among other spheres.
Data availability
All data and script are available online (https://osf.io/jza53/?view_only=dbdc4b4c82f94410aed7e5ccbb22a98d).
References
Furnham, A. & Ribchester, T. Tolerance of ambiguity: A review of the concept, its measurement and applications. Curr. Psychol. 14, 179–199. https://doi.org/10.1007/BF02686907 (1995).
Lauriola, M., Foschi, R., Mosca, O. & Weller, J. Attitude toward ambiguity: Empirically robust factors in self-report personality scales. Assessment 23(3), 353–373. https://doi.org/10.1177/1073191115577188 (2016).
Hitsuwari, J. & Nomura, M. Developing and validating a Japanese version of the multidimensional attitude toward ambiguity scale (MAAS). Psychology 12, 477–497. https://doi.org/10.4236/psych.2021.124030 (2021).
Forsberg, E., Nilsson, A. & Jørgensen, Ø. Moral dichotomization at the heart of prejudice: The role of moral foundations and intolerance of ambiguity in generalized prejudice. Soc. Psychol. Personal. Sci. 10, 1002–1010. https://doi.org/10.1177/1948550618817347 (2019).
Hitsuwari, J. & Nomura, M. Ambiguity tolerance can improve through poetry appreciation and creation. J. Creat. Behav. 57(2), 178–185. https://doi.org/10.1002/jocb.574 (2023).
Spinelli, C., Ibrahim, M. & Khoury, B. Cultivating ambiguity tolerance through mindfulness: An induction randomized controlled trial. Curr. Psychol. https://doi.org/10.1007/s12144-021-02597-4 (2022).
Kjell, O. N., Kjell, K. & Schwartz, H. A. Beyond rating scales: With targeted evaluation, language models are poised for psychological assessment. Psychiatry Res. 333, 115667. https://doi.org/10.1016/j.psychres.2023.115667 (2023).
Kjell, O. N., Sikström, S., Kjell, K. & Schwartz, H. A. Natural language analyzed with AI-based transformers predict traditional subjective well-being measures approaching the theoretical upper limits in accuracy. Sci. Rep. 12(1), 3918. https://doi.org/10.1038/s41598-022-07520-w (2022).
Diener, E. Subjective well-being. Psychol. Bull. 95(3), 542–575. https://doi.org/10.1037/0033-2909.95.3.542 (1984).
Simchon, A., Sutton, A., Edwards, M. & Lewandowsky, S. Online reading habits can reveal personality traits: Towards detecting psychological microtargeting. PNAS Nexus 2(6), 1–9. https://doi.org/10.1093/pnasnexus/pgad191 (2023).
McLain, D. L. Evidence of the properties of an ambiguity tolerance measure: The multiple stimulus types ambiguity tolerance scale-II (MSTAT-II). Psychol. Rep. 105, 975–988. https://doi.org/10.2466/PR0.105.3.975-988 (2009).
Varma, S. & Simon, R. Bias in error estimation when using cross-validation for model selection. BMC Bioinform. 7(1), 1–8. https://doi.org/10.1186/1471-2105-7-91 (2006).
Vabalas, A., Gowen, E., Poliakoff, E. & Casson, A. J. Machine learning algorithm validation with a limited sample size. PLOS One 14(11), e0224365. https://doi.org/10.1371/journal.pone.0224365 (2019).
Okano, H., Kawahara, D. & Nomura, M. Language model BERT can estimate trait self-compassion from people's free texts with high accuracy. Open Science Framework. https://doi.org/10.31234/osf.io/9zfh7 (2024).
Sikström, S., Pålsson Höök, A. & Kjell, O. Precise language responses versus easy rating scales: Comparing respondents’ views with clinicians’ belief of the respondent’s views. PLOS One 18(2), e0267995. https://doi.org/10.1371/journal.pone.0267995 (2023).
Acknowledgements
This work was supported by Grant-in-Aid for JSPS Fellows Grant Number 22KJ1813, Leave a Nest Grant Incube Prize, and 3rd academist Prize. We also thank Katarina Woodman (Kyoto University) for the English proofreading of the questionnaire. Finally, this manuscript was reviewed in English by Editage (http://www.editage.com/) with the support of Global Education Office, Graduate School of Education, Kyoto University.
Author information
Authors and Affiliations
Contributions
J.H.: Conceptualization, Methodology, Software, Writing—Original Draft, Visualization, Funding. H.O.: Methodology, Formal analysis, Visualization, Writing—Review & Editing. M.N.: Writing—Review & Editing, Supervision.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hitsuwari, J., Okano, H. & Nomura, M. Predicting attitudes toward ambiguity using natural language processing on free descriptions for open-ended question measurements. Sci Rep 14, 8276 (2024). https://doi.org/10.1038/s41598-024-59118-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-59118-z
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.