Abstract
As science and technology rapidly progress, it becomes increasingly important to understand how individuals comprehend expository technical texts that explain these advances. This study examined differences in individual readers’ technical comprehension performance and differences among texts, using functional brain imaging to measure regional brain activity while students read passages on technical topics and then took a comprehension test. Better comprehension of the technical passages was related to higher activation in regions of the left inferior frontal gyrus, left superior parietal lobe, bilateral dorsolateral prefrontal cortex, and bilateral hippocampus. These areas are associated with the construction of a mental model of the passage and with the integration of new and prior knowledge in memory. Poorer comprehension of the passages was related to greater activation of the ventromedial prefrontal cortex and the precuneus, areas involved in autobiographical and episodic memory retrieval. More comprehensible passages elicited more brain activation associated with establishing links among different types of information in the text and activation associated with establishing conceptual coherence within the text representation. These findings converge with previous behavioral research in their implications for teaching technical learners to become better comprehenders and for improving the structure of instructional texts, to facilitate scientific and technological comprehension.
Similar content being viewed by others
Introduction
Learning technical information from text in a technological world is a central cognitive skill that is taught in schools, tested by educational institutions and industry employers, and remains a crucial ability for a growing number of occupations as well as for everyday life. Reading comprehension skills are engaged every time we read a user’s manual, a Wikipedia article, or a technical handbook. Neurocognitive research approaches using functional brain imaging have the potential of substantially increasing the understanding of the processes underlying technical reading comprehension, and concomitantly, revealing possible opportunities for enhancing this process.
This study used brain imaging to address two central sets of questions regarding the processes necessary for comprehending technical information from a text. First, is it possible to identify the neuropsychological processes that distinguish good and poor comprehenders? In what way are some people better able than others to grasp from their reading how a technological device works? What are they doing differently than poor comprehenders? Although there is a long history of psychological and educational research approaches to these questions1,2,3,4,5,6, as described below, there is much less converging evidence from neuroscience research. The answers to these questions derived from brain imaging may converge with behavioral research in suggesting comprehension strategies that can be taught to learners to enhance their comprehension of novel technical material. It also may add new insight into the underlying processes used by better comprehenders of technical material, suggesting additional comprehension strategies.
Second, what conceptual and structural aspects of technical texts are associated with better or poorer comprehension? Are there certain properties shared by more comprehensible texts that facilitate effective comprehension? Do the processes used to successfully acquire and integrate new knowledge reveal how writers should compose and structure a technical passage to foster comprehension and memory for novel technical material? Again, previous behavioral research has addressed these questions5,7,8,9,10, but far less is known about how the brain responds to the properties of technical texts. Exploring the answers to these questions from a neurocognitive approach may provide additional insight into how texts can be better engineered to improve the comprehensibility and retention of new technical information.
The present study addressed these questions by acquiring brain activation data (fMRI) as participants, varying in comprehension skills, read short technical expository passages for comprehension. The choice of materials and experimental design was intended to make the task ecologically valid, resembling a real-world comprehension task, with the goal of enhancing the understanding of how new technical knowledge is acquired from text in the context of a realistic comprehension situation. The fMRI data were acquired during the reading of the passages and the goal of the study was to develop an understanding of how the neurocognitive processes that occur during reading are related to comprehending the text, as assessed by a multiple-choice comprehension test. The comprehension of individual participants and the comprehensibility of individual technical passages were used to identify brain areas whose activity underlies comprehension performance.
There is increasing recognition that focusing on brain localization of individual component processes of reading comprehension may fail to explain the complexity of real-world text comprehension11, and similar concerns have been raised about purely behavioral studies of expository text comprehension12,13. Many previous findings are based on experimental studies designed to isolate a single linguistic process while controlling the effects of others14,15. This type of functional decomposition may obscure the effects of other processes that can make comprehension the outcome of a more dynamic and interactive network of neural processing. This concern becomes especially salient when considering individual differences in comprehension performance. Recent neuroimaging studies of text comprehension have used naturalistic designs that attempt to investigate the collective neural signature of the participating processes16,17,18. The present study used realistic technical passages to be learned in a conventional reading comprehension task.
Several behavioral studies have investigated the specific processes that are engaged by the distinguishing properties of technical texts6,9,13,19. Expository technical texts typically describe a set of related concepts and their co-functioning in various events20,21. Compared to narrative texts, technical texts often contain less familiar vocabulary, are less likely to reflect everyday experiences, and use more complex linguistic structures, properties that can make them more difficult to comprehend and learn4. In addition, technical texts typically require fewer inferences to be made19, and their comprehension may depend more heavily on previous knowledge22. Only a few previous neuroimaging studies have investigated the neural basis of expository text comprehension23,24,25,26,27,28, despite its relevance for STEM education and job performance. The present study addresses this gap between behavioral and neuroimaging research in understanding expository text comprehension.
Reading comprehension depends on a large number of component processes, and the efficiency with which each one of them is executed can vary across individual readers. At the single-word and single-sentence levels, these processes include decoding of word forms, retrieval and selection of word meanings, syntactic processing of multi-word constituents, construction of propositions and assignment of case (or pragmatic) roles, establishment of cohesion among propositions, and the maintenance of the propositions in working memory. At the discourse level, additional processes are engaged, including the establishment of global coherence across sentences, maintenance of distantly related propositions in working memory, retrieval of semantic knowledge structures from long-term memory, and the integration of retrieved knowledge with information from the text. Particularly relevant for technical comprehension is the construction of a mental model or situation model of the referents of the text and their interrelations2,29. The mental model is constructed with the aid of the reader’s existing world knowledge to enable various types of inferences (e.g., bridging, predictive, causal, elaborative) that complete the unspoken parts of the text6.
Individual comprehenders may differ in their proficiency in any of these processes30,31,32,33. In addition, they may differ in the background knowledge they bring to the comprehension task, and as one would expect, prior knowledge in a domain affects how proficiently new knowledge in that domain is acquired34,35. Furthermore, proficiency in the underlying processes interacts with both prior knowledge and the structure of the text in determining comprehension performance7,8,36. The present study assessed which of the underlying processes, identified in regional patterns of brain activation, accounted for differences among individuals in their ultimate reading comprehension performance.
Neuroimaging studies have indicated a set of underlying brain regions that become activated by the engagement of text comprehension processes. Lexical retrieval and semantic knowledge representation involve an extensive network including the left-lateralized parietal, temporal, and inferior frontal lobe areas that make up the core language network, and they also involve bilateral activity in the ventral temporal cortex, the anterior temporal lobes, the hippocampus, and the ventromedial and ventrolateral prefrontal cortex37,38,39. Reading comprehension of extended discourse also depends on a brain network that carries out conceptual and semantic integration, which overlaps with semantic processing and representation regions and also includes lateral areas of the prefrontal cortex in both hemispheres40,41.
The construction of a mental model includes using previous knowledge drawn from semantic memory (knowledge of words, facts, and concepts not associated with a specific learning experience) and episodic or autobiographical memory (knowledge of past experiences). Semantic memory storage is distributed across the neocortex in a number of brain regions specialized for particular types of information17,39,42,43, and the retrieval of semantic knowledge and its integration with new or inconsistent prior knowledge is associated with lateral prefrontal regions44,45. Episodic memory storage is associated with the medial temporal lobe and hippocampus46 and episodic retrieval is associated with medial prefrontal regions47,48,49.
In sum, there is extensive literature associating individual brain regions and networks of regions with particular text comprehension processes. This literature provides the basis for inferring which processes are occurring given the observed brain activity during the reading of a particular text by a particular participant. These associations are based on a large literature of multiple studies in which there is activation in particular regions during the execution of a task assumed to engage a given process. This literature, such as the compendium provided by the Neurosynth database50 can be leveraged to clarify which processes are occurring by providing posterior probabilities of the likelihood that a process is associated with activation in a particular area51,52. In our study, the degree of engagement of a given process (assessed by its associated activation level in a given region) during the reading of a technical passage was used to predict the degree of mastery of the passage content.
In this study, the fMRI brain activity of 31 college-age participants of varying comprehension skills was acquired while they read a set of 16 technical passages for comprehension three times, followed by a multiple-choice comprehension test. Behavioral research has demonstrated that repetition of spoken passages increases memory performance, and also results in qualitative shifts in processing5,53,54, and brain imaging studies have indicated that there are corresponding related changes in regional brain activity over repeated hearing or reading of the same passage25,55.
The brain activation observed during the reading was used to predict how well individual participants would perform on the comprehension test and to infer which of the underlying processes distinguished the better comprehenders from the poorer comprehenders. (We refer to the prediction of comprehension performance from brain activity in the statistical sense because a direct causal link from neural activity to cognitive performance is not attainable in fMRI studies.) Similarly, brain activation during reading was used to indicate which passages would be more difficult to comprehend than others and to infer which of the underlying processes were modulated by passage difficulty.
Results
Summary
The brain activation patterns acquired during participants’ reading of the technical passages were systematically predictive of two types of outcomes.
First, regional brain activation during the reading of the technical passages was related to performance on a post-scan comprehension test. The brain regions whose activation best predicted comprehension of the technical content of the passages were strongly associated with several main types of processes: verbal working memory supporting syntactic and semantic language comprehension, retrieval of prior knowledge, spatial aspects of mental model construction, and building new semantic knowledge structures.
A second finding was that the regional brain activation during the reading of the technical passages was also related to how easy or hard the passage was to comprehend. The modulation of passage comprehension difficulty was related to the activation in brain regions associated with establishing links among different types of knowledge representations; typically these different types of knowledge representations are distributed across the cortex. The modulation was also related to activation in regions associated with establishing conceptual coherence.
Brain regions and processes related to individual participant’s comprehension
A set of regions associated with language processing and memory, spatial imagery, episodic encoding, and semantic knowledge retrieval and reorganization was more highly activated in readers who displayed better comprehension of the technical passages. Positive correlations were found between technical text comprehension and activation in language processing areas (the pars opercularis and pars triangularis portions of the left inferior frontal gyrus (L IFG)); in spatial processing areas (the left superior parietal lobule (SPL)); in a semantic integration area (left dorsolateral prefrontal cortex (L DLPFC); and in areas involved in the encoding and consolidation of new declarative knowledge (left and right hippocampal areas). The areas showing a positive relationship between activation and comprehension of the passages are shown in red in Fig. 1. The peak t-values and MNI coordinates for clusters of voxels showing a difference in correlation with comprehension from zero and MNI coordinates are presented in Table 1.
Other brain areas showed a negative relation between comprehension of the technical passages and activation during the reading of the passages (greater activation was associated with poorer comprehending of participants). These areas are associated with the retrieval of information from episodic or autobiographical memory (left superior frontal gyrus (L SFG), left and right ventromedial prefrontal cortex (L and R VMPFC), and the right precuneus (R Precuneus)), and areas associated with the integration of newly learned knowledge with information retrieved from episodic or autobiographical memory (left and right medial prefrontal cortex). These areas are shown in blue in Fig. 1 and are presented at the bottom of Table 1.
The activation in regions showing a positive or negative relationship between activation and comprehension performance was remarkably similar across the three presentations. Regions positively related to comprehension in all three readings of the sentences included the left inferior frontal gyrus (L IFG), the left superior parietal lobule (L SPL), the left middle frontal gyrus (L MFG), and areas involved in the encoding and consolidation of new declarative knowledge (left and right hippocampal areas and right precuneus). Supplementary Fig. 1 in the online Supplementary Information shows the regions related to comprehension separately for each presentation.
Multiple regression modeling of the relation between activation and individual participant’s comprehension performance
A stepwise regression indicated that activation levels in four regions were particularly related to individual participants’ comprehension performance: left ventromedial prefrontal cortex (L VMPFC), left inferior frontal gyrus (L IFG), right dorsolateral prefrontal cortex (R DLPFC), and the right hippocampus (R HC). These were also the most influential regions as measured by their standardized regression weights (ßs) in a model including all regions as predictors. The brain activation levels of each participant (averaged over the three readings of the passage) in each of the 12 regions described above were entered as the independent variables in the stepwise regression model to evaluate their relationship to the mean comprehension performance of each participant (averaged over the 16 passages). The selected regions were the first four independent variables that entered the model (thus, a forward selection process, with the minimum Schwartz Bayesian Criterion dictating when to stop adding variables). The resulting model was free from serious multicollinearity (variance inflation < 1.5 for each of the selected variables, with the minimum tolerance among them being 0.74). As the direction of correlations shown in Fig. 1 and in Table 1 suggest, only the left ventromedial prefrontal cortex (L VMPFC) had a negative regression weight. Interestingly, it was also the most influential of the four variables selected by the stepwise procedure (ß = −0.49); the negative regression weight indicates that participants who performed more poorly on the comprehension test tended to have higher activation in this area. The remaining three predictive regions had positive regression weights. The better comprehenders had higher levels of activity in L IFG, R DLPFC, and R HC. The fit of this model using activation in these four regions as independent variables was reliable (F(4,26) = 20.43, MSE = 0.007, p < 0.00001) and resulted in an R2 of .76 (adjusted R2 = 0.72, Fig. 2).
Cross-validation methods that trained the models on a subset of the data and tested them on the unseen subset established good predictive generalizability of the model. The results of the stepwise analysis reported above indicated that individual participant comprehension performance was reliably associated with brain activation in the selected regions from the full dataset including all three readings of the sentences. This cross-validation procedure establishes predictive validity across independent datasets, beyond showing predictiveness across correlated datasets, as the full stepwise model above does. Cross-validation was carried out by using the data from two presentations (averaged over two readings of the full set of passages) to select ROIs and train a regression model. Comprehension performance in the third, left-out presentation was then predicted using the regression weights from the training model. This cross-validation was repeated using each pair of presentations for training and each left-out presentation for testing. The ROIs selected for training in each cross-validation fold were again based on the correlation of activation and comprehension but were calculated separately for each pair of training presentations. Each of these cross-validated models demonstrated a reliable relationship between activation in the selected ROIs and comprehension performance (R2 = 0.47, (F(4,26) = 5.67, MSE = 0.017, p = 0.00203), R2 = 0.49 (F(4,26) = 6.18, MSE = 0.016, p = 0.00123), and R2 = 0.49 (F(4,26) = 6.23, MSE = 0.016, p = 0.00114), for the three possible leave-one-out folds). The fit of the average of the three cross-validation models is presented in Fig. 3.
Brain regions and processes related to individual passage comprehension difficulty
The comprehensibility of each of the 16 passages was related to the levels of brain activity in 13 brain regions whose voxelwise activation was reliably (p < 0.05) correlated with the mean comprehension level (t-tests comparing the values to zero on these voxelwise maps of the correlation coefficients for the 31 participants identified in these 13 regions, as shown in Fig. 4 and Table 2).
The activation levels in these 13 brain regions were entered as predictor variables in a stepwise regression model that identified four regions that best explained the variance in passage comprehensibility based on the minimum Schwartz Bayesian Criterion: the pars opercularis of the left inferior frontal gyrus (L IFG), the right temporal pole (R TP), the left inferior parietal lobule, and the medial anterior cingulate/dorsomedial prefrontal cortex (M ACC/DMPFC). This reduced model, based on the regions whose activation was correlated with comprehension, straightforwardly predicted the mean comprehension of each passage (R2 = 0.88 (Adj. R2 = 0.84), F(4,11) = 20.02, MSE = 0.003, p = 0.00005), as shown in Fig. 5. (The cross-validated model with predictive validity is reported below). The four selected variables entered the stepwise model and stayed without removal (a forward regression model). Two of the four selected regions were also the most influential as measured by their standardized regression weights in the full 13-ROI model (the M ACC/DMPFC ß = −0.99, L IFG ß = 0.74), and there were no collinearity problems with the four selected predictors (all variance inflation estimates < 2.0, all tolerances > 0.5). These regions predictive of the difficulty of comprehending individual passages are involved in lexical access and serve as a semantic hub that binds together semantic knowledge of different types37,38,56,57.
The ability to predict comprehension difficulty at the passage level from activation was cross-validated using the data from each pair of presentations (readings of the set of passages) to select ROIs and to train a regression model. As with the prediction of comprehension at the participant level reported above, data from the remaining presentation was used to test the generalizability of the model. (The presentation-specific regions of activity are shown in Supplementary Fig. 2.) Each of these cross-validation models demonstrated a reliable relationship between activation in the selected ROIs and comprehensibility (R2 = 0.79, (F(4,11) = 10.37, MSE = 0.005, p = 0.00099), R2 = 0.73 (F(4,11) = 7.60, MSE = 0.006, p = 0.00344), and R2 = 0.76 (F(4,11) = 8.57, MSE = 0.007, p = 0.00215), for the three possible leave-one-out folds). The mean predicted comprehensibility of each passage across the three training/test combinations resulted in an R2 of 0.44 (F(1,14) = 11.08, MSE = 0.01, p = 0.004970) (see Supplementary Fig. 3 for a plot of the cross-validated model fit.).
Predicting individual differences in comprehension on the basis of psychometrically assessed cognitive ability
An interesting question is whether the brain activity measures were superior to behavioral measures of cognitive ability in predicting individuals’ reading comprehension performance. (Supplementary Table 1 presents the correlations among all individual differences measures.) The Nelson–Denny Reading Comprehension measure58, which is very similar to the target comprehension test and hence is not an independent predictor, was unsurprisingly significantly correlated with technical passage comprehension (R2 = 0.40, F(1,29) = 19.37, MSE = 0.017, p = 0.000133). Other psychometric measures were also mildly predictive of comprehension performance. The Raven Progressive Matrices Test59, a measure of fluid intelligence, was reliably related to comprehension (R2 = 0.30, F(1,29) = 12.18, MSE = 0.022, p = 0.001565). The Reading Span Test60, a measure of working memory capacity for language comprehension, was also reliably related (R2 = 0.22, F(1,29) = 8.39, MSE = 0.022, p = 0.00711). Also mildly related was the Bennett Mechanical Comprehension Test61, a measure of mechanical aptitude testing spatial visualization and knowledge of basic physical and mechanical laws and their application in solving mechanical problems (R2 = 0.18, F(1,29) = 6.50, MSE = 0.023, p = 0.016336). A multiple regression model using the latter three psychometric measures to predict comprehension resulted in a multiple R2 of 0.36 (F(3,27) = 4.96, MSE = 0.019, p = 0.00188). By comparison, as reported above, the brain activation measures in four brain regions (L IFG, L IPL, M ACC/DLPFC, and R HC) predicted comprehension with a multiple R2 of 0.76. To quantitatively compare which type of measure was more predictive of comprehension, one set of measures (psychometric or brain imaging) was included in an initial model and the alternative set of measures was added to that model to test whether the improvement in prediction was reliable. The resulting analysis indicated that the addition of the psychometric measures to the brain imaging measures did not result in a significant improvement in prediction (increment in R2 = 0.048, F(3, 24) = 2.09, MSE = 0.019, p = 0.12813). By contrast, the addition of the brain imaging measures to the psychometric measures resulted in a reliable improvement in prediction (increment in R2 = 0.47, F(4, 24) = 14.99, MSE = 0.007, p < 0.00001). Thus, including the brain imaging measures provides additional ability to predict comprehension in individual participants, and also provides additional insight into the psychological processes underlying the reading comprehension performance.
Predicting passage difficulty from text properties
The prediction of passage-level comprehensibility based on brain activation measures was superior to the prediction based on readability measures of the texts. Coh-Metrix natural language processing procedures62 were used to describe each text on five measures of readability (Narrativity, Syntactic Simplicity, Word Concreteness, Referential Cohesion, and Deep Cohesion). (The inter-correlation of these measures is presented in Supplementary Table 2.) A stepwise regression using these predictors of individual passage comprehensibility indicated that only Syntactic Simplicity (which predicted better comprehension) approached significance in explaining the variance in passage comprehensibility (R2 = 0.24, F(1,14) = 4.45, MSE = 0.0001, p = 0.053386).
Although the Coh-Metrix measure of Deep Cohesion was not reliably related to passage difficulty, it was related to activation in two regions. As an alternative approach to explaining how features of the texts might influence comprehension processes, the Coh-Metrix measures were used to predict brain activation in each of the regions selected by the main stepwise regression relating activation to comprehensibility (L IFG L IPL, R TP, and M ACC/DMPFC). The stepwise regression predicting activation in L IFG found that only the measure of Deep Cohesion met the minimum Schwartz Bayesian Criterion (R2 = 0.45, F(1,14) = 11.51, MSE = 0.0001, p < 0.00438). The stepwise model for L IPL also selected only the measure of Deep Cohesion (R2 = 0.21, F(1,14) = 3.67, MSE = 0.0003, p = 0.076051). For the remaining two regions, stepwise regression models found no relationship between any of the six Coh-Metrix measures of text features and brain activation. Even though Deep Cohesion did not directly predict passage-level comprehensibility, it is possible that activation in L IFG and L IPL mediates the relationship between cohesion and comprehensibility. In both regions, more cohesive passages evoked greater activation, and greater activation in these regions was related to better comprehension of the passages.
Predicting passage difficulty from familiarity with the content
Rated familiarity with the passage topics was only moderately related to comprehensibility. The passage topics were rated as relatively unfamiliar by the participants (mean rating = 2.83 (SD = 0.88) on a 7-point scale where 7 indicated “very familiar”). The mean of the correlations between a participant’s familiarity rating for a topic (based on having seen only the topic title) and their comprehension scores on the passage on that topic was modest (mean Fisher’s z-transformed r = 0.24, SD = 0.28). Note that each individual passage had a limited number of possible performance levels, based on responses to four multiple-choice test questions, each with four alternatives. This may limit the information these correlations can provide. Despite the weak correlation, this mean across participants was nevertheless reliably greater than zero (t(30) = 4.65, p < 0.000005).
Changes in activation with repetitions of the passages
As expected, there were changes in activation over the course of the three readings of the passages, but they were not the simple monotonic changes one might predict if processes became more or less active with each additional exposure. These changes were evaluated using whole-brain paired t-tests among the three pairs of passage repetitions across all participants. The results indicated that overall activation increased between the first and second reading of the passages in the bilateral occipito-temporal cortex, in the left superior temporal sulcus and anterior middle temporal gyrus, and in the right temporal pole (see Supplementary Fig. 3a). These same areas decreased in activation from the second to third reading of the passages (see Supplementary Fig. 3b). In the right inferior frontal gyrus and the medial superior frontal gyrus, activation increased by the third reading. This inverted U-shaped function of activation in temporal language areas and the corresponding increase in lateral and medial frontal areas may reflect a shift from semantic to executive processing with repeated presentations.
Discussion
This study produced two central findings concerning the processes involved in the comprehension of technical information from a text, namely: (1) it identified the neural and psychological processes that are related to individual differences in the comprehension of technical texts, and (2) it identified the neural and psychological processes that are related to more comprehensible versus less comprehensible passages.
Individuals with higher comprehension showed more activity during reading in the left inferior frontal lobe, the left superior parietal lobe, the bilateral dorsolateral prefrontal cortex, and the bilateral hippocampus. Processes subserved by these regions are consistent with the construction of mental models that incorporate spatial imagery and integrate prior semantic knowledge with new information. By contrast, participants with poorer comprehension had higher activation in the right inferior parietal lobe and the left and right ventromedial prefrontal cortex. Poorer comprehenders engaged in more processing of the meanings of individual words and more processing of episodic or autobiographical knowledge.
The comprehensibility of individual technical passages was associated with the difficulty of accessing the meaning elements of the concepts in the text and integrating the concepts with each other, a difficulty best predicted by the brain activity in anterior left temporal lobe areas, consistent with previous research on the role of these regions37,38. Better comprehended passages were also associated with more activity in an area of the right temporal lobe that is involved in establishing discourse level coherence across propositions, which is also consistent with previous research63,64,65,66.
The new insights into the neural and cognitive processes that underlie successful expository comprehension converge with prior behavioral research in their practical implications. These implications include suggestions for the teaching of particular comprehension strategies and for improving the writing of such texts to facilitate comprehension, as discussed in more detail in a later section.
Among the better comprehenders, there was more activation in the left inferior frontal gyrus, associated with phonological, semantic, syntactic, and propositional maintenance processes67 that constitute verbal working memory for language60,68. Individual differences in working memory capacity have long been known to predict language comprehension at multiple linguistic levels, including discourse-level comprehension63,69. The findings here show that more activation in this region during the reading of technical texts (enabling more maintenance of the text representation in an activated state) helps predict individual differences in comprehension performance.
Brain regions responsible for spatial processing and spatial imagery (SPL and IPS) were more active among better comprehenders, suggesting that their spatial cognition facilitated comprehension70. This was coupled with greater activation in lateral prefrontal semantic structure-building areas, suggesting that spatial imagery was used to construct an integrated mental model71. This is consistent with behavioral evidence that readers incorporate such spatial imagery into their situation models72,73. Spatial mental models may also serve as a medium for “mental animation” that can represent the causal and dynamic properties of a system25. The passages used here generally contained highly visuospatial conceptual content and the greater activation of the better comprehenders most likely reflects greater visualization of the content.
During the reading of the passages, better comprehenders showed greater activation in the bilateral anterior dorsolateral prefrontal cortex and bilateral hippocampus, whereas poorer comprehenders showed higher activation in the medial ventrolateral prefrontal cortex and the precuneus. The activation pattern in better comprehenders is associated with the encoding and integration of new semantic information with retrieved semantic knowledge to construct new knowledge structures44,45. By contrast, the activation in poorer comprehenders in more medial prefrontal areas is associated with relating new information to retrieved episodic memories or autobiographical knowledge47,48,49. Numerous psychological and educational studies have firmly established the role of prior knowledge in reading comprehension performance7,8,30,35,74,75,76. The implication of the present findings is that the comprehension of technical information is better when it is related to previous semantic knowledge than when it is related to previous episodic knowledge.
In summary, during the reading of expository texts, greater activation of regions associated with verbal working memory, spatial visualization, mental model construction, and the semantic integration of new and retrieved knowledge was observed in the better comprehenders.
Activation in frontal and temporal lobes that was related to individual passage difficulty reflects processes that integrate different types of semantic information from different cortical areas56,57,77. The right temporal pole’s activation in response to the more difficult passages may result from such texts containing information inconsistent with world knowledge. This region has previously been shown to activate during the comprehension of narrative texts that contain details that are inconsistent with a global theme78. In contrast, the right posterior temporal lobe activated more for passages that were better comprehended, consistent with its role in drawing inferences that are required to build a coherent situation model63,64,65,66,78. The current findings indicate that passage difficulty is modulated by processes that bring together information from anatomically distributed component representations and that bind together different types of information. The fMRI technology allows the underlying neural processes that occur during the actual reading to be observed and measured and subsequently related to the resulting comprehension performance.
The findings that better technical comprehension involves activation in regions associated with language and verbal working memory, spatial mental model construction, and knowledge integration support the teaching of several strategies that might improve comprehension of technical passages. Less familiar concepts could be introduced and explained prior to their mention in a text. This is consistent with educational research showing that skilled comprehenders have more immediate access to word meaning, facilitating the integration of individual words into a mental model30 and that teaching the meanings of unfamiliar concepts in the texts ahead of time improves text comprehension79. In addition, explicitly teaching visualization strategies may be an effective way to improve the performance of poorer comprehenders of technical material. Behavioral research has demonstrated that spatial ability is related to success in science, technology, engineering, and math (STEM) fields80, and that computer-based instruction in spatial visualization strategies improves performance in reading comprehension81. Explicitly teaching individuals to make the type of bridging and causal inferences that are necessary for building a situation model may also enhance the comprehension of technical passages. One of the most effective ways of facilitating such inference generation is teaching self-explanation27,82,83,84.
The finding that activation in areas associated with semantic access and integration was related to the difficulty of individual passages suggests that technical texts can be improved by making them more cohesive. A robust finding in the psychological and educational literature is that text cohesion facilitates coherence and improves comprehension of discourse8,9,36,62,85. Although the present study did not find reliable correlations between passage comprehension and measures of passage cohesion, the materials were not designed to vary greatly in cohesion. Nevertheless, the fMRI data seem to be sensitive to this aspect of the structure of the passages, as evidenced by the reliable correlations between a measure of deep cohesion and activation. This activation was predictive of comprehension and may serve as a mediating individual difference variable indicative of the interaction between the text and the brain in determining comprehension. Explicit cohesive devices that can be incorporated across sentences and paragraphs include overlap of arguments and clear indications of causal, temporal, and spatial relations that are not overtly stated.
This study represents an advance in understanding the sources of individual differences in comprehending complex technical information. The brain activation data showed that better comprehenders engage more of three key cognitive processes: they keep more verbal information temporarily active in memory while reading, they are more likely to imagine the spatial relationships among the objects described, and they relate new information with previously learned world knowledge. These findings suggest that teaching comprehension strategies that explicitly engage these processes could improve technical comprehension among students who are poorer at this task. In addition, passages that are harder to comprehend inadequately interrelate different types of information, evoking additional integrative and inferential processing. Instructional texts that make such relations more explicit may facilitate comprehension and learning.
As society becomes increasingly dependent on technical learning from text, the needs for new instructional approaches arise. Functional neuroimaging research can inform the quest for such meta-cognitive instructional approaches. The research on reading comprehension is just one example of the potential impact of this approach.
Methods
Participants
Thirty-one right-handed, native English speakers between the ages of 18 and 35 (25 females and 6 males) from the Pittsburgh area provided usable data in the fMRI scanning task. These participants were selected from a pool of 265 individuals who participated in an online reading comprehension test to identify participants at the extremes of this distribution of reading comprehension performance. Eleven participants were excluded from the fMRI analyses due to excessive head motion (8), because they fell asleep during the scan (2), or were found to have an anatomical brain abnormality (1). All participants gave signed informed consent approved by the Carnegie Mellon Institutional Review Board.
Materials
Twenty-four passages (presented in Supplementary Table 3) were developed that described either mechanical devices or general knowledge topics. Each passage consisted of five sentences with a mean passage length of 132 words. The topics of the 16 passages used in the fMRI study were Bilge Pump, LiDAR, Refrigeration System, Automatic External Defibrillator, Screw Propeller, Sonar, 3D Printer, Aircraft Carrier Catapult, Bacteria, Acoustics and Cochlear Implants, Fever, Tumors Oncology Cancer, Photography, Intellectual Property, Beverages, and Mechanical Engineering of Robots. The eight remaining passages were used for pretesting of participants and familiarization with the reading comprehension task. The topics of the pretest passages were Boiler, Pressure Safety Valve, Turbine, Landing Gear Door Latching System, Automatic Direction Finder, N95 Masks, The Clock’s Timekeeper, and Electronic Circuits.
Procedures
Pretest screening session
Prior to participation in the fMRI session, participants first completed two reading comprehension tasks in a one-hour interactive online Zoom session to identify those who were particularly good or particularly poor at technical comprehension. The participants read two iterations of the set of eight screening passages, presented as a 17-min video over Zoom. The presentation of screening passages followed the same procedures as the fMRI task, as described below. The fMRI study enrolled only participants with either high or low reading comprehension scores (High = 88–100% correct; Low = 41–72%) in the pretest. Following the comprehension test, the participants completed the Nelson–Denny Reading Comprehension Test58 as an additional assessment of reading comprehension skills.
Experimental session
Prior to the fMRI scanning, familiarity ratings on a 1–7-point scale for each of the 16 passage topics were acquired (1 being not familiar at all, and 7 being very familiar), along with a Handedness Questionnaire86.
Following the presentation of one warmup passage (on the topic of Wheelchairs), the set of 16 test passages was presented three times (in different random orders) by first displaying the passage title for 1.5 s, followed by a fixation point for 0.5 s. The passage was then displayed using a moving window paradigm that presented one phrase (consisting of 1–4 words) at a time. The segmentation into phrases attempted to respect syntactic boundaries. Phrases longer than 4 words were presented in two successive cumulating segments. The displayed phrase disappeared when the next phrase was presented. The moving window moved from left to right and down successive lines in the locations where the text would normally occur. To present the text at a comfortable reading rate, the presentation duration parameters were based on an approximate average modulation of gaze durations by word length and frequency during text reading87, plus an intercept of 300 ms. The phrase presentation duration was 300 ms + 16 ms per character + (400 ms–(31.26*log (word frequency of least frequent word))). There was a pause of 4 s after each of the first four sentences of a passage during which the fixation point (an asterisk) was displayed, providing time to process the most recent sentence as well as integrate the new information with the previous sentences. This echoes the natural pause observed at the ends of sentences87. After the final sentence, an ‘X’ was displayed for 6.5 s to mark the end of the passage.
During the first two presentations of the passages, the stems of two (of the ultimate 4) questions (but not the response alternatives) were presented following each passage, to orient the participants to the type of information that would be interrogated (see Supplementary Table 4). The same questions were presented in the first two presentations. The two questions were displayed simultaneously for 5–10 s (the precise duration being dependent on their character length). Following the presentation of the two questions, an ‘X’ fixation point, 3.5 s in duration, was presented before the next passage began.
After completing the comprehension task, 29 of the participants also performed a recall task in the scanner whose analysis is not reported here because of its marginal contribution to the goals of the main study.
In the post-scan comprehension test, comprehension was assessed with four multiple-choice questions (each having four possible response alternatives) regarding each of the 16 passages. The stems of two of the four questions per passage were presented in conjunction with the first and second presentations of the passages (see Supplementary Table 4). The other two questions had not been previously seen by the participants. Participants also completed the following psychometric tests after the scan: The Reading Span Test60, Raven’s Standard Progressive Matrices88, and the Bennett Mechanical Comprehension Test (abbreviated version)61.
fMRI acquisition
Functional images were acquired on a Siemens Prisma (Erlangen, Germany) 3.0 T scanner at the Brain Imaging Data Generation & Education (BRIDGE) center, jointly operated by Carnegie Mellon University and the University of Pittsburgh. A multiband slice-accelerated BOLD spin-echo echo planar acquisition sequence89 was used to acquire 40, 3-mm-thick slices with no gap in each TR (1000 ms) using a multiband factor of 2, an AC-PC slice orientation covering all of the cerebral cortex, a TE of 25 ms, a tip angle of 64°, a 192 × 192 mm field of view and a 64 × 64 in-slice matrix. SPM software90 was used to correct head motion in the images and normalize them to the Montreal Neurological Institute template91.
Data analysis
The mean percentage signal change (MPSC) relative to the fixation condition was computed at each gray matter voxel for each image of each stimulus presentation. The main measure of activation evoked by a passage consisted of the voxel activation levels acquired for each sentence around the peak of the hemodynamic BOLD response and averaged across the five sentences of the passage. The data acquisition interval for each sentence included four brain images acquired once per second (i.e. a TR of 1000) within a 4-s window, offset 5 s from the stimulus onset (i.e. images 5–8). This window was not dependent on the duration of the sentence presentation but rather was intended to capture the peak of the BOLD response to the sentence.
Identification of regions responsive to individual differences in comprehension
Cortical areas that responded differentially across participants during reading were identified by statistical parameter maps of voxelwise correlations between MPSC (averaged over all passages) and comprehension performance on the post-scan test. Voxelwise one-sample t-tests identified clusters of positive and negative correlations with comprehension. These analyses were performed in SPM using a height threshold of p < 0.05 and a cluster extent threshold of 10 voxels. A set of 12 regions of interest (ROIs) met these criteria for region selection (Fig. 1 and Table 1).
This ROI selection procedure was also performed separately for each of the three presentations (i.e. the three repeated readings of the set of 16 passages), as well as for each possible pair of presentations, to provide independent ROIs for uncontaminated cross-validation of the regression model.
Predictive modeling of individual differences in comprehension
The MPSC data (averaged over the three presentations) from the correlation-based ROIs were entered into a stepwise regression model as independent variables, with the participants’ mean comprehension scores (averaged over the four comprehension questions) for each passage as the dependent measure. The purpose of the stepwise analysis was to select from this full set of regions those that together contributed most to predicting comprehension performance. The stepwise selection procedure used a p < 0.15 criterion for entry and a p < 0.15 criterion for retention in the final model, which used the minimum Schwartz Bayesian Criterion for stopping. Multi-collinearity was assessed with variance inflation and tolerance measures at each step. To evaluate the fit of the resulting model (which included only MPSC data from the stepwise-selected ROIs), the predicted comprehension values from this model were compared to the observed comprehension values, resulting in an R2 value quantifying the variance in comprehension explained by the activation in the selected ROIs.
Cross-validation of the prediction of participant-level comprehension from participant-level activation was performed by training three separate models on the MPSC data from each possible pair of presentations of the passages, and by testing the resulting model by applying the obtained regression weights to the averaged MPSC data in the corresponding ROIs in the remaining, left-out presentation. ROIs in each training set were selected as the cluster of comprehension-correlated voxels whose peak was closest to that of the selected ROIs from the descriptive stepwise model above. The data from the test presentation was extracted from these training-based ROIs rather than those defined in the test presentations. The predicted comprehension performance was averaged for each participant over the three folds of the cross-validated models and was compared to the observed comprehension performance for that participant, resulting in a cross-validated R2 measure of the fit.
Identification of regions and predictive modeling of the comprehension of individual passages
Regions of interest (ROIs) for modeling the comprehensibility of individual passages were selected on the basis of statistical parameter maps of the voxelwise correlation between the mean comprehension of an individual passage (averaged over all participants) and the MPSC in each voxel. A one-sample t-test identified regions that were reliably positively or negatively related to comprehensibility over the 16 passages (threshold p < 0.05, minimum cluster size = 10). Thirteen ROIs met these criteria (Fig. 4 and Table 2).
This same ROI selection procedure was also performed separately for each of the three presentations (i.e. repeated readings of the set of 16 passages) to provide independent ROIs for uncontaminated cross-validation of the prediction of individual passage comprehension.
Multiple regression prediction of mean comprehension performance of each passage
Mean comprehensibility of individual passages was predicted using the same regression methods and criteria as those used for predicting individual participant comprehension. Stepwise selection was used to identify regions in which the MPSC significantly contributed to the prediction of the comprehensibility of individual passages, and the fit of the model was evaluated by the R2 value quantifying the variance in observed comprehensibility explained by the passage comprehensibility predicted by activation in the selected ROIs.
As with the prediction of participant-level comprehension performance, cross-validation of the ability of activation to predict passage-level comprehensibility involved fitting a separate model using the MPSC in the independently selected ROIs for each possible pair of presentations to predict comprehension performance and applying the resulting regression weights to the averaged activation in corresponding ROIs in the test presentation. The predicted comprehension performance was averaged for each individual passage over the three folds and was compared to the observed comprehension performance for that passage, resulting in a cross-validated R2 measure of the fit.
Assessing changes in activation across presentations
Correlation maps for each presentation based on participant-level and passage-level comprehension means were examined to qualitatively judge the consistency of the voxelwise relationships between comprehension and activation over the three presentations. To quantify changes in activation across presentations, the overall MPSC (the overall voxelwise mean signal during fixation subtracted from that during that passage reading) was calculated for each participant. Second-level voxelwise paired t-tests were then performed on this MPSC data for the 31 participants contrasting each pair of presentations. The resulting statistical parameter maps were thresholded (p < 0.05, minimum cluster size = 10), to identify regions that increased or decreased in overall comprehension activation relative to the baseline for each pair of presentations.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
All data are available by request directed to the corresponding author. There are no limits on data sharing.
Code availability
All codes used for the analyses are available by request directed to the corresponding author.
References
Kintsch, W. & van Dijk, T. A. Toward a model of text comprehension and production. Psychol. Rev. 85, 363–394 (1978).
van Dijk, T. A. & Kintsch, W. Strategies of Discourse Comprehension (Academic Press, New York, 1983).
Lesgold, A. M. & Perfetti, C. A. Interactive processes in reading comprehension. Discourse Process. 1, 323–336 (1978).
Mar, R. A., Li, J., Nguyen, A. T. P. & Ta, C. P. Memory and comprehension of narrative versus expository texts: a meta-analysis. Psychon. Bull. Rev. 28, 732–749 (2021).
Millis, K. K. & Simon, S. Resource allocation during the rereading of scientific texts. Mem. Cogn. 26, 232–246 (1998).
Singer, M. & O’Connell, G. Robust inference processes in expository text comprehension. Eur. J. Cogn. Psychol. 15, 607–631 (2003).
Roller, C. M. Commentary: the interaction between knowledge and structure variables in the processing of expository prose. Read. Res. Q. 25, 79–89 (1990).
McNamara, D. S. & Kintsch, W. Learning from texts: effects of prior knowledge and text coherence. Discourse Process. 22, 247–288 (1996).
Follmer, J. D. & Sperling, R. A. Interactions between reader and text: contributions of cognitive processes, strategy use, and text cohesion to comprehension of expository science text. Learn. Individ. Differ. 67, 177–187 (2018).
Hebert, M., Bohaty, J. J., Nelson, J. R. & Brown, J. The effects of text structure instruction on expository reading comprehension: a meta-analysis. J. Educ. Psychol. 108, 609–629 (2016).
Hasson, U., Egidi, G., Marelli, M. & Willems, R. M. Grounding the neurobiology of language in first principles: the necessity of non-language-centric explanations for language comprehension. Cognition 180, 135–157 (2018).
Gauche, G. & Pfeiffer Flores, E. The role of inferences in reading comprehension: a critical analysis. Theory Psychol. 32, 326–343 (2022).
Ramsay, C. M. et al. A comparison of the effects of students' expository text comprehension strategies. Instr. Sci. 38, 551–570 (2010).
Ferstl, E. C. Inferences during text comprehension: what neuroscience can (or cannot) contribute. In Inferences During Reading (eds Cook, A. E., O’Brien, E. J. & Lorch, R. F. Jr.) 230–259 (Cambridge University Press, Cambridge, 2015).
Ferstl, E. C. Text comprehension. In The Oxford Handbook of Psycholinguistics (eds Rueschemeyer, S. -A. & Gaskell, M. G.) 197–216 (Oxford University Press, Oxford, UK, 2018).
Popham, S. F. et al. Visual and linguistic semantic representations are aligned at the border of human visual cortex. Nat. Neurosci. 24, 1628–1636 (2021).
Huth, A. G., Nishimoto, S., Vu, A. T. & Gallant, J. L. A continuous semantic space describes the representation of thousands of object and action categories across the human brain. Neuron 76, 1210–1224 (2012).
Wehbe, L. et al. Simultaneously uncovering the patterns of brain regions involved in different story reading subprocesses. PLoS ONE 9, 1–19 (2014).
Clinton, V. et al. Inferential comprehension differences between narrative and expository texts: a systematic review and meta-analysis. Read. Writ. 33, 2223–2248 (2020).
Roehling, J. V., Hebert, M., Nelson, J. R. & Bohaty, J. J. Text structure strategies for improving expository reading comprehension. Read. Teach. 71, 71–82 (2017).
Lorch, R. F. What about expository text? In Inferences During Reading (eds Cook, A. E., O’Brien, E. J. & Lorch, R. F. Jr.) 348–361 (Cambridge University Press, 2015).
Wolfe, M. B. W. & Woodwyk, J. M. Processing and memory of information presented in narrative or expository texts. Br. J. Educ. Psychol. 80, 341–362 (2010).
Swett, K. et al. Comprehending expository texts: the dynamic neurobiological correlates of building a coherent text representation. Front. Hum. Neurosci. 7, 853 (2013).
Ozernov-Palchik, O. et al. Distinct neural substrates of individual differences in components of reading comprehension in adults with or without dyslexia. Neuroimage 226, 117570 (2021).
Mason, R. A. & Just, M. A. Physics instruction induces changes in neural knowledge representation during successive stages of learning. Neuroimage 111, 36–48 (2015).
van Moort, M. L., Jolles, D. D., Koornneef, A. & van den Broek, P. What you read versus what you know: neural correlates of accessing context information and background knowledge in constructing a mental representation during reading. J. Exp. Psychol. Gen. 149, 2084–2101 (2020).
Moss, J., Schunn, C. D., Schneider, W., McNamara, D. S. & VanLehn, K. The neural correlates of strategic reading comprehension: cognitive control and discourse comprehension. Neuroimage 58, 675–686 (2011).
Hsu, C., Clariana, R., Schloss, B. & Li, P. Neurocognitive signatures of naturalistic reading of scientific texts: a fixation-related fMRI study. Sci. Rep. 9, 10678 (2019).
Van Den Broek, P. Using texts in science education: cognitive processes and knowledge representation. Science (1979) 328, 453–456 (2010).
Perfetti, C. & Stafura, J. Word knowledge in a theory of reading comprehension. Sci. Stud. Read. 18, 22–37 (2014).
Prat, C. S. & Yamasaki, B. L. The cognitive and neural correlates of individual differences in inferential processes. In Inferences During Reading (eds O’Brien, E. J., Cook, A. E. & Lorch, R. F.) 210–229 (Cambridge University Press, Cambridge, 2015).
Freed, E. M., Hamilton, S. T. & Long, D. L. Comprehension in proficient readers: the nature of individual variation. J. Mem. Lang. 97, 135–153 (2017).
Goring, S. A., Schmank, C. J., Kane, M. J. & Conway, A. R. A. Psychometric models of individual differences in reading comprehension: a reanalysis of Freed, Hamilton, and Long (2017). J. Mem. Lang. 119, 104221 (2021).
Chiesi, H. L. et al. Acquisition of domain-related information in relation to high and low domain knowledge. J. Verbal Learn. Verbal Behav. 18, 257–273 (1979).
Smith, R., Snow, P., Serry, T. & Hammond, L. The role of background knowledge in reading comprehension: a critical review. Read. Psychol. 42, 214–240 (2021).
McNamara, D. S., Kintsch, E., Songer, N. B. & Kintsch, W. Are good texts always better? Interactions of text coherence, background knowledge, and levels of understanding in learning from text. Cogn. Instr. 14, 1–43 (1996).
Lambon Ralph, M. A., Sage, K., Jones, R. W. & Mayberry, E. J. Coherent concepts are computed in the anterior temporal lobes. Proc. Natl Acad. Sci. USA 107, 2717–2722 (2010).
Visser, M., Jefferies, E. & Lambon Ralph, M. A. Semantic processing in the anterior temporal lobes: a meta-analysis of the functional neuroimaging literature. J. Cogn. Neurosci. 22, 1083–1094 (2010).
Binder, J. R., Desai, R. H., Graves, W. W. & Conant, L. L. Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies. Cereb. Cortex 19, 2767–2796 (2009).
Braga, R. M., Dinicola, L. M., Becker, H. C. & Buckner, R. L. Situating the left-lateralized language network in the broader organization of multiple specialized large-scale distributed networks. J. Neurophysiol. 124, 1415–1448 (2020).
Ferstl, E. C., Neumann, J., Bogler, C. & Von Cramon, D. Y. The extended language network: a meta-analysis of neuroimaging studies on text comprehension. Hum. Brain Mapp. 29, 581–593 (2008).
Huth, A. G., De Heer, W. A., Griffiths, T. L., Theunissen, F. E. & Gallant, J. L. Natural speech reveals the semantic maps that tile human cerebral cortex. Nature 532, 453–458 (2016).
Binder, J. R. et al. Toward a brain-based componential semantic representation. Cogn. Neuropsychol. 33, 130–174 (2016).
Badre, D., Poldrack, R. A., Paré-Blagoev, E. J., Insler, R. Z. & Wagner, A. D. Dissociable controlled retrieval and generalized selection mechanisms in ventrolateral prefrontal cortex. Neuron 47, 907–918 (2005).
Brod, G., Werkle-Bergner, M. & Lee Shing, Y. The influence of prior knowledge on memory: a developmental cognitive neuroscience perspective. Front. Behav. Neurosci. 7, 139 (2013).
Rolls, E. T. The hippocampus, ventromedial prefrontal cortex, and episodic and semantic memory. Prog. Neurobiol. 217, 102334 (2022).
Brod, G., Lindenberger, U., Werkle-Bergner, M. & Shing, Y. L. Differences in the neural signature of remembering schema-congruent and schema-incongruent events. Neuroimage 117, 358–366 (2015).
Cabeza, R. & St Jacques, P. Functional neuroimaging of autobiographical memory. Trends Cogn. Sci. 11, 219–227 (2007).
Hasson, U., Nusbaum, H. C. & Small, S. L. Brain networks subserving the extraction of sentence information and its encoding to memory. Cereb. Cortex 17, 2899–2913 (2007).
Yarkoni, T., Poldrack, R. A., Nichols, T. E., Van Essen, D. C. & Wager, T. D. Large-scale automated synthesis of human functional neuroimaging data. Nat. Methods 8, 655–670 (2011).
Poldrack, R. A. Inferring mental states from neuroimaging data: from reverse inference to large-scale decoding. Neuron 72, 692–697 (2011).
MacHery, E. In defense of reverse inference. Br. J. Philos. Sci. 65, 251–267 (2014).
Mayer, R. E. Can you repeat that? Qualitative effects of repetition and advance organizers on learning from science prose. J. Educ. Psychol. 75, 40–49 (1983).
Bromage, B. K. & Mayer, R. E. Quantitative and qualitative effects of repetition on learning from technical text. J. Educ. Psychol. 78, 271–278 (1986).
Maguire, E. A., Frith, C. D. & Morris, R. G. M. The functional neuroanatomy of comprehension and memory: the importance of prior knowledge. Brain 122, 1839–1850 (1999).
Chiou, R., Humphreys, G. F., Jung, J. Y. & Lambon Ralph, M. A. Controlled semantic cognition relies upon dynamic and flexible interactions between the executive ‘semantic control’ and hub-and-spoke ‘semantic representation’ systems. Cortex 103, 100–116 (2018).
Chiou, R. & Lambon Ralph, M. A. Unveiling the dynamic interplay between the hub- and spoke-components of the brain’s semantic system and its impact on human behaviour. Neuroimage 199, 114–126 (2019).
Brown, J. I., Fishco, V. V. & Hanna, G. The Nelson–Denny Reading Test: Forms G and H (Riverside Publishing Company, 1993).
Raven, J. C. Advanced Progressive Matrices, Set II. (H. K. Lewis, London, 1962).
Daneman, M. & Carpenter, P. A. Individual differences in working memory and reading. J. Verbal Learn. Verbal 466, 450–466 (1980).
Bennett, G. K., Seashore, H. G. & Wesman, A. G. Differential Aptitude Tests (The Psychological Corporation, San Antonio, TX, 1972).
McNamara, D. S., Louwerse, M. M., McCarthy, P. M. & Graesser, A. C. Coh-Metrix: capturing linguistic features of cohesion. Discourse Process. 47, 292–330 (2010).
Mason, R. A. & Just, M. A. How the brain processes causal inferences in text: a theoretical account of generation and integration. Psychol. Sci. 15, 1–7 (2004).
Mason, R. A. & Just, M. A. Differentiable cortical networks for inferences concerning people’s intentions versus physical causality. Hum. Brain Mapp. 32, 313–329 (2011).
Mason, R. A. & Just, M. A. Identifying component discourse processing from their fMRI time course signatures. In Reading—from Words to Multiple Texts (eds Britt, M. A., Goldman, S. R. & Rouet, J.-F.) 147–159 (Routledge, New York, 2013).
Prat, C. S., Mason, R. A. & Just, M. A. Individual differences in the neural basis of causal inferencing. Brain Lang. 116, 1–13 (2011).
Fedorenko, E. & Blank, I. A. Broca’s area is not a natural kind. Trends Cogn. Sci. 24, 270–284 (2020).
Daneman, M. & Merikle, P. M. Working memory and language comprehension: a meta-analysis. Psychon. Bull. Rev. 3, 422–433 (1996).
Boudewyn, M. A. Effects of working memory span on processing of lexical associations and congruence in spoken discourse. Front. Psychol. 4, 60 (2013).
Sack, A. T. Parietal cortex and spatial cognition. Behav. Brain Res. 202, 153–161 (2009).
Yarkoni, T., Speer, N. K. & Zacks, J. M. Neural substrates of narrative comprehension and memory. Neuroimage 41, 1408–1425 (2008).
Haenggi, D., Kintsch, W. & Ann Gernsbacher, M. Spatial situation models and text comprehension. Discourse Proces. 19, 173–199 (1995).
Rinck, M. Spatial situation models. In The Cambridge Handbook of Visuospatial Thinking (eds Shah, P. & Miyake, A.) 334–382 (Cambridge University Press, 2005).
Bransford, J. D. & Johnson, M. K. Contextual prerequisites for understanding: Some investigations of comprehension and recall. J. Verbal Learn. Verbal Behav. 11, 717–726 (1972).
Koedinger, K. R., Corbett, A. T. & Perfetti, C. The knowledge-learning-instruction framework: bridging the science-practice chasm to enhance robust student learning. Cogn. Sci. 36, 757–798 (2012).
McDaniel, M. A. & Einstein, G. O. Training learning strategies to promote self-regulation and transfer: the knowledge, belief, commitment, and planning framework. Perspect. Psychol. Sci. 15, 1363–1381 (2020).
Patterson, K., Nestor, P. J. & Rogers, T. T. Where do you know what you know? The representation of semantic knowledge in the human brain. Nat. Rev. Neurosci. 8, 976–987 (2007).
Ferstl, E. C., Rinck, M. & Von Cramon, D. Y. Emotional and temporal aspects of situation model processing during text comprehension: an event-related fMRI study. J. Cogn. Neurosci. 17, 724–739 (2005).
Wright, T. S. & Cervetti, G. N. A systematic review of the research on vocabulary instruction that impacts text comprehension. Read. Res. Q. 52, 203–226 (2017).
Hinze, S. R. et al. When do spatial abilities support student comprehension of STEM visualizations? Cogn. Process. 14, 129–142 (2013).
Ponce, H. R., Mayer, R. E. & Lopez, M. J. A computer-based spatial learning strategy approach that improves reading comprehension and writing. Educ. Technol. Res. Dev. 61, 819–840 (2013).
McNamara, D. S. SERT: self-explanation reading training. Discourse Process. 38, 1–30 (2004).
McNamara, D. S. Self-explanation and reading strategy training (SERT) improves low-knowledge students’ science course performance. Discourse Process. 54, 479–492 (2017).
Chi, M. T. H., De Leeuw, N., Chiu, M. & Lavancher, C. Eliciting self‐explanations improves understanding. Cogn. Sci. 18, 439–477 (1994).
Graesser, A. C., McNamara, D. S., Louwerse, M. M. & Cai, Z. Coh-Metrix: analysis of text on cohesion and language. Behav. Res. Methods Instrum. Comput. 36, 193–202 (2004).
Oldfield, R. C. The assessment and analysis of handedness: the Edinburgh inventory. Neuropsychologia 9, 97–113 (1971).
Just, M. A. & Carpenter, P. A. A theory of reading: from eye fixations to comprehension. Psychol. Rev. 87, 329–354 (1980).
Raven, J. C., Styles, I. & Raven, M. Raven’s Progressive Matrices: SPM Plus SETS A-E. (Oxford Psychologists Press/The Psychological Corporation/Pearson test publishers, Oxford, England, 1998).
Auerbach, E. J., Xu, J., Yacoub, E., Moeller, S. & Uǧurbil, K. Multiband accelerated spin-echo echo planar imaging with reduced peak RF power using time-shifted RF pulses. Magn. Reson. Med. 69, 1261–1267 (2013).
Friston, K. J., Ashburner, J. T., Kiebel, S. J., Nichols, T. E. & Penny, W. D. Statistical Parametric Mapping: The Analysis of Functional Brain Images (Elsevier/Academic Press, Amsterdam, 2006).
Collins, D. L. 3D Model-based Segmentation of Individual Brain Structures from Magnetic Resonance Imaging Data (McGill University, Montreal, Canada, 1994).
Acknowledgements
This study was funded in part by Office of Naval Research Grant N00014-20-1-2625 and in part by the National Science Foundation under Grants BCS-2215741 and BCS-1748897. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation or the Office of Naval Research. The funding agencies played no role in the study design, data collection, analysis and interpretation of data, or the writing of this manuscript.
Author information
Authors and Affiliations
Contributions
R.M. and M.A.J. conceived, designed, and supervised the project. R.M. and A.L. created the materials and conducted the data collection. T.K. and A.L. analyzed the data and wrote the initial draft of the paper. All authors contributed to the writing and editing of the manuscript, and all authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Keller, T.A., Mason, R.A., Legg, A.E. et al. The neural and cognitive basis of expository text comprehension. npj Sci. Learn. 9, 21 (2024). https://doi.org/10.1038/s41539-024-00232-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41539-024-00232-y