A point to reading

Reading hands

Editor's Introduction

Reading literary fiction improves theory of mind

annotated by
Catherine Richardson

The books we read change our minds, but in what ways? This study shows that certain types of books (nonfiction, popular, literary classics) affect the minds of readers differently. Although reading fiction has long been a requirement in education and a pastime for many, little empirical research has investigated the ways in which interacting with characters who are not really real can affect our ideas about real people in real life. This study shows causal effects of reading on a social cognitive process called Theory of Mind, “mind-reading,” or our ability to imagine the thoughts and emotions of others. The researchers found that literary fiction improved Theory of Mind.

Paper Details

Original title
Reading literary fiction improves theory of mind
David Comer Kidd Emanuele Castano
Original publication date
Vol. 342 no. 6156 pp. 377-380
Issue name


Understanding others’ mental states is a crucial skill that enables the complex social relationships that characterize human societies. Yet little research has investigated what fosters this skill, which is known as Theory of Mind (ToM), in adults. We present five experiments showing that reading literary fiction led to better performance on tests of affective ToM (experiments 1 to 5) and cognitive ToM (experiments 4 and 5) compared with reading nonfiction (experiments 1), popular fiction (experiments 2 to 5), or nothing at all (experiments 2 and 5). Specifically, these results show that reading literary fiction temporarily enhances ToM. More broadly, they suggest that ToM may be influenced by engagement with works of art.


The capacity to identify and understand others’ subjective states is one of the most stunning products of human evolution. It allows successful navigation of complex social relationships and helps to support the empathic responses that maintain them (15). Deficits in this set of abilities, commonly referred to as Theory of Mind (ToM), are associated with psychopathologies marked by interpersonal difficulties (68). Even when the ability is intact, disengagement of ToM has been linked to the breakdown of positive interpersonal and intergroup relationships (9).

Researchers have distinguished between affective ToM (the ability to detect and understand others’ emotions) and cognitive ToM (the inference and representation of others’ beliefs and intentions) (78). The affective component of ToM, in particular, is linked to empathy (positively) and antisocial behavior (negatively) (78). It is thus not surprising that we foster ToM in our children by having them attend to the emotional states of others: “Do you think he is happy or sad as a consequence of your action?” Such explicit encouragements to understand others usually diminish when children appear to skillfully and empathically engage in interpersonal relationships. Cultural practices, though, may function to promote and refine interpersonal sensitivity throughout our lives. One such practice is reading fiction.

Familiarity with fiction, self-reported empathy, and performance on an advanced affective ToM test have been correlated (10, 11), and limited experimental evidence suggests that reading fiction increases self-reported empathy (1213). Fiction seems also to expand our knowledge of others’ lives, helping us recognize our similarity to them (101114). Although fiction may explicitly convey social values and reduce the strangeness of others, the observed relation between familiarity with fiction and ToM may be due to more subtle characteristics of the text. That is, fiction may change how, not just what, people think about others (101114). We submit that fiction affects ToM processes because it forces us to engage in mind-reading and character construction. Not any kind of fiction achieves that, though. Our proposal is that it is literary fiction that forces the reader to engage in ToM processes.

The category of literary fiction has been contested on the grounds that it is merely a marker of social class, but features of the modern literary novel set it apart from most best-selling thrillers or romances. Miall and Kuiken (15–17) emphasize that through the systematic use of phonological, grammatical, and semantic stylistic devices, literary fiction defamiliarizes its readers. The capacity of literary fiction to unsettle readers’ expectations and challenge their thinking is also reflected in Roland Barthes’s (18) distinction between writerly and readerly texts. Although readerly texts—such as most popular genre fiction—are intended to entertain their mostly passive readers, writerly—or literary—texts engage their readers creatively as writers. Similarly, Mikhail Bakhtin (19) defined literary fiction as polyphonic and proposed that readers of literary fiction must contribute their own to a cacophony of voices. The absence of a single authorial perspective prompts readers to enter a vibrant discourse with the author and her characters.

Bruner (20), like Barthes and Bakhtin, has proposed that literature engages readers in a discourse that forces them to fill in gaps and search “for meanings among a spectrum of possible meanings” (p. 25). Bruner argues that to elicit this writerly stance, literary fiction triggers presupposition (a focus on implicit meanings), subjectification [depicting reality “through the filter of the consciousness of protagonists in the story” (p. 25)], and multiple perspectives (perceiving the world simultaneously from different viewpoints). These features mimic those of ToM.

Our contention is that literary fiction, which we consider to be both writerly and polyphonic, uniquely engages the psychological processes needed to gain access to characters’ subjective experiences. Just as in real life, the worlds of literary fiction are replete with complicated individuals whose inner lives are rarely easily discerned but warrant exploration. The worlds of fiction, though, pose fewer risks than the real world, and they present opportunities to consider the experiences of others without facing the potentially threatening consequences of that engagement. More critically, whereas many of our mundane social experiences may be scripted by convention and informed by stereotypes, those presented in literary fiction often disrupt our expectations. Readers of literary fiction must draw on more flexible interpretive resources to infer the feelings and thoughts of characters. That is, they must engage ToM processes. Contrary to literary fiction, popular fiction, which is more readerly, tends to portray the world and characters as internally consistent and predictable (21). Therefore, it may reaffirm readers’ expectations and so not promote ToM.

To test our general hypothesis that literary fiction would prime ToM, we first compared the effects of reading literary fiction with reading nonfiction (experiment 1) and then focused on testing our predictions about the different effects of reading literary and popular fiction (experiments 2 to 5).

Difficulty in precisely quantifying literariness notwithstanding, some works are considered particularly good examples of literature and are recognized with prestigious awards (e.g., the National Book Award). Although selected through an inherently inexact process, prize-winning texts are more likely to embody general characteristics of literature than bestsellers of genre fiction (e.g., romance and adventure stories). In the absence of a clear means of quantifying literariness, the judgments of expert raters (i.e., literary prize jurors) were used. Accordingly, to study the effects of reading literary fiction, we selected literary works of fiction by award-winning or canonical writers and compared their effects on ToM with reading nonfiction, popular fiction, or nothing at all.

In experiment 1 (22), 86 participants were randomly assigned to read one of six short texts (three literary fiction and three nonfiction). Next, participants completed a false-belief test as a measure of cognitive ToM (23) and an advanced affective ToM test, the Reading the Mind in the Eyes Test [RMET (6)], in which they were asked to identify facially expressed emotions. Participants’ familiarity with fiction was assessed using the Author Recognition Test (24), an index of general exposure to fiction that avoids problems of socially desirable responding. Affect (25), engagement with the text (transportation scale) (26), and demographic information were assessed.

For the cognitive ToM task, participants were asked to indicate the probability that a character would act according to the character’s own false belief or the participant’s true belief. Participants (n = 13) who failed to give probabilities and univariate outliers (>3.5 SD from the mean; n = 6) were excluded from the analysis. Probabilities were compared in a 2(false-belief versus no false-belief condition) × 2(fiction versus nonfiction) analysis of variance (ANOVA). There was no main effect for the type of scenario, which suggests no evidence of egocentric bias (F1,63= 1.47, P = 0.22). The level of false estimates was low across conditions (grand mean ± standard deviation, 6.61± 9.79).

Scores for the affective ToM task were computed by summing the number of correct identifications of facially expressed emotions (6) and analyzed using ANOVA, with condition and Author Recognition Test as between-participants factors (Table 1). Scores were higher in the literary fiction than nonfiction condition (Table 2). Higher Author Recognition Test scores (indicating more familiarity with fiction) predicted higher RMET scores. When entered as covariates; education, gender, age, transportation, negative affect, self-reported sadness, and average time spent on RMET items did not significantly alter the main effect of condition (P = 0.05). More time spent on RMET items predicted better performance (β = 0.23, P = 0.02). No other covariates approached significance (Pvalues of >0.14).


Table 1.  RMET and DANVA2-AF analyses.

This is the test statistic used in ANOVA, the F-ratio, and it is reported with the degrees of freedom as subscript. 

If the F-ratio is greater than 1, the experimental manipulation had some effect above and beyond the effect of extraneous factors.

It can be compared against a critical value based on its degrees of freedom. 


Beta represents the standardized regression coefficient.

Basically, this is the slope of the line that describes the relation between scores on the Author Recognition Test and scores on the RMET.

In this case, for every one standard deviation of increase in Author Recognition Test , there will be a .36 standard deviation increase in RMET.


The Probability value or P-value.

If the probability of obtaining the value of our test statistic by chance is less than .05, or if P<.05,  then we generally reject the null hypothesis that nothing happened as false: There is an effect in the population. 


An effect size measure that measures the overall effect of ANOVA.

It has been suggested that values of .01, .06, and, .14 represent small, medium, and large effects, respectively. 

Remember that these are rough guidelines and that effect sizes need to be interpreted within the context of the research literature.


Table 2.  Means (adjusted for other terms in the models) and standard deviations of RMET and DANVA2-AF scores. 95% confidence intervals are reported in brackets. X, no data. Means in the same row that share the same superscripts differ at P < 0.05.

Literary fiction

Scores (means) were higher on the RMET in the literary fiction condition than in the nonfiction condition in Experiment 1.

They were also higher in the literary fiction conditions than the popular fiction conditions in experiments 3, 4, and 5.

Exploring RMET (25.90)

The mean is a simple statistical model of the center of a distribution of scores. It is a hypothetical estimate of the "typical" or average score.

Means should be compared across conditions.

In conditions with the higher means, participants performed better than participants in conditions with lower means (except in Experiment 2, where the reverse is true). ones. 

Exploring RMET (4.38a)

Standard deviation: an estimate of the average variability (spread) of a set of data measured in the same units of measurement as the original data.

Small standard deviations (relative to the  value of the mean itself) indicates that most of the data points are close to the mean (i.e. the mean is an accurate representation of the data).

A standard deviation of 0 would mean that all of the scores were the same.

Exploring RMET ([24.55, 27.24])

Confidence Intervals for the given (means) are the range of values around that statistic that are believed to contain, with a certain probability (95%), the true value of that statistic.

If the confidence interval is small, the sample mean must be close to the true mean. 

Experiment 2 aimed to replicate and extend the findings of experiment 1 by using different texts and a different measure of affective ToM, the Diagnostic Analysis of Nonverbal Accuracy 2—Adult Faces test (DANVA2-AF) (27). Experiment 2 was also designed to directly differentiate between the effects of popular versus literary fiction (28).

Participants (n = 114) were randomly assigned to read one of three excerpts from recent finalists for the National Book Award (literary fiction condition), one of three excerpts from recent bestsellers on Amazon.com (popular fiction condition), or nothing at all (no-reading condition) (22). Participants then completed the measure of cognitive ToM used in experiment 1 and the DANVA2-AF before completing the Author Recognition Test, the transportation scale, and demographic questions. Performance on the false-belief cognitive ToM task was analyzed as in experiment 1, but no significant effects were detected (P values of >0.13).

DANVA2-AF scores were computed by summing errors on all of the negative affect items (22). Untransformed means are reported, but log-transformed scores were used in an ANOVA with experimental condition and Author Recognition Test as between-participants factors (see Table 1). No interaction emerged, but higher scores on the Author Recognition Test were weakly associated with fewer errors on the DANVA2-AF. The omnibus main effect of condition was marginally significant, and the pairwise comparisons revealed significant differences between conditions consistent with our hypothesis. Fewer errors were made in the literary fiction condition than in the no-reading and popular fiction conditions, whereas there was no difference between the latter two (P = 0.98) (seeTable 2). As in experiment 1; education, gender, and age were not significant covariates (P values of >0.34) and did not alter the critical, omnibus main effect of condition (P = 0.08). Transportation did not correlate with DANVA2-AF scores (P = 0.94).

Experiment 3 (N = 69) aimed to replicate the literary fiction versus popular fiction comparison (22). The popular fiction texts were three stories from an edited anthology of popular fiction (29), and literary fiction texts were three stories from a collection of the 2012 PEN/O. Henry Award winners for short stories (30). Participants’ affect was assessed using the Positive Affect Negative Affect Scale (PANAS) and a single-item report of sadness. Using the same analytical strategy used in experiment 1, it was found that RMET scores were higher in the literary fiction condition than in the popular fiction condition. There were no effects involving the Author Recognition Test (for test, Table 1; for means, Table 2). Education, gender, and the average time spent on RMET items were not significant covariates (P values of >0.12) and did not alter the effect of condition (P = 0.04).

In experiments 1 and 2, no effects were observed on the cognitive ToM measure, a false-belief task. Since participants in neither condition clearly failed to recruit cognitive ToM, it is possible that the task may have been insufficiently sensitive. Therefore a fourth experiment included the Yoni test (7). The Yoni test is a new measure that has been used in only a handful of studies. However, it has been validated (7831) and has the advantage of assessing both cognitive and affective ToM.

In experiment 4, four of the texts used in experiment 3 along with two new stories, one for each condition (i.e., literary fiction and popular fiction), from the same sources were used (22). Participants (N = 72) completed the RMET and the Yoni test. For the 24 cognitive and 24 affective ToM trials in the Yoni test, participants must draw from minimal linguistic and visual cues to infer a character’s thoughts and emotions, respectively. An additional 16 control trials require the identification of spatial relations. For each type of item, there are equal numbers of trials requiring first-order and second-order (more difficult) inferences.

RMET scores were higher in the literary fiction condition than in the popular fiction condition (for tests, see Table 1; for means, see Table 2). Author Recognition Test scores predicted RMET scores. Entered as covariates, subject variables (i.e., education, age, and gender) did not reach significance (P values of >0.14), though time spent on RMET items did (β = 0.21, P = 0.04). However, the effect of condition was only slightly altered and remained significant (P = 0.05).

Yoni performance was analyzed via a mixed analysis of covariance (ANCOVA) with type (affective versus cognitive) and level of difficulty (first order versus second order) of trials as within-participants factors, condition and Author Recognition Test scores as between-participants factors, and scores on the control task as a covariate (31). A main effect of condition emerged (F1,67 = 4.47, P = 0.03, ωp2 = 0.04) but no other effects involving condition or Author Recognition Test scores approached significance (P values of >0.27). Other significant effects, which are not relevant to the hypotheses, are described in the supplementary materials (22). Participants in the literary fiction condition [0.89 ± 0.08, 95% confidence interval (CI) = 0.86, 0.92] performed with greater accuracy on all ToM trials than those in the popular fiction condition (0.85 ± 0.10, CI = 0.82, 0.87).

A fifth experiment (22) aimed to replicate experiment 4 and test for the influences of subject variables (i.e., education, age, gender) and possible confounds with a larger sample (N = 356). As in experiments 3 and 4, three works of literary fiction were taken from a collection of the 2012 PEN/O. Henry Prize winners (30) and three works of popular fiction from an anthology (29). Participants were randomly assigned to the literary fiction, popular fiction, or no-reading control condition; completed the RMET and Yoni tasks; reported their current affect (PANAS), along with two additional items assessing sadness and happiness; and completed the Author Recognition Test. Participants in the two reading conditions completed the transportation scale and two additional items assessing the extent to which they enjoyed reading the text and how much they thought it represented “excellent literature.” All participants reported their age, gender, ethnicity, and highest level of attained education before being debriefed and compensated.

Literary texts (3.54 ± 1.31, CI = 3.28, 3.80) were enjoyed less than popular texts (4.07 ± 1.53, CI = 3.80, 4.34;F1,223 = 7.62, P = 0.006, ωp2 = 0.02), but they were seen as better examples of literature (4.84 ± 1.40, CI = 4.56, 5.11) than popular texts (4.43 ± 1.60, CI = 4.15, 4.72; F1,223 = 4.04, P = 0.04, ωp2 = 0.01). Reported transportation did not significantly differ across conditions (F1,223 = 3.20, P = 0.07, ωp2 = 0.00), although it was slightly higher in the literary condition (3.90 ± 0.41, CI = 3.83, 3.98) than the popular condition (3.81 ± 0.39, CI = 3.73, 3.88). None of these variables were correlated (P values of >0.11) with performance on either the RMET or the Yoni task (controlling for performance on physical trials).

Results on the RMET were analyzed as in the previous experiments. The effect of condition was significant (seeTable 1). Scores were significantly higher in the literary fiction condition than in the popular fiction and no-reading conditions (see Table 2). The latter two conditions did not differ (P = 0.65). A significant main effect of Author Recognition Test scores emerged (see Table 1). Added as covariates, gender, education, age, positive affect, negative affect, sadness, happiness, and time spent on RMET items did not significantly relate to RMET scores (P values of >0.23), and the effect of condition was only slightly altered (P = 0.06).

The analytical strategy used in experiment 4 was also used for the Yoni task. The main effect of condition (F2,351= 0.64, P = 0.52) was not significant, but there was a significant interaction of condition and the two within-subjects factors, trial difficulty and trial type (F2,351 = 3.42, P = 0.03). The interaction of Author Recognition Test scores, trial difficulty, and trial type approached significance (F1,351 = 2.88, P = 0.09), but no other effects involving condition or Author Recognition Test did (P values of >0.11). Other significant effects, which are not relevant to the hypotheses, are described in the supplementary materials (22). To disentangle the three-way interaction including the experimental condition, a repeated measures ANCOVA, with item type (cognitive, affective) and condition as factors, and performance on the control task as covariate, was conducted separately for first-order and second-order trials. On first-order trials, there was a main effect of the covariate (β = 0.20, P < 0.001, ωp2 = 0.03) and of condition (F2,351 = 4.21, P = 0.01, ωp2 = 0.01). No other effects approached significance (P values of >0.87). Pair-wise comparisons revealed that scores were higher in the literary fiction condition (0.98 ± 0.02, CI = 0.97, 0.99) than in the popular fiction condition (0.96 ± 0.06, CI = 0.95, 0.97; t = 2.85,P = 0.004) and the no-reading condition (0.97 ± 0.05, CI = 0.96, 0.98; t = 2.01, P = 0.04). The popular fiction condition and no-reading condition did not differ (P = 0.33). On second-order trials, no effects involving condition or Author Recognition Test scores approached significance (P values of >0.16).

The difference between first- and second-order trials, which appeared only in experiment 5, might be due to its higher statistical power, which allowed for this difference to be detected. The second-order Yoni trials may require a set of more advanced cognitive skills (e.g., metarepresentation) that are less easily influenced by manipulation than the other tasks, all of which are first-order ToM tasks.

Experiment 1 showed that reading literary fiction, relative to nonfiction, improves performance on an affective ToM task. Experiments 2 to 5 showed that this effect is specific to literary fiction. On cognitive measures, no effects emerged on the false-belief task used in experiments 1 and 2. Because error rates on the false-belief task were very low, the measure may have been insufficiently sensitive to capture the effects of the manipulations. However, on the more-demanding Yoni task used in experiments 4 and 5, the effect on cognitive trials was present and indistinguishable from that on affective trials.

The Author Recognition Test predicted RMET scores in experiments 1, 4, and 5, and success on the DANVA2-AF (marginally) in experiment 2; but it did not predict performance on the Yoni task or, anomalously, the RMET in experiment 3. Thus, although generally consistent with previous findings (1012), our pattern of results suggests the need for further research into the relation between measures of familiarity with fiction and performance on different ToM tasks.

The results of five experiments support our hypothesis that reading literary fiction enhances ToM. Existing explanations focused on the content of fiction cannot account for these results. First, the texts we used varied widely in subject matter. Second, it is unlikely that people learned much more about others by reading any of the short texts. Third, the effects were specific to literary fiction. We propose that by prompting readers to take an active writerly role to form representations of characters’ subjective states, literary fiction recruits ToM. The evidence we report here is consistent with this view, but we see these findings as preliminary and much research is needed.

First, our findings demonstrate the short-term effects of reading literary fiction. However, taken together, the relation between the Author Recognition Test and ToM performance and the finding that it is specifically literary fiction that facilitates ToM processes suggest that reading literary fiction may lead to stable improvements in ToM. Because the Author Recognition Test does not distinguish between exposure to literary and popular fiction, additional research with refined methods is necessary to test this hypothesis.

Second, literary fiction, like many stimuli drawn from the real world, is heterogeneous and complex. Although it is not clearly quantifiable, literariness has ecological validity as a construct, as suggested by participants’ agreement with prize jurors on the literariness of the texts in experiment 5. On the basis of strategies used by researchers studying violent video games [e.g., (32)] and fiction (12), literariness was held relatively constant in each condition while potentially confounding features varied. Self-reported affect along with transportation into, enjoyment, and perceived literariness of the texts did not account for the effects of condition. Further analyses tested the roles of superficial linguistic features of the texts. Frequencies of negative and positive emotion terms, social words, cognitive words, big words (more than six letters), and self-references were computed in each text using Linguistic Inquiry and Word Count (LIWC) software (33). Standardized RMET or DANVA2-AF scores from all experiments were analyzed using ANCOVA, with experimental condition and Author Recognition Test scores as factors and all six LIWC variables as covariates (data from the no-reading conditions was not included). The frequency of negative emotion words (β = 0.09, P = 0.05, ωp2 = 0.00) positively predicted ToM scores, but no other effects of LIWC variables approached significance (P values of >0.17). The main effects of condition (F1,515 = 12.02, P < 0.001, ωp2 = 0.02) and Author Recognition Test scores (β = 0.23, P < 0.001, ωp2 = 0.05) remained significant. This result suggests that the effect of literature observed across experiments may not be easily reduced to superficial linguistic characteristics. Future research, notably following the lead of Miall and Kuiken (1517), as well as Bruner (20), may reveal more subtle, but nonetheless quantifiable, features that set literary fiction apart.

The present findings mark only one step toward understanding the impact of our interactions with fiction, the experiences of which are thought to contribute to the development of consciousness and to enrich our daily lives (34). Indeed, there are surely many consequences of reading on cognitive and affective processes that are independent of its effects on ToM, and it seems likely that many of those may result from popular, as well as literary, fiction. Similarly, whereas literary fiction appears able to promote ToM, this capacity does not fully capture the concept of literariness, which includes, among others, aesthetic and stylistic matters not addressed in this research. It is our hope that further research will focus on other forms of art, such as plays and movies, that involve identifying and interpreting the subjective experiences of others (1028).

Literature has been deployed in programs intended to promote social welfare, such as those intended to promote empathy among doctors (35) and life skills among prisoners (36). Literature is, of course, also a required subject throughout secondary education in the United States, but reformers have questioned its importance: A new set of education standards that has been adopted by 46 U.S. states (the Common Core State Standards) controversially calls for less emphasis on fiction in secondary education [see (37)]. Debates over the social value of types of fiction and the arts more broadly are important, and it seems critical to supplement them with empirical research. These results show that reading literary fiction may hone adults’ ToM, a complex and critical social capacity.

Supplementary Materials


Materials and Methods

Supplementary Text

References (3851)

References and Notes

  1. C. Batson, in The Handbook of Social Psychology, D.T. Gilbert, S.T. Fiske, G. Lindzey, Eds. (McGraw-Hill, Boston, 1998), pp. 282–316.

  2. F. B. de Waal, The antiquity of empathy. Science 336, 874–876 (2012).

  3. R. Saxe, S. Carey, N. Kanwisher, Understanding other minds: Linking developmental psychology and functional neuroimaging. Annu. Rev. Psychol. 55, 87–124 (2004). CrossRefMedlineWeb of ScienceGoogle Scholar

  4. C. L. Sebastian, N. M. G. Fontaine, G. Bird, S.-J. Blakemore, S. A. De Brito, E. J. P. McCrory, E. Viding, Neural processing associated with cognitive and affective Theory of Mind in adolescents and adults. Soc. Cogn. Affect. Neurosci. 7, 53–63 (2012).

  5. S. G. Shamay-Tsoory, The neural bases for empathy. Neuroscientist 17, 18–24 (2011).

  6. S. Baron-Cohen, S. Wheelwright, J. Hill, Y. Raste, I. Plumb, The “Reading the Mind in the Eyes” Test revised version: A study with normal adults, and adults with Asperger syndrome or high-functioning autism. J. Child Psychol. Psychiatry 42, 241–251 (2001).

  7. S. G. Shamay-Tsoory, J. Aharon-Peretz, Dissociable prefrontal networks for cognitive and affective theory of mind: A lesion study. Neuropsychologia 45, 3054–3067 (2007).

  8. S. G. Shamay-Tsoory, H. Harari, J. Aharon-Peretz, Y. Levkovitz, The role of the orbitofrontal cortex in affective theory of mind deficits in criminal offenders with psychopathic tendencies. Cortex 46, 668–677 (2010).

  9. E. Castano, in The Oxford Handbook of Personality and Social Psychology, K. Deaux and M. Snyder, Eds. (Oxford Univ. Press, New York, 2012), pp. 419–445.

  10. R. Mar, K. Oatley, J. Hirsh, J. dela Paz, J. Peterson, Bookworms versus nerds: Exposure to fiction versus non-fiction, divergent associations with social ability, and the simulation of fictional social worlds. J. Res. Pers. 40, 694–712 (2006).

  11. R. Mar, K. Oatley, J. Peterson, Exploring the link between reading fiction and empathy: Ruling out individual differences and examining outcomes. Communications 34, 407–428 (2009).

  12. M. Djikic, K. Oatley, M. Moldoveanu, Reading other minds: Effects of literature on empathy. Scientific Study of Literature 3, 28–47 (2013).

  13. P. M. Bal, M. Veltkamp, How does fiction reading influence empathy? An experimental investigation on the role of emotional transportation. PLOS ONE 8, e55341 (2013).

  14. E. Schiappa, P. Gregg, D. Hewes, The parasocial contact hypothesis. Commun. Monogr. 72, 92–115 (2005).

  15. D. Miall, D. Kuiken, Foregrounding, defamiliarization, and affect: Response to literary stories. Poetics 22, 389–407 (1994).

  16. D. Miall, D. Kuiken, Beyond text theory: Understanding literary response. Discourse Process. 17, 337–352 (1994).

  17. D. Miall, D. Kuiken, What is literariness? Three components of literary reading. Discourse Process. 28, 121–138 (1999).

  18. R. Barthes, S/Z: An Essay (Hill and Wang, New York, 1974).

  19. M. Bakhtin, Problems of Dostoevsky’s Poetics (Univ. Minnesota, Minneapolis, 1984).

  20. J. Bruner, Actual Minds, Possible Worlds (Harvard Univ., Cambridge, MA, 1986).

  21. R. Gerrig, D. Rapp, Psychological processes underlying literary impact. Poetics Today 25, 265–281 (2004).

  22. Materials and methods are available as supplementary materials on Science Online.

  23. B. A. Converse, S. Lin, B. Keysar, N. Epley, In the mood to get over yourself: Mood affects theory-of-mind use. Emotion 8, 725–730 (2008).

  24. D. J. Acheson, J. B. Wells, M. C. MacDonald, New and updated tests of print exposure and reading abilities in college students. Behav. Res. Methods 40, 278–289 (2008).

  25. D. Watson, L. A. Clark, A. Tellegen, Development and validation of brief measures of positive and negative affect: The PANAS scales. J. Pers. Soc. Psychol. 54, 1063–1070 (1988).

  26. M. C. Green, T. C. Brock, The role of transportation in the persuasiveness of public narratives. J. Pers. Soc. Psychol. 79, 701–721 (2000).

  27. S. Nowicki, Manual for the Receptive Tests of the Diagnostic Analysis of Nonverbal Accuracy 2 (Department of Psychology, Emory Univ., Atlanta, GA, 2010).

  28. R. Mar, K. Oatley, The function of fiction is the abstraction and simulation of social experience. Perspect. Psychol. Sci. 3, 173–192 (2008).

  29. G. Hoppenstand, Ed., Popular Fiction: An Anthology (Longman, New York, 1998).

  30. L. Furman, Ed., The PEN/O. Henry Prize Stories 2012: The Best Stories of the Year (Anchor, New York, 2012).

  31. S. G. Shamay-Tsoory, Recognition of ‘fortune of others’ emotions in Asperger syndrome and high functioning autism. J. Autism Dev. Disord. 38, 1451–1461 (2008).

  32. B. J. Bushman, C. A. Anderson, Comfortably numb: Desensitizing effects of violent media on helping others. Psychol. Sci. 20, 273–277 (2009).

  33. J. Pennebaker, R. Booth, M. Francis, Linguistic Inquiry and Word Count (LIWC): LIWC2007 (LIWC.net, Austin, TX, 2007).

  34. N. Humphrey, Consciousness Regained (Oxford Univ., Oxford, 1983).

  35. M. F. McLellan, A. H. Jones, Why literature and medicine? Lancet 348, 109–111 (1996).

  36. J. Billington, ‘Reading for Life’: Prison reading groups in practice and theory. Critical Survey 23, 67–85 (2011).

  37. S. Mosle, What should children read? New York Times, 22 November 2012; http://opinionator.blogs.nytimes.com/2012/11/22/what-should-children-read/?r=1.

  38. D. DeLillo, The Angel Esmeralda (Scribner, New York, 2011), pp. 47–54.

  39. L. Davis, The Collected Stories of Lydia Davis (Farrar, Straus, & Giroux, New York, 2009), pp. 333–337.

  40. Chekhov, in Anton Chekhov’s Short Stories, R. E. Matlow, Ed. (Norton, New York, 1979; original work published 1884), pp. 1–3.

  41. Mann, How the potato changed the world. Smithsonian Magazine, November 2011; www.smithsonianmag.com/history-archaeology/How-the-Potato-Changed-the-World.html.

  42. Gandel, Bamboo steps up. Smithsonian Magazine, 21 March 2008; www.smithsonianmag.com/specialsections/ecocenter/greener-living/bamboo.html.

  43. R. Dunn, The story of the most common bird in the world. Smithsonian Magazine, 2 March 2012; www.smithsonianmag.com/science-nature/The-Story-of-the-Most-Common-Bird-in-the-World.html.

  44. F. Ali, T. Chamorro-Premuzic, Investigating theory of mind deficits in nonclinical psychopathy and Machiavellianism. Pers. Individ. Dif. 49, 169–174 (2010). CrossRefWeb of ScienceGoogle Scholar

  45. L. Erdrich, The Round House (Harper Collins, New York, 2012).

  46. T. Obreht, The Tiger’s Wife (Random House, New York, 2011).

  47. J. Ward, Salvage the Bones (Bloomsbury, New York, 2011).

  48. G. Flynn, Gone Girl (Crown Publishers, New York, 2012).

  49. D. Steele, The Sins of the Mother (Random House, New York, 2012).

  50. W. Young, Cross Roads (FaithWords, Nashville, TN, 2012).

  51. H. A. Elfenbein, N. Ambady, Predicting workplace outcomes from the ability to eavesdrop on feelings. J. Appl. Psychol. 87, 963–971 (2002). CrossRefMedlineWeb of ScienceGoogle Scholar

  52. Acknowledgments: Data are available in spreadsheet format from authors by request. This research was funded by a Prize Fellowship and a Dissertation Fellowship by the New School for Social Research to the first author, and a Faculty Development Award by the New School for Social Research to the second author. We are grateful to Andrew Marzoni for his insight into literary criticism and pointing us toward Barthes’s S/Z (19).