You are here

Parallels and Allusions in Early Chinese Texts: A Digital Approach, by Donald Sturgeon

DHAsia presents Parallels and Allusions in Early Chinese Texts: A Digital Approach, by Donald Sturgeon.

April 25, 2017 4:15pm to 6:00pm
Text reuse in the form of textual parallels within and between early Chinese transmitted texts is extensive, widespread, and typically unattributed, often reflecting complex textual histories involving repeated transcription, compilation, and editing spanning many centuries and involving contributions from multiple authors and editors. In later works, a related but distinct type of text reuse appears: intentional but unattributed allusion to the contents of earlier influential or well-known works. Identifying concrete instances of both types of reuse can assist in the interpretation of obscure or disputed passages, and for early texts in particular can also shed light upon difficult issues of authorship and textual history.
Digital methods not only offer the prospect of locating individual instances of both types of reuse automatically, but to the extent to which they can reliably do so also make possible the systematic study of text reuse across a corpus of works as a whole. I describe methods of identifying concrete instances of text reuse in the classical Chinese corpus, evaluate the degrees of accuracy achieved, and demonstrate how the data produced allow text reuse patterns to be explored at a corpus level.
DHAsia gratefully acknowledges support for Prof. Sturgeon's residency from the Center for Spatial and Textual Analysis (CESTA), the Center for Interdisciplinary Digital Research, the Confucius Institute, the Center for East Asian Studies, and other partners.
About the Speaker
Since 2005, he has managed the Chinese Text Project (, an online digital library of pre-modern Chinese which is now the largest such library in the world and attracts tens of thousands of visitors and large numbers of crowd-sourced contributions every day. His current projects include large-scale Optical Character Recognition (OCR) of historical Chinese documents, the application of machine learning to the dating of pre-modern Chinese texts, and development and evaluation of automated methods for analyzing pre-modern Chinese documents and their relationship to the wider corpus of pre-modern Chinese writing.
Free and Open to the Public
Event Sponsor: 
Stanford University Libraries, East Asia Library, Program in Modern Thought and Literature, Center for Spatial and Textual Analysis (CESTA), History Department, Center for East Asian Studies, Department of East Asian Languages and Cultures
Contact Email:
Phone Number: 
(650) 723-2651


April 26, 2018 - 4:00pm
Los Angeles, California
Please join the USC U.S.-China Institute and USC professor Brett Sheehan for a discussion on the evolution of Chinese capitalism chronicling the fortunes of the Song family of North China under five successive authoritarian governments.


June 5, 2018 - 7:00pm
Los Angeles, California

Please join the USC U.S.-China Institute, the East Asian Studies Center, and the USC School of Cinematic Arts for a screening of the 1993 Chinese film Woman Sesame Oil Maker (香魂女). It tells the story of a woman in a small village who buys a peasant wife for his mentally disabled son after her sesame oil business becomes unexpectedly successful. The screening will be followed by a Q&A with the director, Xie Fei (谢飞).