Skip to main content icon/video/no-internet

Textual data refer to systematically collected material consisting of written, printed, or electronically published words, typically either purposefully written or transcribed from speech. Text collected for use as data typically reflects a conscious research purpose, motivated by a design aimed at yielding insight on some feature of the social or political world. This entry outlines the purpose, issues, and challenges involved in selecting, preparing, and analyzing textual data.

The process of textual data analysis follows several main steps. First, a text and its author are identified that will directly inform the research question at hand, for instance, legislative speeches if one is interested in lawmakers' policy agenda. Second, this text must be processed, typically involving numerous decisions about how to convert, store, edit, and combine texts. Next, the processed text is analyzed, a step that may take any of a wide variety of forms. Humans may read the text to classify it, possibly after dividing texts into smaller units; computerized tools may analyze the text to count frequencies of preidentified words or phrases; statistical methods may be applied to specific words, word patterns, or sentences to construct scales, clusters, or to identify other patterns. Finally, the results of the analysis are summarized, interpreted, and reported to inform the research question that motivated the analysis of the textual data.

The steps in this process are not dissimilar to those found in any empirical research design, but the systematic use of textual data may present special challenges in the stages related to identifying the data to be collected and preparing the data for analysis. In the modern era of near-ubiquitous electronic textual content, this problem stems not from a lack of availability of textual data or tools for working with them but, rather, from the opposite: The staggering quantity and variety of textual data create real challenges in selecting which texts to analyze. Furthermore, the technical challenges of working with different formats for recording electronic texts, as well as converting these into formats or “data sets” that suit the purpose of analysis, are often far from trivial. Despite such challenges, however, textual data remain one of the most promising and one of the least explored sources of systematic information about the political and social world.

The Appeal of Textual Data

Textual data in the political and social sciences have several principal advantages. The first can be found in the nature of text and the information it may contain about the authors or speakers generating that text. If the task at hand is to gauge overt messages or signals conveyed through text, of course, then no better source of information exists than the texts themselves. For instance, in 1951, in a well-known study of articles by other Politburo members about Stalin on the occasion of his 70th birthday, Nathan Leites, Else Bernaut, and Raymond Garthoff were able to discern differences in groups with regard to communist ideology. In this instance, the messages signaled not only an underlying orientation but also a degree of political maneuvering with regard to a leadership struggle following the foreseeable event of Stalin's death. The messages are themselves significant, and these could only be gleaned from the public articles authored by each Politburo member, written in the full knowledge that they would be reprinted in the party and general Soviet press. Similar content is present in the overt messages from advertising—although such analyses frequently focus on nontextual content as well—as well as in political speeches and political advertising. The principal advantage of textual data is that what political actors write or say may be a significant political signal or act, a form of purposive political act best judged using the textual data as the best manifestation of the act itself.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading