Skip to main content icon/video/no-internet

Aggregate data are quantified attributes of collectivities that either relate to the body of interest as a whole (e.g., the level of democracy for countries) or have been aggregated on the basis of the properties of individual members of the collective. Aggregate data could also be negative, that is, not individual data, meaning that aggregate data refer to bigger entities (e.g., nations, regions, or companies) than individual data do (e.g., voters, workers, or inhabitants of some geographic area). The process of aggregation can be performed by calculating sums or various means (e.g., geometric or arithmetic mean) of the frequency distribution of individual cases. Aggregate data are predominantly secondary data; that is, researchers do not usually collect such data on their own. Another distinction refers to macro- versus microdata, with aggregate data always referring to quantities on a higher meso- or macrolevel. In this entry, types and sources of such data, major problems of reliability and comparability, and the possible applications of aggregate data are discussed.

Kinds of Aggregate Data

There are three types of aggregate data:

  • data relating to a collective as a whole (e.g., population or democracy indices, number of veto players or other characteristics of political systems);
  • aggregated individual data (e.g., the Gini Index for measuring income disparities, or the unemployment rate), which can be subdivided into two types:
    • census data, where every single case of the whole population is measured (complete inventory count)
    • data from samples, which should fit the context of the research question and be as representative as possible; and
  • event data, which give a frequency of events within a given period, mostly gathered from media sources (event data have become important sources for analysis, especially in international conflict research).

Basic Problems with Aggregate Data

There are some basic problems in comparative studies using aggregate data that have to be taken seriously. The first problem is whole-nation bias, which results from ignoring subnational variation; loss of information and reduced complexity are the consequences. Second, aggregate data are often measured in monetary units (e.g., for expenditure, import or trade volumes). For reasons of comparability between countries, the different currencies are most often translated into dollars or euros, which poses the problem of finding an adequate exchange rate for standardizing to the dollar or euro. One such standardization uses purchasing power parities, which may be misleading. Another alternative for comparing aggregated data is to standardize using a percentage of a national product measure. The choice and correct measurement of such indicators (e.g., gross domestic product [GDP] or gross national product [GNP]) are difficult and problematic; official data do not account for huge differences such as those concerning black markets or forms of subsistence income.

Intercountry comparability is also undermined by the use of different bases of calculation and a dissimilar inclusiveness of the indicators. An example makes this clear: The military budget is not identical with the budget of the Ministry of Defense. There may be a number of military expenditures (e.g., for the suppression of terrorism) that fall under the authority of other departments, such as the Ministry of the Interior. Another complication is the fact that the allocation of funds for such expenditure may vary from one country to another. The varying levels of quality in collecting data among countries can serve as a source of heteroskedasticity (i.e., a nonconstant variance where one expects a constant variance in the data). Therefore, different methods of data collection (e.g., estimation instead of more exact measurement) lead to less reliable data and, in the end, to poorer estimations in regression analysis.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading