Skip to main content icon/video/no-internet

Many quantities of interest in sociology, psychology, public health, and medicine are unobservable but well-conceived characteristics such as attitudes, temperament, psychological diagnoses, and health behaviors. Such constructs can be measured indirectly by using multiple items as indicators of the unobservable characteristics. The latent class (LC) model can classify individuals into population subgroups based on these unobservable characteristics, by using their responses to questionnaire items that are related to those characteristics. Suppose, for example, we are interested in the construct of nicotine dependence and want to classify respondents into groups corresponding to different types of nicotine dependence we believe are indicated by a number of behaviors relating to cigarette use. We can collect data on several items pertaining to cigarette use and then apply an LC model to this data to identify two or more nicotine dependence types to which smokers might belong. We can use this LC model to classify not only the subjects in our study but also subjects in other studies. The LC model has been widely applied in behavioral and biomedical applications, particularly in the area of substance use prevention and treatment. In physical or psychiatric research, the LC model has been a popular strategy for development and evaluation of diagnostic criteria. The LC model can also be a useful way to address many problems of categorical data analysis in population-based epidemiologic studies.

The Mathematical Model

The LC model explains the relationship among manifest items by positing the assumption that the population comprises different classes related to the construct of interest. In other words, the population is assumed to consist of mutually exclusive and exhaustive groups, called latent classes, and the distribution of the items varies across classes. The LC model comprises two types of parameters: the relative size of each class and the probability of a particular response to each item within a class. To specify an LC model, let Y = (Y1, …, YM)) be M discrete items measuring latent classes, where variable Ym takes possible values from 1 to rm: Let C = 1, 2, …, L be the variable of LC membership, and let I(y = k) denote the indicator function that takes the value 1 if y = k and 0 otherwise.

If the class membership were observed, the joint probability that an individual belongs to Class 1 and provides responses y = (y1, …, yM) would be

None

where

γl = P(C = 1) represents the probability of belonging to Latent Class 1 and

None

represents the probability of response k to the mth item given a class membership in 1.

Therefore, the marginal probability of a particular response pattern y = (y1, …, yM) without regard for the unseen class membership is

None

Here, we have assumed local independence—that is, the items are assumed to be unrelated within each class. This assumption is the crucial feature of the LC model that allows us to draw inferences about the unseen class variable.

Model Identifiability

Model identification is imperative in estimating parameters of an LC model. The parameters of an LC model are said to be locally identifiable if the likelihood function is uniquely determined by the parameters within some neighborhood of a particular value of parameters. A necessary (but not sufficient) condition to make the LC model locally identifiable is that the number of possible response patterns of manifest items must be greater than the number of free parameters in the LC model. Even if this necessary condition is satisfied, it is impossible to say a priori whether or not this model is indeed identifiable. A necessary and sufficient condition for local identifiability is that the first derivative matrix of the log-likelihood function with respect to the parameters evaluated at a particular value must have full column rank. When an LC model is not identifiable, the simplest way to achieve identification is to reduce the number of parameters to be estimated by fixing or constraining parameters.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading