ADOLESCENT CIGARETTE USE:
EFFECTS OF RACE/ETHNICITY AND SCHOOL
RACIAL/ETHNIC COMPOSITION

Robert A. Johnson
Abt Associates Inc.

Abstract

Research has shown there exists substantial variability in substance use among U.S. schools but has been unable to separate variability due to differences among schools from variability due to differences among the populations that are served by schools. To disentangle the effects of explanatory variables operating at the levels of individuals, families, and schools, we applied multilevel models (mixed models or hierarchical models) to self-report data on daily cigarette use collected in the eighth and tenth grade panels of the National Educational Longitudinal Study (NELS). The response variable is cigarette initiation (first use), coded "1" if the adolescent initiated daily cigarette use (at least one cigarette per day) during the interval between the baseline NELS interview and the reinterview conducted two years later; and coded "0" if the adolescent did not smoke on a daily basis at either wave. The final model used 18 explanatory variables- including 7 individual, 4 family, and 7 school variables- and incorporated "cross-level interaction effects"- interactions between school and family/individual variables- and variance components gauging differences among schools in the effects of individual and family variables.

This paper focuses on a subset of the results, those pertaining to the effects of racial/ethnic minority status and the interactions between minority status and the racial/ethnic composition of students in the school. The findings support research showing that Asian/Pacific Islander, black, and Hispanic adolescents are substantially less likely than white non-Hispanic adolescents to report cigarette use. In each of the NELS eighth and tenth grade panels, API and black adolescents are less than half as likely- and Hispanics about three-fourths as likely- to initiate daily cigarette use as other adolescents. The findings go beyond previous research by showing that the deterrent effects of minority status on cigarette use are much larger among minority adolescents who attend predominantly minority schools. Especially in the NELS tenth grade panel, where the effect is to reduce cigarette risk by one-half, minority schools appear to reinforce the effect of an individual's minority status, so minority students are at even lower risk of cigarette use if they attend schools with a high percentage of minority students. The results also suggest that minority schools help to mitigate the adverse effects on cigarette risk of two social conditions- fewer than two parents living at home and school dropout- that are prevalent within the largest racial/ethnic minority populations of the U.S.

1. Introduction. Research on adolescent cigarette use has been concerned mainly with the effects of individual and family variables. Many studies document associations between cigarette use and family attachment, school involvement, peer smoking, and other individual and family variables (e.g., Akers and Lee 1996; Ennett and Bauman, 1993). Recent research suggests that cigarette use also varies by type of school (Ennett et al. 1997; Skager and Fisher 1989).

A drawback is the failure to link school with individual and family explanatory variables. To reduce omitted-variables bias, factors operating at the different levels need to be included in the same model. School variables may also condition the effects of variables operating at the individual and family levels. For example, one might expect racial/ethnic differences in cigarette use to be intensified in schools with a high percentage of minority students. If minority adolescents are less likely than others to use cigarettes (Fendrich and Vaughan, 1994(1); Johnson and Larison, 1998), then predominantly minority schools may reinforce this effect by offering a normative climate in opposition to cigarette use, a "cross-level interaction effect" (Bryk and Raudenbush, 1992).

The multilevel modeling approach applied in this paper (Bryk and Raudenbush 1992; Goldstein 1995; Kreft and De Leeuw 1999) has a number of advantages, including variance estimates that take into account the data hierarchy, such as clustering of sample students within schools. For our purposes- and those of public policy research generally- the most important advantage may be that multilevel models can yield consistent estimates of cross-level interaction effects (see the next section). We applied multilevel models to the National Educational Longitudinal Study (NELS) to explore how individual, family, and school characteristics affect adolescent cigarette use. This paper reports our findings about the effects of race/ethnicity and school racial/ethnic composition.(2)

Adolescent cigarette use is only one of many social outcomes that are relevant to policies affecting the racial/ethnic composition of schools. Attempts to manipulate school composition as a matter of law and policy have a contentious history in the U.S. (Bok, 1996). Supreme Court rulings in the 1960s favored racial integration of the public schools, and the percentage of black students enrolled in predominantly (more than half) minority schools declined from about 77% in 1968 to 64% in 1973. However, school segregation has remained roughly constant since the early 1970s (Orfeld, 1993). Based on NELS, the percentages of API, black, Hispanic, and other eighth graders who were in predominantly minority schools in 1988 were 34%, 64%, 65%, and 40%, respectively. The corresponding percentages of tenth graders in 1990 were 41%, 66%, 68%, and 37%.

The following sections discuss the advantages of multilevel models for policy research, the data and methods of our application, and the results.

2. Multilevel models in policy research. The standard single-level regression model has long been the model of choice in policy-oriented research. Let yij be a continuous response measured for the i-th student in the j-th school; xij an individual-level explanatory variable measured for the same student; and zj a school-level variable measured for the j-th school. The model can be written

______________ yij = a + b xij + c zj + d (xij zj ) + eij , _________________ (1)

where eij is student-level error with zero mean; and a, b, c, and d are regression coefficients. The error eij represents student-level variables that are not included in the model and that affect yij.

It is instructive to write the single-level model as a "pseudo two-level model" by defining a school-specific intercept aj equal to (a + c zj ):

______Level 1 (students):____ yij = aj + b xij + d (xij zj ) + eij

______Level 2 (schools):_____aj = a + c zj ___________________________(1')

But (1) is not a true multilevel model because there is no random error at Level 2. The single-level model allows unmeasured variables at the student level, but not at the school level.

The multilevel approach introduces the idea of separate regressions in each school or context:

______Level 1 (students):___yij = aj + bjxij + eij

______Level 2 (schools):____aj = a + c zj + u1j

________________________bj = b + d zj + u2j _____________________ (2)

The key property of the level-1 equation is that the regression intercept and slope of yij on xij each have a subscript "j," which implies that these parameters can vary across schools. The level-1 regressions are linked by a level-2 model, where the regression coefficients of the level-1 model are themselves regressed on the school explanatory variable zj . As in the single-level model, additional assumptions are needed to estimate the model, the main ones being that the level-1 error (eij) and level-2 errors (u1j and u2j) are uncorrelated with each other and with the explanatory variables.

For comparison with the single-level model, we can write the two-level model as a single equation by substituting the right-hand-sides of the level-2 equations for ajand bj in the level-1 equation:

_______________ yij = a + b xij + c zj + d (xij zj ) + (eij + u1j + xij u2j).______(2)

Comparing (1) with (2) shows the only difference is in the assumed error structure.

An important parameter for policy is d, the cross-level interaction. Individuals and families in the U.S. are afforded many legal protections, so schools are the principal lever of drug prevention policy. Cross-level interactions may be critical paths by which school policies can impact individual behavior. Yet, if d is estimated using (1) when the true error is that of (2), the estimate is inconsistent, because the error in (2) is correlated with (xij zj ). Thus, if there exist unmeasured school variables- as there almost surely are- good estimates of cross-level interaction effects might not be possible using a single-level model.

3. Data and methods

a. Sample and data collection design. The longitudinal design of NELS (NCES, 1992) allows us to gauge changes in cigarette use between measurement waves and to control for whether or not respondents used cigarettes at the prior wave. Most research on cigarette use in the U.S. has used cross-sectional rather than longitudinal data, perhaps because the National Household Survey on Drug Abuse (NHSDA) and Monitoring the Future (MTF)-two major surveys designed to measure substance use-are cross-sectional in design. Yet retrospective reporting can bias responses about past drug use obtained from cross-sectional surveys (Fendrich and Vaughn, 1994; Johnson et al. 1998).

We split the NELS longitudinal file into two panels to investigate cigarette use separately among eighth graders in 1988 and tenth graders in 1990. Eighth grade panel members were first interviewed as eighth graders in Fall 1988 (Wave 1) and reinterviewed two years later (Wave 2). Tenth grade panel members were first interviewed as tenth graders in Fall 1990 (Wave 1) and reinterviewed two years later (Wave 2).(3) Both panels followed up school drop-outs. About 6.8% of eighth grade panel members and 10.4% of tenth grade panel members dropped out before Wave 2. The adolescent interviews used traditional personal interviewing techniques. Personal interviews of parents and school administrators were also conducted at Wave 1 of each panel.

Both panels are based on a two-stage national probability sample of U.S. students: Stratified random sampling of schools was followed by random sampling of eligible students within schools. In our analysis, the eighth grade panel consists of 17,424 adolescents in 1,014 schools who responded to both interviews. The tenth grade panel consists of 16,542 adolescents in 1,464 schools who responded to both interviews. We used standard NELS weights (NCES, 1992) to adjust for unit nonresponse and unequal selection probabilities. We used techniques described in Pfeffermann et al. (1997)- as implemented in the program MLWIN (www.ioe.ac.uk/mlwin) to incorporate the NELS weights in the multilevel model estimation.

b. Measurement of daily cigarette use. Daily cigarette use at each wave of each panel was measured based on responses to the question "How many cigarettes do you usually smoke in a day?" We collapsed the response categories at each wave to form a binary variable: 1 = One or more cigarettes per day; 0 = Not a daily smoker. Item nonresponse was small, ranging from 2.3% at Wave 1 of the eighth grade panel to 7.1% at Wave 2 of the tenth grade panel. To impute the missing data, we used techniques for multilevel models described by Schafer (1996, 1997).(4)

c. Measurement of race/ethnicity and school racial/ethnic composition. The adolescent's race/ethnicity was measured at Wave 1 using response categories similar to the 1990 Census. This report distinguishes four categories- Asian and Pacific Islander (API), black, Hispanic, and other.(5) School racial/ethnic composition is based on the Wave-1 school administrator interviews. The administrators were asked about the percentage of students who were non-white or Hispanic. To reduce measurement error, we recoded the responses to form a binary variable, equal to 0 if the minority percentage was less than 50% and equal to 1 otherwise. Missing values were imputed based on the distributions of races and ethnicities reported by sample students in the school.

d. Measurement of other explanatory variables. There are seven additional explanatory variables at the individual and family levels: gender (based on Wave-1 adolescent interviews); family income (Wave 1 parental interviews); two biological parents at home (Wave-1 adolescents)(6);school dropout (Wave-2 follow up); negative peer relations scale (Wave-1 adolescents- number of affirmative answers to five questions about how school peers viewed the respondent, e.g., as a poor student); school participation scale (Wave-1 adolescents- number out of nine activities, e.g., athletics, music); and parental support scale (Wave-1 adolescents- number of affirmative answers to ten questions about the respondent's parent(s), e.g., whether a parent helped with home work).(7)There are six additional explanatory variables at the school level: region; type of place (central city v. other metro v. nonmetro); type of school (public v. Catholic v. other); school size; average beginning teacher salary; and student-teacher ratio. The first four are from the school sampling frame (NCES, 1992); the last two from administrator interviews.(8) Descriptive statistics for all variables- means, variances, and intercorrelations- are in Johnson and Hoffmann (1999). Prior to analysis, continuous variables were "centered" by subtracting their means (Bryk and Raudenbush, 1992).

e. Statistical models. This paper presents results based on two multilevel models- called Model 1 and Model 2. Model 1 is a 2-level "variance-components model" with a logit-linked binary response and one fixed covariate. We use Model 1 to underscore the importance of cigarette initiation as a response variable, so it is convenient to present both the model and the results in this section. Let yij denote a binary (0-1) response variable indicating daily cigarette use at Wave 2 and let pij denote the corresponding probability of using cigarettes daily at Wave 2. Let xij denote a binary (0-1)

response variable indicating daily cigarette use at Wave 1. The model is written

____________Level 1 (adolescents):____yij = pij + eij

_________________________________logit(pij) = aj + b xij

____________Level 2 (schools):_______aj = a + uj ,____________________(3)

where logit(pij) = log(pij /(1 - pij)); "log" denotes the natural logarithm; eij is a level-1 random error; and uj is a level-2 random error. We assume that yij is distributed as an extra-Bernoulli variable with mean pij, so eij has mean 0 and variance SIGMA e2 = k pij (1 - pij ).(9) We also assume that uj is normal with mean 0 and variance u2 and that the level-1 and level-2 errors are independent.(10)

Table 1 shows the Model 1 parameter estimates.(11) The slope parameter b gauges the dependence of daily cigarette use on daily cigarette use two years earlier. For the eighth grade panel, the estimated b of 2.77 corresponds to an odds-ratio of current relative to past smoking of about exp(2.77) = 16. That is, an eighth grade panel member is about 16 times more likely to be a daily smoker if he (she) was a daily smoker two years ago than if he was not. For the tenth grade panel, the corresponding estimate equals about exp(2.99) = 20. The increase in the odds-ratio may reflect that addiction becomes more severe the longer an individual uses cigarettes. If so, it makes sense to try to prevent adolescents from ever using cigarettes for the first time.(12) Past cigarette use is such a strong predictor of current use that it might be misleading to include past smokers and nonsmokers in the same model. We examined separate models for cigarette initiation and cessation and found that most explanatory variables interact with past use. NELS data on initiation are more plentiful than data on cessation, so this paper focuses on initiation.

Model 2 uses daily cigarette initiation between Waves 1 and 2 -rather than cigarette use at Wave 2- as the response variable. That is, yij equals 1 if the adolescent began daily cigarette use between Waves 1 and 2; and yij equals 0 if the adolescent was a daily nonsmoker at both waves. The analysis is restricted to daily nonsmokers at Wave 1, which reduces the sample size from 17,424 to 16,454 in the eighth grade panel and from 16,542 to 13,840 in the tenth grade panel. Model 2 also extends Model 1 by adding individual and family explanatory variables at level 1 and school explanatory variables at level 2. We assume P level-1 explanatory variables, denoted xpij, p = 1, ..., P; and Q level-2 explanatory variables, denoted wqj, q = 1...,Q. Model 2 is written:

______Level 1 (adolescents):____yij ___=_ ij + eij

___________________________logit(pij) = aj + SUMp bpj xpij

______Level 2 (schools):____aj = a + SUMq c0q wqj + u0j

________________________b1j = b1 + SUMq c1q wqj + u1j

_____________________________ . . .

________________________bPj = bP + SUMq cPq wqj + uPj , __________(4)

where the summations extend from p = 1 to p = P at Level 1 and from q = 1 to q = Q at Level 2.

The first level of (4) is similar to (3), except that pij - the probability of initiation- depends upon a school-specific intercept- aj - and upon school-specific slopes- b1j through bPj. In the (P + 1) level-2 equations, the level-1 regression intercept and slopes are themselves treated as response variables. Each is regressed on Q school-level explanatory variables. For example, in the equation for b1j, b1 is the average across schools of the slope of logit(pij) on x1ij; c11 is the effect on b1j of a unit increase in w1j; and u1j is the level-2 random error associated with b1j.(13)

Substituting the right-hand-side of each level-2 equation of (4) into Level 1 expresses logit(pij) as a function of the xpij's, the wqj's, and their products- the xpijwqj's:

logit(pij)= aj + SUMp bpj xpij + SUMq c0qwqj

____________+ SUMp SUMq c0q (xpij wqj) + u0j + SUMp xpijupj.__________(4)

The coefficients of the xpijwqj's - the cpq's - gauge the cross-level interaction effects, showing how school variables amplify or dampen the effects of individual and family variables.

The next section presents a simplified version of Model 2 in which interactions and school-level variances and covariances that were not statistically significant in either NELS panel have been omitted from the model. In analyzing each panel, we tested each fixed and random parameter in the full model using Wald tests and found that only nine cross-level interactions and five school-level variance components were significant in one or both panels.

4. Results. Table 2 presents parameter estimates for Model 2. Results are presented both on the logit scale and on the scale of odds-ratios. For example, on the logit scale, the effect of dropout status on cigarette initiation equals 1.13 in the eighth grade panel. This implies that, after controlling for other explanatory variables, dropouts in the eighth grade panel were about exp(1.13) = 3.1 times as likely as nondropouts to initiate daily use between Wave 1 and Wave 2.

Table 2 supports previous research showing that racial/ethnic minority status, two-parent families, parental support, school participation, urban residence, residence in the West, and predominantly minority schools are associated with reduced cigarette risk Negative peer associations, school dropout, Catholic schools, and higher student/teacher ratios are associated with increased risk. The effects of family income, school size, and beginning teacher salary are not significantly different from zero in either panel. Most significant associations appear fairly stable across panels, with gender being the most notable exception.(14)

Yet the estimated main effects can be misleading unless cross-level interactions and school-level variance components are taken into account. The cross-level interactions (fourth set of estimates in Table 2) show that four school variables- school minority percentage, residence in the West, student/teacher ratio, and urban residence- condition the effects of one or more individual and family explanatory variables in one or both panels. The most important effects of school variables appear not as main effects but as cross-level interactions. The school variance components (fifth set) show there is significant unexplained school-level variation in the overall level of cigarette initiation and in the effects on initiation of parental support, school participation, gender, and minority status.

Tables 3 and 4 re-express Table 2's results on the effects of school racial/ethnic composition in the forms of (a) percentages initiating cigarette use and (b) odds-ratios comparing minority with other adolescents. Table 3 shows that, in predominantly minority schools, the deterrent effect of racial/ethnic minority status on cigarette initiation becomes stronger. This is especially true in the tenth grade panel, where the percentage of black adolescents initiating daily cigarette use increases from 1.9% among those attending minority schools to 4.7% among those attending non-minority schools. The advantage of odds ratios over percentages is that they do not depend on the values of other explanatory variables (see Table 3 footnote). In the tenth grade panel, the overall odds on initiation of a black adolescent equals 0.31 times the odds of an other (non-API, non-black, and non-Hispanic) adolescent, but this odds ratio declines to 0.19 among those attending minority schools.

Table 4 presents a more detailed portrait of school composition effects, taking into account differences by region, dropout status, family type, and gender. The statistics presented are the percentages of minority adolescents initiating cigarette use and the odds ratios on initiation of minority (API, black, or Hispanic) relative to other adolescents. Both statistics are based upon Model 2's estimates of the main effects of six variables- minority status, school minority composition, region, dropout status, family type, and gender- and the interaction effects involving one or both of minority status and school minority composition. Since the school composition variable does not distinguish among predominantly API, black, and Hispanic schools, the percentages and odds ratios in Table 4 are weighted averages across API, black, and Hispanic adolescents. The positive Minority-by-West interaction (see Table 2) does indicate higher cigarette risk among minority adolescents in the West, where Hispanics are most heavily concentrated.

Comparisons of the percentages in Table 4 suggest that minority schools reduce the adverse effects of two social conditions- fewer than two parents at home and school dropout. For example, among minority female eighth graders who reside in non-Western regions, the effect of having fewer than two parents at home is to increase the percentage initiating cigarette use by 2.2 percentage points- from 6.2% to 8.4%- if the adolescent attends a non-minority school, but the increase is only 0.4 percentage points- from 4.8% to 5.2%- if she attends a minority school. In the same subpopulation, the effect of dropping out is to increase the percentage by 23.8 percentage points- from 8.4% to 32.2%- if the minority female attends a non-minority school but the increase is only 8.1 percentage points- from 5.2% to 13.3%- if she attends a minority school. The odds ratios suggest male dropouts are an exception to the generalization that minority adolescents are less likely to initiate cigarette use. Yet, regardless of gender, the adverse effect of school dropout is greatly reduced among those who attended minority schools.

In conclusion, the results suggest that schools with supportive environments for minority students can override some adverse influences arising from the adolescents' homes and neighborhoods or from the larger society. Yet two limitations of our measures and models point to the need for further research. First, our measure of negative peer relations pertains strictly to the respondent's perceptions of how peers in the same school regard him or her. Second, NELS provided no measure of parental cigarette use. Incorporating measures of peer cigarette use, relations with peers who do not attend the same school, and parental cigarette use might improve the model.

5. References

Akers, R and G Lee 1996 "A longitudinal test of social learning theory: adolescent smoking." Journal of Drug Issues 26:317-343.

Bok, D, 1996. The State of the Nation. Cambridge: Harvard U.

Bryk, A. and S. Raudenbush. (1992). Hierarchical Linear Models. Newbury Park, CA: Sage.

Ennett, S. and K. Bauman. 1993. "Peer group structure and adolescent cigarette smoking: a social network analysis." Journal of Health and Social Behavior 34: 226-236.

Ennett S, R Flewelling, R Lindrooth, E Norton 1997. "School characteristics associated with school rates of alcohol, cigarette, and marijuana use." J. Health and Social Behavior 38: 55-71.

Fendrich, M. and C. Vaughn, 1994. "Diminished lifetime substance use over time: an inquiry into differential underreporting." Public Opinion Quarterly 58: 96-123.

Goldstein, H. 1995. Multilevel Statistical Models. London: Edward Arnold..

Johnson, R. and D. Gerstein. 1998. "Initiation of use of alcohol, cigarettes, marijuana, cocaine and other substances in US birth cohorts since 1919." Am. J. Public Health 88:27-33.

Johnson, R., D. Gerstein, and K. Rasinski. 1998. "Adjusting survey estimates for response bias" Public Opinion Quarterly 62:354-377.

Johnson, R. and C. Larison (1998). Prevalence of Substance Use among Racial and Ethnic Subgroups in the United States, 1991-93. Rockville, MD: SAMHSA/OAS.

Johnson, R. and J. Hoffmann, 1999. "School, family, and individual risk factors in adolescent cigarette use: a multilevel analysis." Under review by Journal of Health and Social Behavior.

Kreft, I. and J. De Leeuw, 1999. Introducing Multilevel Models. London: Sage

National Center for Educational Statistics. 1992. NELS 1988 User's Manual: First Followup Student Component, NCES 1992-030. Washington, DC: U.S. Department of Education.

Orfeld, G., 1993. The Growth of Segregation in American Schools: Changing Patterns of Separation and Poverty since 1968. Cambridge: Harvard U.

Pfeffermann, D., C. Skinner, D. Holmes, H. Goldstein, and J. Rasbash, 1997. "Weighting for unequal selection probabilities in multilevel models." J. Royal Stat. Society, B, 23-40.

Schafer, J. 1996. Analysis of Incomplete Multivariate Data. London: Chapman and Hall.

Schafer, J. 1997. "Imputation of missing covariates under a multivariate linear mixed model." Department of Statistics. University Park, PA: Pennsylvania State University.

Skager, R. and D. Fisher. 1989. "Substance use among high school students in relation to school characteristics." Addictive Behaviors 14: 129-138.

Table 1. Model 1 estimates. Daily cigarette use at Wave 2. a = average school intercept.b = slope of Wave 1 cigarette use. su2 = school-level variance. Standard errors in parentheses. NELS 1988.
Eighth grade panel Tenth grade panel
a b s u2 a b su2
-1.77 (.03) 2.77 (.08) 0.09 (.02) -1.95 (.03) 2.99 (.06) 0.03 (.01)
Table 2. Model 2 estimates. Initiation of daily cigarette use between Waves 1 and 2. NELS 1988. Odds ratios (OR) gauging change between panels.
Parameter Eighth grade Tenth grade Change between panels

1. Individual and family variables

Intercept -1.35_(.06) -1.85_(.07) Base %:_ 21% to 14%*
Race/ethnicity: API -1.00_(.15) -0.67_(.15) OR:_0.36 to 0.51
_____________ Black -1.32_(.13) -0.98_(.14) OR:_0.27 to 0.38
_____________Hispanic -0.53_(.12) -0.10_(.13) OR:_0.59 to 0.90
Male -0.23_(.06) 0.16_(.06) OR:_0.79 to 1.17*
Negative peers (max v. average) 0.92_(.08) 0.42_(.11) OR:_2.51 to 1.52*
Participation (add 2 activities) -0.07_(.04) -0.21_(.05) OR:_0.93 to 0.81*
Dropout before Wave 2 1.13_(.12) 0.98_(.12) OR:_3.10 to 2.66
2 biological parents at home -0.31_(.05) -0.24_(.06) OR:_0.73 to 0.79
Parental support (max v. average) -0.14_(.04) -0.10_(.04) OR:_0.87 to 0.90
Family income- lowest quartile 0.07_(.06) -0.12_(.07) OR:_1.07 to 0.89*
____________- highest quartile -0.03_(.06) 0.08_(.07) OR:_0.97 to 1.08
2. Interactions between individual and family variables
Parental support by Participation -0.11_(.04) -0.07_(.05) OR:_0.90 to 0.93
Male by Minority by Dropout 1.08_(.24) 0.80_(.22) OR:_2.94 to 2.22
3. School variables
West region -0.44_(.09) -0.17_(.10) OR:_0.64 to 0.84
Urban place -0.08_(.07) -0.18_(.09) OR:_0.92 to 0.84
Catholic school 0.29_(.13) 0.24_(.15) OR:_1.34 to 1.27
School % minority > 50% -0.30_(.17) -0.23_(.17) OR:_0.74 to 0.79
Small size: < 600 students -0.11_(.07) -0.04_(.11) OR:_0.90 to 0.96
Beginning teach salary: $1000s -0.01_(.01) 0.01_(.01) OR:_0.99 to 1.01
Stud-teach ratio (increase by 10) 0.08_(.04) 0.04_(.06) OR:_0.92 to 0.96
4. Cross level interactions
Minority X School % minority -0.21_(.17) -0.69_(.17) OR:_0.81 to 0.50*
Male X School % minority 0.26_(.14) 0.32_(.15) OR:_1.30 to 1.38
2 biol. pars X School % minority 0.23_(.13) 0.34_(.15) OR:_1.26 to 1.40
Dropout X School % minority -0.62_(.23) -0.29_(.21) OR:_0.54 to 0.75
Minority X West region 0.48_(.16) 0.14_(.17) OR:_1.62 to 1.15
Parental support X Stud/teach -0.04_(.05) -0.13_(.06) OR:_0.96 to 0.88
Participation X Stud/teach 0.08_(.04) 0.07_(.07) OR:_1.08 to 1.07
Parental support X Urban place 0.17_(.08) 0.11_(.08) OR:_1.19 to 1.12
Dropout X Urban place 0.52_(.22) 0.24_(.21) OR:_1.68 to 1.27
5. Random parameters- Variability among schools
Var(Intercept)______ 0.23_(.04) 0.08_(.02) Difference = -0.15 (.04)*
Var(Parental support) 0.13_(.03) 0.12_(.02) Difference = -0.01 (.04)
Var(School participation) 0.15_(.03) 0.13_(.03) Difference = -0.02 (.04)
Var(Male) 0.34_(.07) 0.19_(.05) Difference = -0.15 (.09)
Var(Minority) 0.45_(.11) 0.13_(.06) Difference = -0.32 (.12)*
Corr(Intercept, Male) -0.43_(.15) -0.38_(.24) Difference = 0.05 (.28)
*Significant based on two-sample two-tail t-test assuming independent samples, alpha = .05

Table 3. Model 2 estimates. Percentages of API, black, Hispanic, and other adolescents initiating daily cigarette use between Waves 1 and 2. Odds ratios. By school racial/ethnic composition.* NELS 1988.

School racial/ethnic

composition

Race/ethnicity
API Black Hispanic Other
%___OR %__OR %___OR %___OR

Eighth grade panel

Total schools 7.6%__0.37 5.4%__0.26 11.1%___0.56 18.0%__1.0
Minority % > 50 5.0%__0.30 3.7%__0.22 7.8%___0.48 14.9%__1.0
Minority % 50 or less 8.0%__0.37 6.0%__0.27 12.2%___0.59 19.0%__1.0
Tenth grade panel
Total schools 5.0%__0.43 3.7%__0.31 8.4%___0.76 10.8%__1.0
Minority % > 50 2.6%__0.26 1.9%__0.19 4.5%___0.46 9.3%__1.0
Minority % 50 or less 6.3%__0.51 4.7%___0.38 10.4%___0.90 11.4%__1.0
*Since continuous covariates have been centered at their means, percentages pertain to an average non-dropout female with fewer than two parents who attended a non-Catholic school in an urban area outside the West.

Table 4. Model 2 Estimates. Percentages of minority adolescents initiating daily cigarette use and odds ratios of minority vs. other. By school minority composition, gender, region, dropout, and family type. *

Region
Dropout

status

Family

type

School % minority 50% School % minority > 50%
Female Male Female Male
%___OR %___OR %___OR %___OR
Eighth grade panel
NE, NC, S Non-drop < 2 parents 8.4%__0.38 6.7%__0.38 5.2%_0.31 5.3%_0.31
"" "" 2 parents 6.2%__0.38 5.0%__0.38 4.8%_0.31 5.0%_0.31
"" Dropout < 2 parents 32.2%_0.4 52.6%_1.1 13.3%_0.3 31.8%_0.91
"" "" 2 parents 25.9%_ 0.4 44.9%_1.1 12.4%_0.3 30.0%_0.91
West Non-drop < 2 parents 8.7%__0.61 7.0%__0.61 5.4%_0.50 5.6%_0.50
"" "" 2 parents 6.5%__0.61 5.2%__0.61 5.0%_0.50 5.2%_0.50
"" Dropout < 2 parents 33.2%_0.6 53.6%_1.8 13.8%_0.5 32.7%_1.5
"" "" 2 parents 26.7%_0.6 45.9%_1.8 12.8%_0.5 30.9%_1.5
Tenth grade panel
NE, NC, S Non-drop < 2 parents 6.5%__0.52 7.5%__0.52 2.7%_0.26 4.3%_0.26
"" "" 2 parents 5.1%__0.52 6.0%__0.52 2.9%_0.26 4.7%_0.26
"" Dropout < 2 parents 18.9%_0.5 37.7%_1.2 6.5%_0.3 20.1%_0.6
"" "" 2 parents 15.4%_0.5 32.2%_1.2 7.1%_ .3 21.6%_0.6
West Non-drop < 2 parents 6.2%__0.60 7.2%__0.60 2.6%_0.30 4.1%_0.30
"" "" 2 parents 4.9%__0.60 5.8%__0.60 2.8%_0.30 4.5%_0.30
"" Dropout < 2 parents 18.3%_0.6 36.8%_1.3 6.3%_0.30 19.5%_0.7
"" "" 2 parents 15.0%_0.6 31.3%_1.3 6.9%_0.30 21.0%_0.7
*Percentages pertain to an average adolescent who attended a non-Catholic school in an urban area.

1. Fendrich and Vaughan (1994) concluded- based on the National Longitudinal Study of Youth- that lower substance use among blacks may be due partly to differential underreporting.

2. A full discussion of results is in Johnson and Hoffmann (1999).

3. The samples of schools in the eighth and tenth grade panels are disjoint, because middle and high schools do not overlap, but the samples of students are overlapping. About 90 percent of responding students in each panel were also responding students in the other panel. Given the positive correlation of cigarette use at different stages of adolescence, the sample overlap results in increased precision for comparing panels. The significance test results presented in this paper are conservative in that we treat the two panels as independent samples. The significance tests are also conservative-relative to single-level analyses-because the standard errors of test statistics correctly reflect the clustering of students within schools in the sample design.

4. For each panel, we first generated predicted values using a bivariate normal multilevel model with two response variables-daily cigarette use at waves 1 and 2-and twelve explanatory variables, including family structure, dropout status, parental support, school participation, negative peer associations, race/ethnicity, region, type of school, percent of minority students, student-teacher ratio, size of school, and teacher salary level. The effects of six individual/family variables were treated as random at the school level. The continuous imputed values were rounded to 0 or 1. We also generated three sets of imputations for each panel, and, using multiple imputation techniques (Schafer, 1997), found that the additional uncertainty contributed by the imputation amounted to less than 10% of each standard error reported in Table 1.

5. The "other" category is more than 98% white non-Hispanic in both panels, with the rest Native American. Based on NHSDA, Native Americans- like whites- are high in cigarette use (Johnson and Larison, 1998).

6. Living arrangements were measured at Wave 1 based on adolescents' responses to the question "Which of the following people live in the same household with you?" Preliminary analyses showed that detailed family types, such as mother only and mother-stepfather, did not differ significantly in their effects on adolescent cigarette use and the presence or absence of siblings was not a significant predictor, so our analysis distinguishes only those who lived with two biological parents (coded 1) from all other arrangements (coded 0).. Missing values were few- 1.5% of eighth graders and 0.7% of tenth graders- and these were imputed as 0's.

7. Each of the three scales has high internal reliability, with Cronbach's alpha greater than 0.65 in each panel. Except for family income (10% missing in the eighth grade panel, 17% in the tenth grade panel), missing data rates of individual-level explanatory variables were less than 3% in each panel. We imputed missing values of scale items using the mode of respondents with nonmissing values. We imputed missing values of family income using predicted values from a linear mixed model (Schafer, 1997), treating the log of income as normally distributed and allowing the intercept to vary among schools. Details are in Johnson and Hoffmann (1999).

8. Region, type of place, type of school, and school size had no missing data in either panel, but missing data rates of salary and student-teacher ratio were appreciable in the tenth grade panel: 27% for salary and 7% for student-teacher ratio. The missing values on these two variables were imputed using mean imputation within imputation cells defined by region, type of place, and type of school.

9. In both panels, k was estimated to be very close to 1.0.

10. Inspection of residual plots suggested that the assumptions of normality and constant variance are reasonable for the level-2 errors in both models presented in this paper.

11. Multilevel parameter estimates reported in this paper are second-order penalized quasi-likelihood estimates ("PQL2"), as discussed in Goldstein (1995) and implemented in MLWIN. The estimates were corroborated using two alternative methods- bootstrap and Markov Chain Monte Carlo- also in MLWIN.

12. Another finding from Table 1 is that the school variance in daily cigarette use- after controlling for past use- is much larger among eighth graders than among tenth graders- 0.09 vs. 0.03. This suggests that opportunities for school interventions to prevent cigarette use are greater in middle schools than in high schools.

13. Model 2 also assumes 1) the level-1 random error eij is independent of each of the level-2 random errors- u0j through uPj; 2) level-1 and level-2 random errors are independent of all measured explanatory variables; and 3) the vector of level-2 random errors upj , p = 0, 1, ..., P- is multinormal with zero means; variances SIGMAup2, p = 0, 1,..., P; and covariances SIGMAupp, where p and p range from 0 to P and p does not equal p.

14. The effect of male gender shifts from negative in the eighth grade panel- -0.23- to positive in the tenth grade panel- 0.16. This finding is consistent with NHSDA data suggesting that, beginning in the 1980s, females tended to initiate cigarette use at earlier ages than males (Johnson and Gerstein, 1998).