Developing Early Literacy Skills: A Meta-Analysis of Alphabet Learning and Instruction

Correspondence concerning this article should be addressed to Shayne B. Piasta, The Ohio State University, Preschool Language and Literacy Lab, 535B Arps Hall, 1945 North High Street, Columbus, OH 43210-1172 (piasta.1@osu.edu)

Shayne B. Piasta is a visiting assistant professor in the School of Teaching and Learning and a research scientist in the Preschool Language and Literacy Lab, both at The Ohio State University, Columbus, USA; piasta.1@osu.edu

Richard K. Wagner is the Associate Director of the Florida Center for Reading Research and Robert O. Lawton Distinguished Professor of Psychology at Florida State University, Tallahassee, USA; rkwagner@psy.fsu.edu

Abstract

Alphabet knowledge is a hallmark of early literacy, and facilitating its development has become a primary objective of preschool instruction and intervention. However, little agreement exists about how to promote the development of alphabet knowledge effectively. A meta-analysis of the effects of instruction on alphabet outcomes demonstrated that instructional impacts differed by the type of alphabet outcome examined and the content of instruction provided. School-based instruction yielded larger effects than home-based instruction; small-group instruction yielded larger effects than individual tutoring programs. We found minimal evidence of transfer of alphabet instruction to early phonological, reading, or spelling skills. Implications for research and practice are discussed.

In the field of early literacy, alphabet knowledge refers to children’s familiarity with letter forms, names, and corresponding sounds, as measured by recognition, production, and writing tasks. Together, such alphabet knowledge represents an important component of emergent literacy (Whitehurst & Lonigan, 1998). Children’s knowledge of letter names and sounds is the best predictor of their later reading and spelling abilities (Hammill, 2004; Scarborough, 1998; Schatschneider, Fletcher, Francis, Carlson, & Foorman, 2004). Letter name and letter sound knowledge predict subsequent literacy skills independently of other important predictors including phonological awareness and oral language (Burgess & Lonigan, 1998; McBride-Chang, 1999; Wagner, Torgesen, & Rashotte, 1994). Preschool and kindergarten students with poor knowledge of letter names and sounds are more likely to struggle with learning to read and be classified as having reading disabilities (Gallagher, Frith, & Snowling, 2000; O’Connor & Jenkins, 1999; Torppa, Poikkeus, Laakso, Eklund, & Lyytinen, 2006). These children tend to fall further behind their peers in reading acquisition, leading to gaps in spelling, reading fluency, vocabulary, and comprehension skills (Stanovich, 1986; Torgesen, 2002).

For these reasons, alphabet knowledge has become an important learning goal for young children. The National Association for the Education of Young Children (NAEYC) recognizes development of the alphabetic principle as a goal for the preschool years, with proficiency in letter discrimination, letter naming, and letter–sound correspondences expected during the kindergarten year (NAEYC, 1998). Standards for Head Start (U.S. Department of Health and Human Services & Administration for Children and Families, 2003) require children to demonstrate awareness of letters as individual symbols and the capability to identify at least 10 letters by name. Both the Early Reading First and Reading First programs set explicit goals for increasing letter knowledge and letter–sound correspondences (U.S. Department of Education, 2002; 2003). Finally, many state curriculum frameworks, as required by the No Child Left Behind Act of 2001, include letter name learning, basic sound knowledge, and letter writing in their literacy standards for prekindergarten and kindergarten students (e.g., Florida Department of Education, n.d.; Massachusetts Department of Education, 2001).

Despite these initiatives, relatively little is known about the impact of early instruction on the development of alphabet knowledge. This fact is especially troublesome as large numbers of children continue to enter kindergarten knowing fewer than half of the letter names and even fewer letter sounds (U.S. Department of Health and Human Services & Administration for Children and Families, 2005, 2006). Approaches to alphabet teaching vary greatly, as there has been disagreement about the appropriateness of early literacy instruction and about what constitutes effective instruction (Bredekamp, 1987; Dickinson, 2002; Elkind & Whitehurst, 2001; Foulin, 2005; Justice, Pence, Bowles, & Wiggins, 2006). The present study synthesized the research literature on the effects of alphabet instruction on both alphabet knowledge and other emergent literacy outcomes. We examined potential moderators of these effects, including factors related to sample and instructional characteristics. We also examined whether effect sizes were affected by publication bias or methodological issues related to study quality.

NELP and the Impact of Alphabet Knowledge Instruction

Despite its importance, alphabet knowledge often is overlooked as a specific outcome of interest compared with other aspects of literacy instruction such as phonological awareness or reading per se (see, e.g., Bus & van IJzendoorn, 1999; National Institute of Child Health and Human Development [NICHD], 2000; Wagner, 1988). An exception is the recent synthesis conducted by the National Early Literacy Panel (NELP), charged with identifying “interventions, parenting activities, and instructional practices that promote the development of…early literacy skills” for children from birth to age 5 (NELP, 2008, p. vi). Alphabet knowledge comprised one of the emergent literacy skill outcomes investigated by the Panel. In synthesizing the results of 24 code-focused early childhood intervention studies meeting inclusion criteria, the NELP study found an overall average effect size of 0.38 (95% confidence interval = 0.18 to 0.58) on alphabet knowledge outcomes. Effects on alphabet learning were found to vary based on demographic characteristics (child age, ethnicity, extent of literacy knowledge, population density of the location in which the study was conducted) as well as whether letter instruction was combined with training in phonological awareness. Other types of interventions investigated by the Panel (e.g., shared reading, general preschool/kindergarten programs) showed no reliable impact on children’s alphabet knowledge.

The NELP (2008) study, however, was limited in four important ways. First, the criteria used to select studies for inclusion in the meta-analysis narrowed its scope considerably. The review included only studies published in peer-reviewed journals, a decision that promotes publication bias, widely recognized as a threat to the validity of meta-analyses (Cooper & Hedges, 1994). Publication bias refers to the problem of selective publication, in which the decision to publish a study is influenced by its results (e.g., the statistical significance of findings; Rosenthal, 1979). When it occurs, the effect sizes of meta-analyses that include only published studies are positively biased (i.e., larger than the true effect size). The review also excluded studies involving participants older than kindergarten age, despite the fact that alphabet instruction, particularly letter sound instruction, may continue into first and later grades, particularly for at-risk students.

Second, as alphabet knowledge was one of many outcomes examined in the NELP study, the impact of instruction on this outcome was not examined in great detail. The NELP study did not distinguish between various alphabet outcomes such as letter name knowledge, letter sound knowledge, or letter writing, instead collapsing all alphabet outcomes into a single construct and thereby precluding examination of differential effects across alphabetic domains.

Third, the overall effect size reported in the NELP report amalgamated across all types of code-focused early childhood interventions (e.g., interventions that did and did not provide alphabet instruction) and did not thoroughly investigate the impact of specific instructional components on alphabet knowledge outcomes, despite evidence that the focus of instruction may be particularly noteworthy. The teaching of specific alphabet skills, for example, may be important in facilitating alphabet knowledge growth. A growing body of research demonstrates reciprocal relations between letter name and sound knowledge and suggests that instruction in letter names may facilitate letter sound learning for those letters whose names also include their sounds (e.g., the /b/ at the beginning of the letter name B or the /f/ at the end of the letter name F), with children using the information contained in the letter names to derive or cue corresponding sounds (e.g., Evans, Bell, Shaw, Moretti, & Page, 2006; Levin, Shatil-Carmon, & Asif-Rave, 2006; McBride-Chang, 1999; Share, 2004; Treiman, Tincoff, Rodriguez, Mouzaki, & Francis, 1998; Treiman, Weatherston, & Berch, 1994). Considerable evidence also suggests reciprocal relations between phonological skills and alphabet knowledge development (Burgess & Lonigan, 1998; Lonigan, Burgess, & Anthony, 2000; McBride-Chang, 1999; Wagner et al., 1994), including potential benefits of phonological training on alphabet skills (Ball & Blachman, 1991).

Although specific alphabet and phonological awareness instructional foci may thus have an impact on alphabet learning, consideration of phonological training represented the sole instructional factor investigated by the NELP study. Specifically, the NELP study disaggregated studies according to whether interventions included phonological awareness training only (effect size [ES] = 0.04), phonological awareness training plus alphabet training (ES = 0.37), or phonological awareness training plus phonics training (ES = 0.57). Effects were not disaggregated for the type of alphabet training provided (i.e., letter names, letter sounds), nor was the effect of pure alphabet instruction studied with respect to alphabet knowledge outcomes, as no studies were found to fit this category.

Studies with alphabet training as the lone instructional focus are especially interesting as they may speak to the causal direction of relations between alphabet knowledge and reading and spelling skills, which has not been well established (Ehri, 1983; Foulin, 2005). Although some theorists place great emphasis on letter learning as requisite for later literacy development (largely based on high correlations among these skills; e.g., Adams, 1990; Ehri, 1998), others have questioned the necessity of such knowledge, and of letter name knowledge in particular, for reading and spelling skill acquisition (Groff, 1984; McGuinness, 2004; Venezky, 1975, 1979). It is also possible that reciprocal relations exist among these skills, as seen in the phonological awareness literature (Burgess & Lonigan, 1998; Wagner et al., 1994). Overall, because alphabet acquisition was only a single aspect of a much broader study, the NELP results were unable to address a number of important questions concerning the impact of instruction on alphabet and other emergent literacy outcomes. Fourth and finally, the NELP review covered only those articles available through March 2003 (C. Schatschneider, personal communication, March 2, 2009). Additional published and unpublished studies have become available since that time.

The Present Study

The present meta-analysis expands the NELP review in four ways. First, we included all studies providing alphabet training and assessing alphabet outcomes through November 2006. Second, we disambiguated effects for five discrete alphabet outcomes (letter name knowledge, letter sound knowledge, letter name fluency, letter sound fluency, and letter writing). Third, we attended to the specific targets of the alphabet instruction provided, whether training in letter names, letter sounds, or both. Fourth and finally, we differentiated between studies providing multicomponential and alphabet-only instruction (i.e., those targeting alphabet knowledge in addition to other literacy domains versus solely targeting alphabet skills). The latter allowed us to examine (a) the effects of including alphabet training as one dimension of more general literacy instruction, (b) potential facilitative effects of phonological training on alphabet knowledge development, and (c) causal relations between alphabet instruction and emergent literacy skill acquisition (including alphabet knowledge development) through synthesis of studies in which only children’s alphabet knowledge was manipulated.

We anticipated significant, positive effects of instruction on children’s alphabet knowledge. Overall, we anticipated slightly larger effect sizes than those reported by NELP, as all studies in the present synthesis included alphabet instruction. We expected our results to be most consistent with those of NELP when considering studies providing multicomponential instruction, given the NELP study’s reliance on such studies, as opposed to those providing only alphabet instruction. With respect to instructional foci, we expected to see larger effects in instances requiring minimal transfer (e.g., letter name instruction on letter name knowledge), to see transfer from letter name instruction to letter sound learning (as based on the extant literature), and to see benefits of combining alphabet and phonological instruction (as found by NELP). Finally, we expected significant, positive effects of alphabet instruction on phonological awareness, reading, and spelling outcomes.

Method

Search and Selection Procedure

An extensive literature search sought to identify published and unpublished studies measuring the effect of instruction on alphabet outcomes. The PsycInfo and ERIC databases were searched from the earliest possible start date through November 2006. The past 10 years of Dissertation Abstracts International were also searched via ProQuest. Titles, abstracts, and keywords were searched for the word alphabet or derivations thereof or the string letter* knowledge, which allowed for inclusion of additional phrases such as letter sound knowledge, letter–sound correspondence knowledge, and so forth. The initial set of 4,686 references was narrowed via the following four selection criteria: (1) study of an instructional program or intervention designed to foster literacy skills in an alphabetic language, (2) a sample that included children (i.e., participants were less than 18 years of age), (3) use of a quantitative research design, either experimental or quasi-experimental, allowing for estimates of effect size as compared with a control condition, and (4) publication in English. Three hundred studies met these initial criteria and were obtained for full review, during which three additional inclusion criteria were evaluated: (1) measurement of discrete alphabet outcomes at posttest, (2) explicit acknowledgment of alphabet training as a component of instruction, and (3) sufficient information to calculate an effect size. Discrete measures of alphabet outcomes included tasks of letter name production or recognition, letter sound production or recognition, letter writing, and letter name or sound fluency (i.e., timed) tasks (see Table 1). Short-term studies (defined as encompassing less than one week of instruction), those using pseudoletters, and those assessing only letter discrimination (e.g., finding the match for a particular letter in an array) or reporting composite measures from which specific alphabet outcomes could not be isolated were excluded. Also, for instances in which results from the same sample or a subsample were reported in multiple studies, only one study was included in the meta-analysis. The search yielded 37 studies meeting all criteria.
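To make the keyword logic concrete, the sketch below (our hypothetical illustration, not part of the original study; the pattern and function names are assumptions) shows one way a bibliographic record could be screened for the word alphabet or its derivations, or for a letter* knowledge phrase.

```python
import re

# Hypothetical illustration of the keyword screen described above: a record is
# retained when its title, abstract, or keywords contain "alphabet" (or a
# derivation such as "alphabetic") or a "letter ... knowledge" phrase such as
# "letter knowledge", "letter sound knowledge", or "letter-sound correspondence knowledge".
SEARCH_PATTERN = re.compile(
    r"\balphabet\w*"                       # alphabet, alphabetic, alphabetical, ...
    r"|\bletter[\w\s-]{0,30}?knowledge\b",  # letter* knowledge phrases
    flags=re.IGNORECASE,
)

def matches_search_terms(record_text: str) -> bool:
    """Return True if a bibliographic record matches the search terms."""
    return bool(SEARCH_PATTERN.search(record_text))

print(matches_search_terms("Predictors of letter-sound knowledge in preschool"))  # True
print(matches_search_terms("Effects of shared book reading on oral vocabulary"))  # False
```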

Table 1

Descriptions of Study Codes Indicating Outcome and Moderator Variables

| Study code | Description |
| --- | --- |
| Outcome(s) | |
| Letter name knowledge | Assessments involving untimed recognition or production of letter names, such as asking children to name individual letters depicted on flashcards |
| Letter sound knowledge | Assessments involving untimed recognition or production of letter sounds, such as asking children to give the associated sound for individual letters depicted on flashcards |
| Letter writing | Assessments involving writing letters in response to oral prompts or timed writing of the alphabet, such as asking children to write the alphabet from beginning to end |
| Letter name fluency | Assessments involving quickly naming letters, such as RAN-letters and DIBELS Letter Naming Fluency |
| Letter sound fluency | Assessments involving quickly producing the sounds of letters, such as adaptations of RAN-letters and DIBELS fluency tasks in which the sounds of letters are rapidly produced instead of the names |
| Phonological awareness | Assessments requiring awareness or manipulation of speech sounds, including rhyming, sound categorization or matching, and sound blending or deletion |
| Reading | Assessments of reading, including decoding, sight word recognition, and oral reading fluency |
| Spelling | Assessments of spelling, including traditional dictation tasks and invented spelling |
| Study quality | |
| Experiment vs. quasi-experiment | Experiments used random assignment to assign participants to conditions, whether as individuals or clusters; quasi-experiments assigned participants to conditions using methods other than random assignment |
| Evidence of selection biases | Pretest or other initial differences between the participants assigned to treatment versus control conditions, which may bias interpretation of posttest differences between the groups |
| Other methodological factors | |
| Alphabet instruction isolated to treatment group | Letter name or sound instruction was provided only to participants in the treatment condition |
| Evidence of potential ceiling effects | Mean(s) on alphabet posttest measure(s) were within 1 SD of the maximum possible score |
| Publication bias | Study was published in a peer-reviewed journal as opposed to other, unpublished sources |
| Participant characteristics | |
| Age | General age of children included in the sample, classified as early childhood (preschool, kindergarten) or elementary school (grades 1–3) |
| At risk for reading difficulties | Majority of children in the sample considered at risk for later reading difficulties because of minority or low socioeconomic status, identification as learning disabled or language impaired, or poor emergent literacy skills |
| Instructional characteristics | |
| Setting | Setting/context in which treatment was delivered: school based (preschool classrooms, elementary school classrooms, school pullout/small-group program), home based, or research based (laboratory, clinic) |
| Implementer | Person who provided instruction (teacher employed by an educational agency versus researcher or research staff) |
| Group size | Grouping unit (i.e., number of students) in which instruction was delivered (individual tutoring, small groups, whole class) |
| Time | Number of minutes of instruction received by participants |

All studies from two of the National Reading Panel (NRP) meta-analyses (Phonological Awareness and Phonics; NICHD, 2000) and the recent meta-analysis by NELP (2008) were also reviewed for inclusion. These reviews added 20 studies (16 nonduplicate studies found by the NRP and 4 found by NELP) to the meta-analysis. Unpublished studies from two additional sources were included. Available online conference proceedings of relevant professional organizations (i.e., the Society for Research in Child Development, the Society for the Scientific Study of Reading, the American Educational Research Association, and the International Reading Association) were searched for presentations specifically related to the alphabet or knowledge of letters in general. Researchers active in this area were contacted with requests for copies of identified presentations or relevant unpublished work, yielding an additional two studies meeting all selection criteria (Brodeur et al., 2006; M. Brodeur, personal communication, March 10, 2007). Both studies provided as-yet-unpublished results from well-designed, large-scale randomized controlled trials in kindergarten classrooms, one of which had undergone peer review as a conference presentation. Florida Center for Reading Research reports (www.fcrr.org/FCRRReports/reportslist.htm) were also searched. This database consists of reviews of popular literacy curricula, including empirical evidence of effectiveness. Each review entails an exhaustive search of published and unpublished studies of the specified curriculum from school districts, independent program evaluators, and publishing companies. All curriculum reviews in the database were assessed, with four meeting selection criteria for inclusion in the meta-analysis.

In sum, the multistep literature search identified 494 studies that were obtained for full review after meeting initial screening criteria. A total of 63 studies met all criteria and are included in the present meta-analysis. These studies yielded a total of 82 independent contrasts between an instructional (treatment) condition and a control, often on multiple outcomes.

Study Coding

Included studies were coded extensively with respect to outcomes, instructional features, methodology/design, and participant characteristics. Coding of outcomes and instructional foci was required for correct classification of studies prior to analysis; additional details regarding instruction, methodology, and participants were used in moderator analyses. The codes used in the meta-analysis are presented in Table 1. Note that, given our interest in causal relations between alphabet knowledge and phonological awareness, reading, and spelling outcomes, these outcomes (immediate and follow-up) were coded only for studies involving pure alphabet instruction. Also, although we initially intended to code and analyze instructional features in much greater detail (e.g., additional instructional foci beyond alphabet or phonological training, instructional materials), insufficient reporting of such features prevented their inclusion. Reliability of the coding scheme was ensured through double coding of 10% of the studies. High inter-rater agreement (average kappa of 0.97) supported the reliability of study coding.
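As a concrete illustration of the agreement statistic reported above, the sketch below (our hypothetical example, not the authors' code) computes Cohen's kappa for a single dichotomous study code double-coded by two raters; averaging such values across codes would yield a summary figure like the one reported.

```python
from sklearn.metrics import cohen_kappa_score

# Hypothetical double-coded values for one dichotomous study code
# (e.g., whether letter sound instruction was explicitly provided).
rater_1 = ["Y", "Y", "N", "Y", "N", "Y", "Y", "N"]
rater_2 = ["Y", "Y", "N", "Y", "N", "Y", "N", "N"]

# Chance-corrected inter-rater agreement for this code
kappa = cohen_kappa_score(rater_1, rater_2)
print(f"Cohen's kappa: {kappa:.2f}")
```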

Effect Size Calculation and Analysis

Hedges’ g, corrected for bias, was used as the measure of effect size. For studies reporting means, standard deviations, and sample sizes at posttest, the following formulae were used (Ray & Shadish, 1996):

$$ g_U = g\left(1 - \frac{3}{4(n_E + n_C - 2) - 1}\right), \quad \text{with} \quad g = \frac{\bar{X}_E - \bar{X}_C}{S} \quad \text{and} \quad S = \sqrt{\frac{(n_E - 1)S_E^2 + (n_C - 1)S_C^2}{n_E + n_C - 2}} $$

where g_U is the unbiased estimate of Hedges’ g, g is Hedges’ g as traditionally defined, n_E is the number of participants in the experimental sample, n_C is the number of participants in the control sample, X̄_E is the mean of posttest scores for participants in the experimental group, X̄_C is the mean of posttest scores for participants in the control group, S_E² is the variance of posttest scores for participants in the experimental group, and S_C² is the variance of posttest scores for participants in the control group. For studies failing to provide the numbers required by the above formula, effect sizes were computed using alternative information, such as F tests, t tests, and reported and inferred probability levels (Ray & Shadish, 1996).
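A minimal sketch of these formulae in code (our illustration; the function name and the example values are hypothetical):

```python
import math

def hedges_g_unbiased(mean_e, mean_c, sd_e, sd_c, n_e, n_c):
    """Bias-corrected Hedges' g computed from posttest summary statistics,
    following the formulae above (Ray & Shadish, 1996)."""
    # Pooled standard deviation across the experimental and control groups
    s_pooled = math.sqrt(
        ((n_e - 1) * sd_e**2 + (n_c - 1) * sd_c**2) / (n_e + n_c - 2)
    )
    g = (mean_e - mean_c) / s_pooled
    # Small-sample bias correction
    correction = 1 - 3 / (4 * (n_e + n_c - 2) - 1)
    return g * correction

# Hypothetical example: the treatment group outscores the control group by
# half of a pooled standard deviation, yielding a corrected g of roughly 0.49.
print(round(hedges_g_unbiased(18.0, 15.0, 6.0, 6.0, 30, 30), 3))
```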

Single studies were allowed to contribute multiple effect sizes to the meta-analysis as long as the samples used to compute each effect size were independent of one another. In cases where multiple treatment groups were compared with a single control group, the average effect size was used in analyses. If separate treatment groups were matched with their own controls (e.g., at-risk and non–at-risk treatment groups compared with at-risk and non–at-risk controls, respectively), each comparison was included as a separate effect size. If multiple treatment and control groups existed and specific comparisons were not intended, these were randomly paired and each pair contributed an effect size to the meta-analysis. Finally, for studies with multiple measures within an outcome category (e.g., both a letter name recognition task and a letter name production task), a composite effect size was computed using the formula presented by Rosenthal and Rubin (1986).

Average weighted effect sizes were computed using a random-effects model (Shadish & Haddock, 1994), which accounts for both within- and between-study variance in effect size estimates. Estimation via the random-effects model was chosen for two reasons. Theoretically, the random-effects model was appropriate because we did not assume a single population distribution, given the synthesis of multiple interventions conducted in various settings and using various instructional methods and outcomes. This decision was also supported statistically, as the variability in observed effect sizes often exceeded that attributable solely to sampling error (i.e., the Q homogeneity test statistic was significant; Hedges & Vevea, 1998).

Random-effects analyses were conducted for each of the five alphabet outcomes. Analyses were completed separately for those studies with multiple instructional components and those providing only alphabet instruction, and were further subdivided by the type of instruction provided (in letter names, letter sounds, or both names and sounds, with or without additional phonological training). For studies providing only alphabet instruction, additional analyses were also conducted for immediate and follow-up phonological awareness, reading, and spelling outcomes. For all analyses, a number of statistics were computed in addition to the average weighted effect size. These included the 95% confidence interval for the effect, τ² (the between-study variability in effect sizes), and the Q statistic to test the homogeneity of effect sizes across studies (Cooper & Hedges, 1994). In addition, a file drawer statistic was calculated (Rosenthal, 1979). This calculation reflects the number of studies with null results required to reduce the average weighted effect size estimate to a nonsignificant value.
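The sketch below illustrates one standard way to compute these statistics, assuming the commonly used DerSimonian-Laird estimator of τ² and Rosenthal's (1979) fail-safe N; it is our illustration under those assumptions, not the authors' code, and the exact estimators used in the original analyses may differ.

```python
import numpy as np
from scipy import stats

def random_effects_summary(g, v, alpha=0.05):
    """Random-effects synthesis of effect sizes g with sampling variances v.

    Returns the average weighted effect, its confidence interval, the
    DerSimonian-Laird estimate of tau^2 (between-study variance), and the
    Q homogeneity statistic with its degrees of freedom and p-value.
    """
    g, v = np.asarray(g, float), np.asarray(v, float)
    w = 1.0 / v                                   # fixed-effect weights
    g_fixed = np.sum(w * g) / np.sum(w)
    q = np.sum(w * (g - g_fixed) ** 2)            # homogeneity statistic
    df = len(g) - 1
    c = np.sum(w) - np.sum(w**2) / np.sum(w)
    tau2 = max(0.0, (q - df) / c)                 # DerSimonian-Laird tau^2
    w_star = 1.0 / (v + tau2)                     # random-effects weights
    g_re = np.sum(w_star * g) / np.sum(w_star)
    se_re = np.sqrt(1.0 / np.sum(w_star))
    z = stats.norm.ppf(1 - alpha / 2)
    ci = (g_re - z * se_re, g_re + z * se_re)
    return g_re, ci, tau2, q, df, stats.chi2.sf(q, df)

def fail_safe_n(p_values, alpha=0.05):
    """Rosenthal's (1979) file drawer statistic: the number of null studies
    needed to pull the combined one-tailed p-value above alpha."""
    z = stats.norm.isf(np.asarray(p_values, float))   # study-level z scores
    return int(np.floor(np.sum(z) ** 2 / stats.norm.isf(alpha) ** 2 - len(z)))
```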

Moderator Analyses

Moderator analyses were used to examine factors related to and potential sources of bias in the effect sizes estimated, thus speaking to the validity of findings. Moderator analyses were conducted for overall effect size estimates for the various alphabet outcomes (letter name knowledge, letter sound knowledge, letter writing, and letter name fluency), given sufficient sample size and power. Moderator analyses could not be conducted for letter sound fluency, phonological, reading, or spelling outcomes.

The potential moderating factors investigated include those listed in the last section of Table 1. Similar to the large meta-analyses conducted by the NRP (NICHD, 2000) and NELP (2008), participant and instructional characteristics were examined as potential moderating factors. Participant characteristics included a broad classification of age (early childhood versus elementary school children) and whether participants were considered to be at risk for reading difficulties. Additional instructional characteristics included whether the intervention was classroom- or research lab/clinic-based, whether it was implemented by practicing teachers or research staff, the size of the group to which instruction was administered, and total instructional time. Studies that did not report information for particular moderator variables were dropped from the relevant analyses. Also dropped were studies in which multiple treatment groups were compared with a single control (i.e., those with dependent effect sizes) and the treatment groups differed on the moderator variable in question.

As recommended by Wortman (1994), an additional set of factors was examined to investigate potential bias arising from differences in study methodology or quality. Use of random assignment was examined as the study element differentiating experimental and quasi-experimental designs and thus affecting the confidence with which one may make causal attributions regarding the intervention or instruction provided (Shadish, Cook, & Campbell, 2002). Evidence of selection bias, or initial nonequivalence among treatment and control groups, considered a major threat to internal validity (Shadish et al., 2002), was examined as a second indicator of study quality. A third methodological factor examined bias related to whether alphabet instruction was provided only to the treatment group (e.g., comparing a supplemental alphabet training program with phonological awareness training or an untreated control) or to both treatment and control groups in various forms (e.g., pitting one method of instruction against another, such as a comprehensive literacy program versus a simpler letter-of-the-week curriculum). Fourth, given the inclusion of both published and unpublished studies, studies obtained from peer-reviewed journals were compared with those from other sources as a means of investigating publication bias. Finally, given concerns regarding the “constrained” nature of alphabet knowledge and assessment (Paris, 2005), we examined whether ceiling effects artificially restricted effect size estimates (Cohen, Cohen, West, & Aiken, 2003).

With the exception of instructional time, all moderators were dummy coded for entry into the regression procedure described in Hedges (1994). Instructional time was retained as a continuous variable and analyzed as such. Weighted least-squares regression was used, in which treatment/control contrasts were weighted by the inverse of their unconditional variances (Hedges, 1994). Overall model fit was assessed by comparing the residual variance remaining for each model after the moderators were added. Factors that improved model fit accounted for significant between-study variance in effect size estimates; the extent to which variability among studies remained after controlling for moderators was assessed via tests of the homogeneity of effect sizes (Q statistic). Standard errors and significance tests for regression coefficients were corrected as specified by Hedges (1994). Factors with significant coefficients were interpreted as reliably predicting the magnitude of effects.
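A minimal sketch of such a weighted least-squares moderator analysis appears below. It is our illustration of the general approach (the effect sizes, variances, and moderator values are hypothetical), not the authors' code, and the Hedges (1994) correction is implemented in the commonly described way of rescaling the conventional standard errors.

```python
import numpy as np
import statsmodels.api as sm

# Hypothetical effect sizes, their sampling variances, and a dummy-coded
# moderator (e.g., school-based = 1 versus home-based = 0).
g = np.array([0.62, 0.41, 0.18, 0.75, 0.05, 0.33])
v = np.array([0.04, 0.06, 0.05, 0.03, 0.08, 0.05])
school_based = np.array([1, 1, 0, 1, 0, 0])

# Weighted least-squares regression of effect sizes on the moderator,
# weighting each contrast by the inverse of its variance.
X = sm.add_constant(school_based)
fit = sm.WLS(g, X, weights=1.0 / v).fit()

# Because the weights are known inverse variances, the conventional WLS
# standard errors are rescaled (divided by the root mean squared error)
# before computing significance tests for the coefficients.
corrected_se = fit.bse / np.sqrt(fit.scale)
print(fit.params)        # intercept and moderator coefficient
print(corrected_se)      # corrected standard errors
```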

Results

Description of Studies

The studies included in the meta-analysis (N = 63) are listed in Appendix A and Appendix B. These studies involved 8,468 participants, 4,466 of whom received treatment involving alphabet instruction. A total of 82 independent effect sizes (k) were calculated, and the numbers of participants and effect sizes contributing to each meta-analysis are indicated in Table 2 and Table 3. Of the entire sample, most studies (n = 53) included multiple instructional components, typically providing alphabet instruction along with phonological training (n = 44, k = 59). Other instructional targets included oral language (k = 9), writing/printing (k = 11), general print concepts (k = 7), word identification (decoding, k = 12; sight words, k = 12), and book reading/use (k = 11). Instructional components aimed at improving other targets such as speech articulation, listening comprehension, spelling, or handwriting were present to a lesser extent. Ten studies provided pure alphabet instruction: three provided instruction in letter names only (k = 3), four provided instruction in letter sounds only (k = 4), and three provided instruction in both letter names and sounds (k = 5) without any additional instructional components. Total instructional time ranged from 120 to 5,793 minutes (M = 1,250.46 minutes, SD = 1,335.69).

Table 2

Average Weighted Effect Sizes and Relevant Statistics for Multicomponential Studies

| Type of instruction | k | n | Average weighted effect size | 95% CI lower bound | 95% CI upper bound | τ² | Q | p | File drawer statistic |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Letter name knowledge outcomes | | | | | | | | | |
| Full sample | 27 | 2,631 | 0.43 | 0.22 | 0.64 | 0.24 | 159.21 | < .001 | 549 |
| LN instruction, with PA | 1 | 33 | 1.00 | 0.27 | 1.72 | | | | 1 |
| LN instruction, without PA | 2 | 72 | 0.20 | −0.24 | 0.65 | −0.02 | 0.86 | .353 | |
| LS instruction, with PA | 6 | 461 | 0.18 | −0.01 | 0.36 | 0.00 | 4.24 | .515 | |
| LN and LS instruction, with PA | 17 | 2,041 | 0.58 | 0.31 | 0.86 | 0.27 | 125.19 | < .001 | 451 |
| LN and LS instruction, without PA | 1 | 24 | −1.85 | −2.82 | −0.88 | | | | 3 |
| Letter sound knowledge outcomes | | | | | | | | | |
| Full sample | 36 | 3,616 | 0.65 | 0.44 | 0.86 | 0.32 | 270.92 | < .001 | 2,176 |
| LN instruction, without PA | 3 | 48 | 0.82 | 0.22 | 1.41 | 0.00 | 0.76 | .684 | 3 |
| LS instruction, with PA | 17 | 1,863 | 0.85 | 0.63 | 1.08 | 0.15 | 69.36 | < .001 | 1,048 |
| LN and LS instruction, with PA | 15 | 1,681 | 0.48 | 0.17 | 0.79 | 0.29 | 106.55 | < .001 | 167 |
| LN and LS instruction, without PA | 1 | 24 | −1.31 | −2.20 | −0.42 | | | | 1 |
| Letter writing outcomes | | | | | | | | | |
| Full sample | 6 | 741 | 0.59 | 0.31 | 0.86 | 0.07 | 14.70 | .012 | 76 |
| LS instruction, with PA | 4 | 601 | 0.60 | 0.25 | 0.94 | 0.09 | 11.30 | .010 | 41 |
| LN and LS instruction, with PA | 2 | 140 | 0.58 | −0.05 | 1.21 | 0.15 | 3.40 | .065 | |
| Letter name fluency outcomes | | | | | | | | | |
| Full sample | 17 | 2,016 | −0.02 | −0.25 | 0.21 | 0.17 | 82.89 | < .001 | |
| LN instruction, with PA | 1 | 43 | −0.61 | −1.23 | 0.00 | | | | |
| LN instruction, without PA | 1 | 139 | 0.16 | −0.20 | 0.51 | | | | |
| LS instruction, with PA | 7 | 1,385 | 0.07 | −0.09 | 0.23 | 0.02 | 10.23 | .115 | |
| LS instruction, without PA | 1 | 24 | −0.32 | −1.13 | 0.48 | | | | |
| LN and LS instruction, with PA | 7 | 425 | −0.04 | −0.76 | 0.69 | 0.84 | 65.05 | < .001 | |
| Letter sound fluency outcomes | | | | | | | | | |
| Full sample | 4 | 428 | 0.58 | 0.38 | 0.79 | 0.00 | 1.21 | .752 | 24 |
| LS instruction, with PA | 3 | 404 | 0.60 | 0.48 | 0.72 | −0.02 | 0.65 | .723 | 22 |
| LS instruction, without PA | 1 | 24 | 0.29 | −0.52 | 1.09 | | | | |

Note, k = number of independent contrasts/studies; n = total number of children; CI = confidence interval; τ 2 = between-study variability; LN = letter name; PA = phonological awareness; LS = letter sound.

Table 3

Average Weighted Effect Sizes and Relevant Statistics for Studies Providing Only Alphabet Instruction

| Type of instruction | k | n | Average weighted effect size | 95% CI lower bound | 95% CI upper bound | τ² | Q | p | File drawer statistic |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Letter name knowledge outcomes | | | | | | | | | |
| Full sample | 8 | 389 | 0.14 | −0.10 | 0.37 | 0.02 | 8.16 | .319 | |
| LN instruction | 1 | 28 | 0.00 | −0.86 | 0.86 | | | | |
| LS instruction | 2 | 172 | 0.08 | −0.22 | 0.39 | 0.00 | 0.12 | .731 | |
| LN and LS instruction | 5 | 189 | 0.23 | −0.20 | 0.67 | 0.12 | 7.75 | .101 | |
| Letter sound knowledge outcomes | | | | | | | | | |
| Full sample | 4 | 856 | 0.48 | 0.40 | 0.55 | 0.00 | 2.57 | .462 | 20 |
| LN instruction | 1 | 628 | 0.50 | 0.34 | 0.66 | | | | 9 |
| LS instruction | 2 | 204 | 0.24 | −0.04 | 0.53 | 0.00 | 0.19 | .661 | |
| LN and LS instruction | 1 | 24 | 0.41 | −0.40 | 1.22 | | | | |
| Letter writing outcomes | | | | | | | | | |
| Full sample (LS instruction only) | 1 | 112 | 0.31 | −0.06 | 0.68 | | | | |
| Letter name fluency outcomes | | | | | | | | | |
| Full sample (LN instruction only) | 1 | 39 | 0.64 | 0.00 | 1.28 | | | | |
| Letter sound fluency outcomes | | | | | | | | | |
| Full sample (LS instruction only) | 1 | 34 | −0.33 | −1.01 | 0.35 | | | | |
| Immediate phonological awareness outcomes | | | | | | | | | |
| Full sample | 6 | 829 | −0.17 | −0.73 | 0.40 | 0.44 | 59.56 | < .001 | |
| LN instruction | 1 | 516 | 0.62 | 0.44 | 0.79 | | | | 11 |
| LS instruction | 4 | 284 | −0.22 | −0.76 | 0.33 | 0.26 | 17.24 | .001 | |
| LN and LS instruction | 1 | 29 | −0.98 | −1.75 | −0.20 | | | | 1 |
| Immediate reading outcomes | | | | | | | | | |
| Full sample | 6 | 397 | 0.21 | 0.02 | 0.40 | −0.01 | 4.60 | .467 | 1 |
| LN instruction | 1 | 39 | 0.49 | −0.14 | 1.13 | | | | |
| LS instruction | 5 | 358 | 0.18 | −0.03 | 0.39 | 0.00 | 3.74 | .442 | |
| Immediate spelling outcomes | | | | | | | | | |
| Full sample (LS instruction only) | 1 | 112 | 0.30 | −0.08 | 0.67 | | | | |
| Follow-up phonological awareness outcomes | | | | | | | | | |
| Full sample (LS instruction only) | 3 | 264 | 0.01 | −0.80 | 0.82 | 0.46 | 20.57 | < .001 | |
| Follow-up reading outcomes | | | | | | | | | |
| Full sample | 4 | 285 | 0.11 | −0.12 | 0.35 | 0.00 | 2.11 | .550 | |
| LN instruction | 1 | 39 | 0.25 | −0.38 | 0.88 | | | | |
| LS instruction | 3 | 246 | 0.09 | −0.15 | 0.34 | 0.00 | 1.91 | .384 | |
| Follow-up spelling outcomes | | | | | | | | | |
| Full sample (LS instruction only) | 3 | 264 | 0.14 | −0.62 | 0.89 | 0.40 | 18.20 | < .001 | |

Note, k = number of independent contrasts/studies; n = total number of children; CI = confidence interval; τ 2 = between-study variability; LN = letter name; LS = letter sound.

Appendix A

Effect Sizes and Study Characteristics of Independent Treatment/Control Contrasts Included in Meta-Analyses

StudySourceStudy typeDesignCountryLanguageAgeAt-risk sampleSettingImplementerGroup sizeLNLSPATime (min)Isolated alpha
Angerilli (1999)PQT/DExpCanadaEnglishK ECRASGNYY Y
Aram & Biron (2004)P/EJQEDIsraelHebrewPreAPORASGYYY1650Y
Aram (2006)P/EJQEDIsraelHebrewPreAPCTSGYYY1250Y
P/EJExpIsraelHebrewPreAPCTSGYYY1250Y
Ball & Blachman (1988)FCRRJExpUSAEnglishK PORASGYY 560Y
Ball & Blachman (1991)P/EJExpUSAEnglishK PORASGYY 560Y
Bara, Gentaz, Colé, & Sprenger-Charolles (2004)P/EJQEDFranceFrenchK PORASGNYN175N
Beech, Pedley, & Barlow (1994)P/EJQEDUKEnglishGE RAINYY135Y
Blachman, Ball, Black, & Tangel (1994)NELPJQEDUSAEnglishKAECTSGYYY770Y
Boyle (2006)PQT/DExpUSAEnglishPreAHPtIYYY Y
Brodeur et al. (2006)C/RPExpCanadaFrenchKAECTWCYYY750Y
C/RPExpCanadaFrenchKNECTWCYYY750Y
Brodeur (2007)C/RUExpCanadaFrenchKAECTWCYYY1260Y
C/RUExpCanadaFrenchK ECTWCYYY1260Y
C/RUExpCanadaFrenchK ECTWCYYY1260Y
Caplovitz (2005)PQT/DExpUSAEnglishPre HPtIYNN337Y
Castle, Raich, & Nicholson (1994)NELPJExpNew ZealandEnglishKAPORASGNYY400Y
NELPJExpNew ZealandEnglishKAPORASG 300Y
Culatta, Kovarsky, Theadore, Franklin, & Timler (2003)P/EJQEDUSAEnglishPreAPCRAWCYYN Y
de Jong, Seveke, & van Veen (2000)P/EJExpNetherlandsDutchK ECTSGNYY1500Y
de Jong & Vrielink (2004)P/EJExpNetherlandsDutchG1 ECTSGNYN1250Y
Edwards (2000)PQT/DExpUSAEnglishKAECTSGYYY3390Y
P/EJExpUSAEnglishPre/K RLRAINYN N
Ehri & Wilce (1987)NRPJExpUSAEnglishK RAINYY160N
Eidlitz-Neufeld (2002)P/ET/DQEDCanadaEnglishK ECTWCNYY Y
P/ET/DQEDCanadaEnglishK ECTWCNYY Y
P/ET/DQEDCanadaEnglishG1 ECTWCNYY Y
Farrell (1978)P/ET/DExpUSAEnglishGEAPORAWCYYN360Y
P/ET/DExpUSAEnglishGEAPORAWCYYN360Y
P/ET/DExpUSAEnglishGEAPORAWCYYN360Y
Fuchs et al. (2001)NELPJExpUSAEnglishKSECT NYY Y
NELPJExpUSAEnglishKNECT NYY Y
NELPJExpUSAEnglishKNECT NYY Y
Fugate (1997)P/EJExpUSAEnglishG1 PORAIYNN150Y
Gettinger (1986)NRPJExpUSAEnglishPre PCTSGYYY Y
Gillon (2005)P/EJQEDNew ZealandEnglishPreACRASGYYY1148Y
Hecht & Close (2002)P/EJQEDUSAEnglishK ECC/TIYYY Y
Hetzroni & Shavit (2002)P/EJExpIsraelHebrewGEAPOT YYN N
Johnston & Watson (2004)P/EJQEDUKEnglishKSECTWCNYY1600N
P/EJQEDUKEnglishGE PORASGNYN285Y
Justice (2000)P/ET/DExpUSAEnglishPreAPORASGYNN144Y
Kerstholt, van Bon, & Schreuder (1997)P/EJExpNetherlandsDutchK RA NYY120Y
Lennon & Slesinski (1999)NELPJExpUSAEnglishKAPORAPrYYY1350Y
NELPJExpUSAEnglishKNPORAPrYYY1350Y
MacCoubrey (2003)PQT/DQEDCanadaEnglishKAPORASGNYY700Y
McMahon, Rose, & Parks (2003)P/EJQEDUSAEnglishG1 PORAWCYNN800Y
Mioduser, Tur-Kaspa, & Leitner (2000)P/EJExpIsraelHebrewPreAEC IYYY Y
Murray, Stahl, & Ivey (1996)P/EJExpUSAEnglishPreAPCTWCYNN150Y
Murray (1998)NELPJExpUSAEnglishK PORAPrYYY263N
O’Brien (2006)PQT/DExpUSAEnglishPre HPtINYY490Y
O’Connor & Jenkins (1995)NELPJExpUSAEnglishKAPORASGNYY300N
O’Connor, Notari-Syverson, & Vadasy (1996)NELPJExpUSAEnglishKAECTSGYYY1200Y
NELPJQEDUSAEnglishKNECTWCYYY1200Y
NELPJQEDUSAEnglishKAECTSGYYY1200Y
O’Connor (2000)P/EJQEDUSAEnglish A YYY Y
Oudeans (2003)NELPJExpUSAEnglishKSPORASGYYY600N
Peterson & Haines (1992)NRPJExpCanadaEnglishKAPORAIYNN Y
NRPJExpCanadaEnglishK PORAIYNN Y
NRPJExpCanadaEnglishK PORAIYNN Y
Putman (2005)PQT/DQEDUSAEnglishK ECTWCNYN Y
RMC Research Corporation (n.d.-a)FCRRRExpUSAEnglishPre PCTWCNYY Y
RMC Research Corporation (n.d.-b)FCRRRExpUSAEnglishK ECTWCNYY3500Y
FCRRRExpUSAEnglishG1 ECTWCNYY3500Y
Roberts & Corbett (1997)P/ERQEDUSAEnglishKMECRAWCNYY720Y
Roberts (2003)NELPJExpUSAEnglishPreA RASGYNY1080Y
Schneider, Roth, & Ennemoser (2000)NELPJQEDGermanyGermanKAPOTSGNY Y
Scott (2005)P/ET/DExpUSAEnglishPreAPCT YYY840Y
Shippensburg University Research (2004)FCRRRQEDUSAEnglishK ECVINYY4320Y
Solity, Deavers, Kerfoot, Crane, & Cannon (1999)NELPJQEDUKEnglishKSECTWCNYY5760Y
Stuart (1999)P/EJQEDUKEnglishKMECTSGNYN3600Y
Sumbler (1999)PQT/DQEDCanadaEnglishK ECTWCNYY Y
Trout, Epstein, Mickelson, Nelson, & Lewis (2003)NELPJExpUSAEnglishKAPORAIYYY4200Y
Uhing (2005)PQT/DQEDUSAEnglishPre PORASGYNY1792Y
van Bysterveldt, Gillon, & Moran (2006)P/EJQEDNew ZealandEnglishPreAHPtIYYN240Y
Vandervelden & Siegel (1997)NELPJQEDCanadaEnglishKAPORAPrYYY285N
Vaughn et al. (2006)C/RJExpUSAEnglishG1APORASGNYY5793Y
Walton, Bowden, Kurtz, & Angus (2001)NELPJExpCanadaEnglishK/G1 PORASGNYY500Y
Walton & Walton (2002)P/EJExpCanadaEnglishK PORASGNYY500Y
P/EJExpCanadaEnglishK PORASGNYY500Y
Whitehurst et al. (1994)NELPJExpUSAEnglishPreAPCTSGYYY2063Y
Williams (1980)NRPJQEDUSAEnglishGEAECTSGNYY2600Y
NRPJExpUSAEnglish A TSGNYY1164Y
Woodrome (2006)P/ET/DExpUSAEnglishPre/K RLRASGYNN120Y
Yeh (2003)NELPJExpUSAEnglishPreAPCTSGNYY405Y

Note. Components of instruction explicitly included: LN (letter name training), LS (letter sound training), PA (phonological awareness training); Time = total instruction time for the treatment group in minutes; Isolated alpha = alphabet instruction provided to the treatment group only; PQ = ProQuest; P/E = PsycInfo or ERIC database; FCRR = Florida Center for Reading Research Reports; NELP = reference list from the National Early Literacy Panel review; C/R = conference presentation or other requested material; NRP = reference list from the Report of the National Reading Panel; T/D = thesis or dissertation; J = journal article; P = presentation; U = unpublished data or manuscript; R = report; Exp = experimental design; QED = quasi-experimental design; K = kindergarten; Pre = preschool; G1 = grade 1; GE = general elementary; Pre/K = mixed preschool and kindergarten; K/G1 = mixed kindergarten and grade 1; A = all; S = some; N = none; M = majority; EC = elementary school classroom; PO = pullout program; PC = preschool center; H = home; RL = research laboratory; C = clinic; RA = researcher/research assistants; T = teacher; V = community volunteer; C/T = technology such as computers or television; Pt = parent; SG = small group; I = individual tutoring; WC = whole class; Pr = pair.

Appendix B

Effect Sizes of Independent Treatment/Control Contrasts Included in Meta-Analyses

Effect sizes for immediate outcomeEffect sizes for follow-up outcome
StudyNLNKLSKLWRLNFLSFPAReadSpellNPAReadSpell
Angerilli (1999)71 1.63 *
Aram & Biron (2004)711.90 *
Aram (2006)792.18 *
770.13
Ball & Blachman (1988)89−0.130.56 *
Ball & Blachman (1991)89−0.130.56 *
Bara, Gentaz, Colé, & Sprenger-Charolles (2004)600.16 0.230.43
Beech, Pedley, & Barlow (1994)36 0.590.50
Blachman, Ball, Black, & Tangel (1994)1590.280.62 *
Boyle (2006)270.000.00
Brodeur et al. (2006)2530.59 * 0.00
5780.060.00
Brodeur (2007)1390.66 * 0.75 *
850.52 * 0.20
480.37−0.88 *
Caplovitz (2005)139 0.16
Castle, Raich, & Nicholson (1994)280.28
48−0.17
Culatta, Kovarsky, Theadore, Franklin, & Timler (2003)300.10 −0.98 *
de Jong, Seveke, & van Veen (2000)270.43
de Jong & Vrielink (2004)34 −0.33 −0.31
Edwards (2000)64 0.91 * 0.66 *
20 1.02 * 0.11
Ehri & Wilce (1987)18 0.95
Eidlitz-Neufeld (2002)242 0.47 *
139 1.67 * 1.09 *
184 0.33 *
Farrell (1978)650.03
330.12
37−0.09
Fuchs et al. (2001)134 0.69 *
134 0.49 *
134 0.65 *
Fugate (1997)39 0.64 0.49 39 0.25
Gettinger (1986)721.27 * 2.56 *
Gillon (2005)20 0.78
Hecht & Close (2002)76−0.100.250.27
Hetzroni & Shavit (2002)241.36 * 0.41
Johnston & Watson (2004)292 0.83 * 62−0.14−0.05−0.26
62 0.32 −0.180.07
Justice (2000)300.46
Kerstholt, van Bon, & Schreuder (1997)29 2.08 *
Lennon & Slesinski (1999)780.66 * 0.87 *
561.43 * 0.60 *
MacCoubrey (2003)71−0.220.30
McMahon, Rose, & Parks (2003)628 0.50 * 0.62 *
Mioduser, Tur-Kaspa, & Leitner (2000)460.33
Murray, Stahl, & Ivey (1996)420.01
Murray (1998)48 −0.05
O’Brien (2006)37 0.36
O’Connor & Jenkins (1995)67 −0.47
O’Connor, Notari-Syverson, & Vadasy (1996)19 0.54
57 0.56
14 −0.12
O’Connor (2000)125 −1.91 *
Oudeans (2003)54 0.00
Peterson & Haines (1992)16 1.06 *
16 0.97
16 0.47
Putman (2005)24 −0.320.29
RMC Research Corporation (n.d.-a)254 0.09
RMC Research Corporation (n.d.-b)258 0.02
165 −0.10
Roberts & Corbett (1997)560.000.00
Roberts (2003)331.00 *
Schneider, Roth, & Ennemoser (2000)1380.88 * −1.00 * 0.36 90−0.58 * −0.05−0.22
Scott (2005)92 0.09
Shippensburg University Research (2004)564 0.18 *
Solity, Deavers, Kerfoot, Crane, & Cannon (1999)433 0.86 *
Stuart (1999)1120.050.190.31 0.100.120.301120.73 * .0320.86 *
Sumbler (1999)2650.190.96 *
Trout, Epstein, Mickelson, Nelson, & Lewis (2003)12 1.04
Uhing (2005)43 −0.61 *
van Bysterveldt, Gillon, & Moran(2006)24−1.85 * −1.31 *
Vandervelden & Siegel (1997)30−0.691.01 *
Vaughn et al. (2006)400.590.11 0.59
Walton, Bowden, Kurtz, & Angus (2001)21 0.56
Walton & Walton (2002)30 1.54 *
61 0.85 *
Whitehurst et al. (1994)1670.39 *
Williams (1980)102 0.63 *
102 0.48 *
Woodrome (2006)280.00
Yeh (2003)44 0.79 *

Note. Effect size significance (*) indicates a reliable difference from zero. Instruction = components of instruction explicitly included: LN (letter name training), LS (letter sound training), PA (phonological awareness training); N = total number of participants; LNK = letter name knowledge; LSK = letter sound knowledge; LWR = letter writing; LNF = letter name fluency; LSF = letter sound fluency; PA = phonological awareness; Read = text, word, and nonword reading; Spell = spelling.

Additional study characteristics are presented in Appendix A and Appendix B, per independent treatment/control contrast, with factors included in moderator analyses summarized in Table 4. The studies involved participants in eight countries (Canada, France, Germany, Israel, the Netherlands, New Zealand, the United Kingdom, and the United States). Most instruction involved training in the English alphabet (k = 67), although other alphabetic languages were also represented: Dutch (k = 3), French (k = 6), German (k = 1), and Hebrew (k = 5). Although most studies were conducted in school settings with preschool- or kindergarten-aged children (k = 68), a few studies involved only elementary school students (k = 10) or a combination of kindergarten and elementary school students (k = 4; see Table 4). Many participants were considered at risk for reading difficulties because of socioeconomic, disability, or low skill-level status, and this was particularly true for elementary school-aged participants.

Table 4

Effect Sizes as a Function of Moderator Variables

| Moderator | Letter name knowledge: k | Letter name knowledge: effect size | Letter sound knowledge: k | Letter sound knowledge: effect size | Letter writing: k | Letter writing: effect size | Letter name fluency: k | Letter name fluency: effect size |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Use of random assignment: yes | 25 | 0.40 | 26 | 0.65 | 1 | 0.915 | 12 | 0.18 * |
| Use of random assignment: no | 13 | 0.31 | 16 | 0.56 | 6 | 0.497 | 6 | −0.37 |
| Presence of selection bias(es): yes | 24 | 0.33 | 22 | 0.51 | 5 | 0.497 | 12 | −0.09 |
| Presence of selection bias(es): no | 14 | 0.43 | 20 | 0.74 | 2 | 0.746 | 6 | 0.21 |
| Alphabet instruction isolated to the treatment group: yes | 35 | 0.38 | 37 | 0.61 | 7 | 0.545 | 16 | 0.04 |
| Alphabet instruction isolated to the treatment group: no | 3 | 0.20 | 5 | 0.63 | 0 | | 2 | −0.24 |
| Potential ceiling effects: yes | 11 | 0.47 | 9 | 0.53 | 3 | 0.705 | | |
| Potential ceiling effects: no | 19 | 0.35 | 25 | 0.68 | 4 | 0.427 | | |
| Publication bias: peer-reviewed journal | 24 | 0.46 | 35 | 0.68 | 3 | 0.336 | 8 | −0.08 |
| Publication bias: other source | 14 | 0.22 | 6 | 0.45 | 4 | 0.662 | 10 | 0.07 |
| Participant age: early childhood (preschool/kindergarten) | 33 | 0.37 | 35 | 0.65 | 5 | 0.601 | 14 | 0.09 |
| Participant age: elementary | 5 | 0.34 | 6 | 0.43 | 2 | 0.381 | 3 | 0.32 |
| Participants at risk for reading difficulties: yes | 26 | 0.40 | 18 | 0.42 | 2 | 0.569 | 7 | −0.14 |
| Participants at risk for reading difficulties: no | 2 | 0.58 | 4 | 0.57 | 0 | | 2 | 0.28 |
| Instructional setting: school | 34 | 0.41 # | 35 | 0.63 # | 6 | 0.549 | 15 | 0.11 |
| Instructional setting: home | 2 | −0.80 # | 2 | −0.61 # | 0 | | 2 | 0.24 |
| Instructional setting: lab or clinic | 1 | 0.00 | 1 | 0.78 | 0 | | 0 | |
| Implementer: teacher | 18 | 0.54 | 16 | 0.63 | 5 | 0.594 | 9 | 0.16 |
| Implementer: research staff | 16 | 0.31 | 23 | 0.71 | 1 | 0.55 | 5 | 0.01 |
| Group size: individual tutoring | 4 | −0.26 # | 9 | 0.39 | 2 | 0.355 | 4 | 0.30 |
| Group size: small group | 21 | 0.52 # | 20 | 0.73 | 2 | 0.569 | 7 | 0.07 |
| Group size: whole class | 12 | 0.24 | 11 | 0.48 | 3 | 0.607 | 5 | 0.06 |
| Instructional time (a): at minimum | 30 | −0.26 * | 32 | 0.57 | 3 | 0.508 | 15 | 0.11 |
| Instructional time (a): at mean | | 0.31 | | 0.50 | | 0.546 | | 0.15 |
| Instructional time (a): at maximum | | 0.98 | | 0.42 | | 0.565 | | 0.19 |

Note. Contrasts represent results of pairwise comparisons within factors.

a Total minutes of instructional time was highly positively skewed, and a logarithmic transformation of this variable was used in all analyses (Tabachnick & Fidell, 1996). Because this moderator was a continuous variable, effects are listed for instructional time at the minimum, mean, and maximum.

* Contrast between effect sizes is significant at p < .05. # Contrast between effect sizes trended towards significance at p < .10 and significantly improved model fit at p < .05.

Impact of Multicomponential Instruction on Alphabet Outcomes

Table 2 presents effect size results for multicomponential studies, that is, studies providing instruction in literacy domains beyond alphabet knowledge alone. As statistical power was often limited, particularly given the small sample sizes once studies were disaggregated by instructional foci, the 95% confidence intervals are helpful in evaluating the precision of the effect size estimates. A confidence interval that does not contain zero indicates that the effect size was significantly different from zero.

Instruction had a significant impact on every alphabet outcome except letter name fluency. Overall effect sizes ranged from g_U = 0.43 for letter name knowledge to g_U = 0.65 for letter sound knowledge. In all cases except letter sound fluency, these overall effect sizes were heterogeneous (significant Q statistic), with between-study variance estimates (τ²) ranging from 0.07 for letter writing to 0.32 for letter sound knowledge. As seen in Table 2, these variance estimates often changed considerably when studies were classified according to instructional components; nonsignificant homogeneity statistics ought to be interpreted with caution, however, given the small numbers of studies in many of these categories.

Disambiguating by instructional components allowed for a more nuanced examination of the overall effects. Letter name outcomes were reliably affected when letter name instruction, or combined letter name and sound instruction, was paired with phonological training. With the exception of a single study (van Bysterveldt, Gillon, & Moran, 2006), all instruction targeting letter sound knowledge reliably improved letter sound outcomes, whether or not such instruction included phonological training. Moreover, letter name instruction also led to significant impacts on letter sound knowledge, although these effects were somewhat smaller than those for letter sound instruction. Letter sound fluency was reliably affected by letter sound plus phonological instruction. Notably, the van Bysterveldt et al. (2006) study producing the negative effect on letter sound outcomes also produced a negative effect on letter name outcomes; these negative effects may be attributable to that particular study rather than to a general failure of combined letter name and sound instruction to improve alphabet outcomes.

Impact of Pure Alphabet Instruction on Alphabet and Early Literacy Outcomes

Table 3 presents effect size results for studies providing only alphabet instruction. Despite generally positive effects of pure alphabet instruction on alphabet outcomes, most effect sizes were not reliable. The sole exception was letter sound knowledge, which showed a significant overall effect. A significant impact of letter name instruction on letter sound knowledge was also found, but this finding was based on only a single effect size (McMahon, Rose, & Parks, 2003). Unlike the results for multicomponential studies, between-study variability was minimal (maximum τ² of 0.12) and homogeneity of effect sizes could not be rejected. Caution is warranted in interpreting these results, however, given the small numbers of studies and participants involved.

Short- and long-term effects of alphabet instruction on phonological, reading, and spelling skills are also reported in Table 3. Phonological outcomes did not show an overall impact of alphabet instruction when measured immediately following intervention; a significant positive impact of letter name instruction found in a single study (McMahon et al., 2003) was accompanied by a significant negative effect of combined letter name and sound instruction. Conversely, immediate reading outcomes showed a significant overall impact of alphabet instruction but nonsignificant impacts when considering the specific type of instruction provided. The sole study assessing spelling at immediate posttest (Stuart, 1999) failed to show a reliable effect of letter sound instruction on this outcome. Effect sizes were heterogeneous for phonological outcomes (τ² = 0.44); reading outcomes, on the other hand, showed essentially no between-study variability.

Only four studies included follow-up assessments (Fugate, 1997, 2-month follow-up; Johnston & Watson, 2004, 9-month follow-up; Schneider, Roth, & Ennemoser, 2000, 3-month follow-up for phonological awareness and 1-year follow-up for reading and spelling; Stuart, 1999, 1-year follow-up). None of the follow-up effect sizes differed significantly from zero. Similar to the results presented above, effect sizes were heterogeneous for phonological and spelling outcomes, with between-study variability (τ²) ranging from 0.40 to 0.46; reading outcomes again showed almost no between-study variability.

Moderator Analyses

Results of moderator analyses are presented in Table 5, with estimated effects and contrasts among levels of moderating factors presented in Table 4. For letter name and letter sound knowledge outcomes, studies conducted in school settings showed greater impacts than those conducted at home. Letter name knowledge effect sizes also differed according to the size of the group to which instruction was provided, with small groups showing larger impacts than individual tutoring. Finally, effects on letter name knowledge were reliably related to the amount of time devoted to instruction; studies providing more instructional time tended to have larger impacts.

Table 5

Fit and Homogeneity Statistics for Moderator Analyses

| Moderator | k | Model fit: ΔR² | Model fit: ΔQ | Model fit: df | Model fit: p | Homogeneity: Q | Homogeneity: df | Homogeneity: p |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Letter name knowledge outcomes | | | | | | | | |
| Use of random assignment | 38 | 0.01 | 0.35 | 1 | .554 | 66.98 | 35 | .001 |
| Presence of selection bias(es) | 38 | 0.01 | 0.34 | 1 | .560 | 66.99 | 36 | .001 |
| Alphabet instruction isolated to the treatment group | 38 | 0.01 | 0.32 | 1 | .569 | 67.00 | 36 | .001 |
| Potential ceiling effects | 30 | 0.01 | 0.49 | 1 | .485 | 61.57 | 28 | < .001 |
| Publication bias | 38 | 0.03 | 2.30 | 1 | .129 | 65.03 | 36 | .002 |
| Participant age | 38 | 0.00 | 0.01 | 1 | .909 | 67.31 | 36 | .001 |
| Participants at risk for reading difficulties | 28 | 0.01 | 0.33 | 1 | .568 | 58.38 | 26 | < .001 |
| Instructional setting | 37 | 0.13 | 8.66 | 2 | .013 | 57.23 | 34 | .008 |
| Implementer | 34 | 0.04 | 2.12 | 1 | .145 | 50.49 | 32 | .020 |
| Group size | 37 | 0.14 | 9.01 | 2 | .011 | 55.48 | 34 | .011 |
| Instructional time | 30 | 0.19 | 10.72 | 1 | .001 | 46.40 | 28 | .016 |
| Letter sound knowledge outcomes | | | | | | | | |
| Use of random assignment | 42 | 0.00 | 0.24 | 1 | .627 | 52.82 | 40 | .084 |
| Presence of selection bias(es) | 42 | 0.03 | 1.55 | 1 | .213 | 51.50 | 40 | .105 |
| Alphabet instruction isolated to the treatment group | 42 | 0.00 | 0.00 | 1 | .950 | 53.05 | 40 | .081 |
| Potential ceiling effects | 34 | 0.01 | 0.44 | 1 | .506 | 43.46 | 32 | .085 |
| Publication bias | 41 | 0.03 | 1.35 | 1 | .245 | 51.70 | 40 | .102 |
| Participant age | 41 | 0.01 | 0.72 | 1 | .395 | 52.27 | 39 | .076 |
| Participants at risk for reading difficulties | 22 | 0.02 | 0.25 | 1 | .614 | 15.19 | 20 | .765 |
| Instructional setting | 38 | 0.15 | 7.16 | 2 | .028 | 40.76 | 35 | .232 |
| Implementer | 39 | 0.00 | 0.19 | 1 | .665 | 42.80 | 37 | .236 |
| Group size | 40 | 0.05 | 2.43 | 2 | .296 | 45.73 | 37 | .154 |
| Instructional time | 32 | 0.01 | 0.15 | 1 | .703 | 30.06 | 30 | .463 |
| Letter writing outcomes | | | | | | | | |
| Use of random assignment | 7 | 0.20 | 1.18 | 1 | .278 | 4.83 | 5 | .437 |
| Presence of selection bias(es) | 7 | 0.11 | 0.64 | 1 | .423 | 5.37 | 5 | .373 |
| Potential ceiling effects | 7 | 0.21 | 1.26 | 1 | .262 | 4.75 | 5 | .447 |
| Publication bias | 7 | 0.27 | 1.63 | 1 | .202 | 4.38 | 5 | .497 |
| Participant age | 7 | 0.10 | 0.62 | 1 | .432 | 5.40 | 5 | .369 |
| Implementer | 6 | 0.01 | 0.04 | 1 | .836 | 5.21 | 4 | .267 |
| Group size | 7 | 0.11 | 0.64 | 2 | .725 | 5.26 | 4 | .261 |
| Instructional time | 3 | 0.01 | 0.01 | 1 | .910 | 1.60 | 1 | .206 |
| Letter name fluency outcomes | | | | | | | | |
| Use of random assignment | 18 | 0.18 | 4.65 | 1 | .031 | 20.69 | 16 | .191 |
| Presence of selection bias(es) | 18 | 0.06 | 1.45 | 1 | .229 | 23.90 | 16 | .092 |
| Alphabet instruction isolated to the treatment group | 18 | 0.02 | 0.58 | 1 | .447 | 24.77 | 16 | .074 |
| Publication bias | 18 | 0.02 | 0.37 | 1 | .542 | 24.98 | 16 | .070 |
| Participant age | 17 | 0.06 | 0.54 | 1 | .463 | 8.41 | 15 | .906 |
| Participants at risk for reading difficulties | 9 | 0.05 | 1.04 | 1 | .309 | 20.24 | 7 | .005 |
| Instructional setting | 17 | 0.01 | 0.12 | 1 | .726 | 8.83 | 15 | .886 |
| Implementer | 14 | 0.03 | 0.27 | 1 | .603 | 8.44 | 12 | .750 |
| Group size | 16 | 0.08 | 0.72 | 2 | .698 | 8.22 | 13 | .829 |
| Instructional time | 15 | 0.00 | 0.03 | 1 | .860 | 8.27 | 13 | .825 |

k = number of independent contrasts/studies

Study quality differences predicted variability in effect sizes across studies in only a single case: True experiments showed reliably larger effects than quasi-experiments on letter name fluency outcomes. Notably, publication bias did not reliably affect results despite trends favoring peer-reviewed publications. Also note that, for all outcomes except letter name knowledge, accounting for potential moderators reduced the between-study variability to nonsignificant levels; the test of homogeneity of effect sizes continued to be rejected for analyses involving letter name outcomes.

Discussion

In its synthesis of more than 60 studies from the early reading literature, the present study demonstrates a significant impact of instruction on children’s alphabet learning. Effect size magnitude depended not only on the type of alphabet knowledge assessed but also on instructional factors such as the skills taught, setting, grouping, and duration.

Impact on Alphabet Knowledge

Letter name knowledge, letter sound knowledge, and letter writing outcomes showed effects ranging from 0.14 to 0.65 when considering both multicomponential and pure alphabet instruction. According to Cohen’s guidelines, these represent small-to-moderate effects (Cohen, 1988); however, when compared with the effects typically achieved by contemporary educational programs, they may be considered relatively large (Hill, Bloom, Black, & Lipsey, 2007; Konstantopoulos & Hedges, 2008). Effects tended to be larger (moderate effect sizes) and more aligned with the NELP findings when multicomponential instruction was provided, perhaps lending support to notions of reciprocal relations among emergent literacy skills (Burgess & Lonigan, 1998; Lonigan et al., 2000; McBride-Chang, 1999; NICHD, 2000; Wagner et al., 1994). Moreover, effects of multicomponential instruction tended to be slightly larger than those reported in the NELP study, likely because alphabet instruction was provided in all of the synthesized studies.

Effects for fluency outcomes were less clear. For multicomponential studies, letter naming fluency was the only outcome to show no effect of instruction, yet a moderate (though also nonsignificant) effect was found when letter names constituted the sole focus of instruction. Conversely, multicomponential studies showed a reliable moderate effect on letter sound fluency outcomes that was not apparent when considering the study providing only letter sound training. The positive effect for multicomponential studies on letter sound fluency may be partially explained by the close match between instructional target and assessed skill: All studies assessing sound fluency provided explicit instruction in letter–sound correspondences. In general, however, these findings were not wholly unexpected, as fluency is often touted as one of the most difficult aspects of literacy to improve (see e.g., Kuhn & Stahl, 2003; Torgesen & Hudson, 2006) even when restricted to the rapid naming of letters or sounds (de Jong & Vrielink, 2004).

Altogether, the results presented above suggest that specific domains of alphabet knowledge can be enhanced through instruction to varying degrees. Importantly, effect size magnitude did not appear to be biased by ceiling effects, publication bias, or study quality (with the single exception of letter name fluency for the latter). Given these findings, we are confident that the overall impact results are an accurate portrayal of the selected studies’ effects on alphabet learning. On the whole, these modest effect sizes for alphabet knowledge are somewhat surprising, given (a) the emphasis placed on alphabet learning by research (Adams, 1990; Whitehurst & Lonigan, 1998), educational (NAEYC, 1998), and policymaking (U.S. Department of Education, 2002, 2003) communities, (b) the close alignment between alphabet instruction and the alphabet measures used to assess its efficacy, and (c) the meta-analytic results for other literacy skills (Bus & van IJzendoorn, 1999; NELP, 2008; NICHD, 2000; Wagner, 1988). The NRP, for example, reported effects ranging from 0.53 to 0.86 for phonological awareness and phonics training on primary students’ phonological, reading, and spelling outcomes (NICHD, 2000). These estimates were even greater for at-risk students (0.58 to 0.95), a result not seen in the present analysis, and ranged up to 2.37 for particular subgroups of children.

The modest effects may have a number of explanations. First, alphabet learning may be of a different nature than other skills like phonological awareness, requiring significant amounts of rote memorization and repeated practice. Second, many studies were not particularly focused on providing letter name or sound instruction, and such instruction was often included as only a small or incidental portion of a larger literacy program (e.g., teaching letters to serve as “placeholders” for facilitating phonological awareness development). Third, many children have opportunities to learn about letters outside of a structured intervention or research study. In the United States, the country in which the majority of studies were conducted, parents and teachers alike tend to impart letter knowledge to young children, particularly focusing on letter names (Ellefson, Treiman, & Kessler, 2009). Children in control conditions may thus learn letter names to a similar extent as children receiving additional instruction. This explanation is supported by the smaller effects on letter name learning. Conversely, a lack of emphasis on letter sounds may explain the more substantial, yet still modest, effects noted for this outcome.

Alternatively, the small-to-moderate effect size estimates may be an authentic representation of our current ability to foster alphabet knowledge development during early literacy instruction. The question then turns to whether such gains are adequate. Although many children may easily acquire alphabet knowledge from incidental or informal teaching during home and school literacy activities (e.g., Aram, 2006; Thompson, Fletcher-Flinn, & Cottrell, 1999), other children, particularly those from disadvantaged backgrounds or those at risk for later reading difficulties, may not (National Research Council, 1998). These children often arrive in kindergarten at substantial disadvantages in their alphabet and early literacy development (Lonigan, 2003; Zill & Resnick, 2006), with this gap widening as formal reading instruction begins (Stanovich, 1986; Torgesen, 2002). “Catching up” these children to meet grade-level standards is often difficult. For instance, the present synthesis does not support the use of current instructional practices to close this gap in alphabet knowledge; effect sizes were no larger for children at risk for reading difficulties than those who were not. Similar to findings regarding instruction in other early literacy skills such as decoding (Foorman, Fletcher, & Francis, 2002), more intensive, explicit alphabet instruction may be necessary to lessen the achievement gap. These facts may call for greater attention to the effectiveness of alphabet instruction during the early years.

Foci and Delivery of Instruction

One of the benefits of the present meta-analysis was the ability to disambiguate effects based on the foci of instruction. In examining effects according to the alphabet and phonological components taught, larger effects were often noted when (a) the alphabet components taught matched the outcome, and (b) alphabet and phonological awareness instruction was combined. Interpretation of these analyses, however, is tempered by the small number of available studies. Once the larger pools of studies were subdivided according to the type(s) of instruction provided, sample sizes were often too small to warrant great confidence in the effect sizes generated. For example, van Bysterveldt et al. (2006) was the sole multicomponential study to provide letter name and sound instruction without phonological training, in the form of a shared parent–child storybook reading program. In such cases, generated effect sizes may be due to idiosyncratic features of particular studies and are susceptible to any biases present in the original studies. In the van Bysterveldt et al. study, the negative effects on letter name and letter sound knowledge may have been attributable to a number of factors: the implicit instructional method, implementation by parents, initial selection biases favoring the control group, and so forth. Noting the small number of studies that use particular instructional foci is important, however, to identify gaps in the literature. We hope that our findings serve as impetus for future research and replication.

One interesting finding deserving of further examination and replication concerns the impact of letter name instruction on letter sound learning. Studies providing letter name instruction as the only alphabet component showed reliable, positive impacts on children’s letter sound learning. This result lends causal support to the argument for letter name-to-sound facilitation, discussed within the context of letter names that provide cues for learning letter sounds (Evans et al., 2006; McBride-Chang, 1999; Share, 2004; Treiman, Tincoff, & Richmond-Welty, 1996; Treiman et al., 1998). Further validation of the effect is warranted, as is investigation of its mechanism, whether letter name cues or other factors such as differences in letter frequency or familiarity.

The results of moderator analyses yielded minimal information regarding how alphabet instruction might most efficiently and effectively proceed. Instruction of greater duration and instruction provided in a small-group context were more effective in promoting letter name knowledge. Small-group instruction has been repeatedly recognized within the early literacy literature as a hallmark of effective literacy instruction (Wharton-McDonald et al., 1997; Wharton-McDonald, Pressley, & Hampston, 1998) and linked to enhanced skill development in students (Connor, Morrison, & Slominski, 2006; NICHD, 2000). Instruction provided at home by parents was consistently the least effective means of promoting alphabet knowledge. Although only a limited number of such studies were included in the present analysis, parent and home-based interventions were found similarly ineffective for various literacy outcomes in the NELP review (NELP, 2008). This reduced impact may be due to the inherent difficulties in implementing home-based programs. Home-based treatments may be more difficult to organize and oversee, with fidelity to the treatment program suffering (van Otterloo, van der Leij, & Veldkamp, 2006). The content of such programs may also focus less on teaching specific components of literacy than on more global abilities (e.g., oral language, print awareness, general cognitive skills), or such programs may require better parent training to be implemented in a manner that affects children’s learning.

On the other hand, teachers and classroom instruction proved just as effective as the more controlled instruction provided by researchers in clinics and laboratories. This finding is important, demonstrating that literacy practices benefiting children’s alphabet development can be implemented in ecologically valid settings. More research detailing the training and support teachers need to secure these effects is warranted, as such details were often not noted in the research reports analyzed for this study. In general, future studies would benefit from more detailed reporting of key instructional and implementation features to facilitate identification of additional moderators of instructional effects.

Impact of Alphabet Instruction Beyond Alphabet Knowledge Outcomes

Results were generally inconclusive with respect to the causal relations between alphabet learning and development of other early literacy skills. Small effects on reading skills were found when assessed immediately following instruction, but effects were no longer apparent 2 to 12 months later. No effects were found for phonological awareness or spelling outcomes at either immediate or follow-up assessment, nor did initial effects on alphabet outcomes persist. Notably, the subsample of studies included in these analyses was small, and none were explicitly designed as longitudinal investigations concerning alphabet knowledge acquisition and the development of other literacy skills. The comparison conditions involved in the studies are also important to consider. Of the seven studies included in the phonological awareness analyses, for example, two provided phonological training to the comparison group and two provided alternative forms of alphabet instruction hypothesized to be less effective than the treatment under study. Thus, more than half of the studies provided literacy training to the control condition, which might have influenced phonological awareness and attenuated effect size estimates. It is also possible that pure letter name or sound instruction does not readily transfer to reading and spelling without instruction and practice in using alphabet knowledge for these purposes (e.g., blending; Feitelson, 1988).

Noting the inconsistencies in the effect size estimates for long-term reading and spelling outcomes, we revisited these four studies in the hope of distilling shared components trending toward positive or negative effects. Unfortunately, further examination did not elucidate any factors linked to follow-up outcomes. Methodologically, positive and negative effects were found when using quasi-experimental designs, in the presence of selection biases, with teacher- and researcher-implemented instruction, and for samples including older and younger, at-risk and non–at-risk children. The content and delivery of instruction across studies were also diverse. Fugate (1997) found positive outcomes for an experimental daily tutoring program that trained grade 1 students to mastery-level letter naming abilities over the course of 12 days as compared with students who used that time for journal writing. Stuart (1999) also found positive long-term outcomes when working with kindergarten students over a 12-week period, comparing a commercially available program for teaching letter sounds with shared storybook reading. In comparison, the Johnston and Watson (2004) study, which showed null or negative effects, provided either explicit letter sound instruction or training in matching pictures to sight words during 20 lessons across 10 weeks. The Schneider et al. (2000) study, also resulting in negative effects, provided children with either letter sound or phonological awareness instruction. Given that children were initially screened into the study based on phonological difficulties, the larger gains for children receiving the latter type of training are unsurprising.

These inconclusive results should not be misconstrued as disproving causal relations between alphabet knowledge and literacy skill development, nor should they be interpreted as suggesting that providing early alphabet instruction is futile. Rather, these findings should stimulate research expressly designed to answer important questions concerning the role of alphabet knowledge development in later literacy abilities.

Limitations and Future Directions

Because meta-analytic data consist essentially of the existing studies in a given area, the present study was limited by the quality and design of the research conducted in this domain. Within- and between-study confounds may jeopardize the validity of findings. For example, studies in which teachers voluntarily chose to implement a new alphabet instruction technique may capitalize on this selection bias and produce larger effects. Similarly, between-study comparisons may not be entirely appropriate when studies differ substantially in content or methodology, particularly when sample sizes are small. Although we attempted to examine such threats to validity via moderator analyses, they remain real limitations of any meta-analytic work.

Relatedly, we must point out that the results of moderator analyses may also be influenced by between- and within-study confounds. The factors included in moderator analyses were not directly manipulated and thus may have been confounded with one another within studies (e.g., many studies providing explicit letter sound instruction also provided phonological awareness training). Moderators may also have been confounded with methodological variables. Empirical validation, in which moderating variables are explicitly manipulated, is required for these effects to be causally interpretable.

Furthermore, additional research on how to effectively impart alphabet knowledge remains necessary. Despite our efforts to capture meaningful variations in instructional content and implementation for moderator analyses, details regarding individual studies could be coded only if they were explicitly mentioned within the text of research reports. Many proposed research questions concerning additional potential moderators of the effect of instruction on alphabet learning (e.g., organization of alphabet lessons including the order in which letters were taught, instructional materials, fidelity of implementation, training of instructors) could not be answered because of lack of information. Future design work, in which various ways of promoting mastery of letters and sounds are contrasted, is necessary to reliably answer such detailed questions concerning the efficacy and efficiency of alphabet instruction. Finally, as previously discussed, research aimed at elucidating the causal role of alphabet knowledge in promoting literacy development is required.

Acknowledgments

The authors thank Dr. Christopher Schatschneider, Dr. Christopher Lonigan, Dr. Mark Licht, and Dr. Carol McDonald Connor for comments on an earlier draft of this manuscript. This work was supported by a Pre-doctoral Interdisciplinary Research Training Grant from the Institute of Education Sciences (R305B04074), although the opinions, conclusions, and recommendations expressed are those of the authors and not necessarily those of the funding agency.

Footnotes

1 The NELP review included separate syntheses of both the correlational and intervention literature in the area of early literacy; the limitations addressed in the current study refer to the latter. Additional information pertaining to NELP is available at www.nifl.gov.

2 The NELP study also disaggregated those studies including phonological awareness training by the level of linguistic complexity (subphonemic, phonemic, both) and cognitive skill (analysis, synthesis, both) targeted.

3 Confidence intervals for effect size estimates were computed in the following manner (Shadish & Haddock, 1994):

\[
\bar{T}_{\cdot}^{*} \pm 1.96\,\sqrt{v_{\cdot}^{*}},
\qquad \text{with} \qquad
v_{\cdot}^{*} = \frac{1}{\sum_{i=1}^{k} \dfrac{1}{v_i + \tau^{2}}}
\qquad \text{and} \qquad
v_i = \frac{n_E + n_C}{n_E\, n_C} + \frac{g_U^{2}}{2\,(n_E + n_C - 2)}
\]

in which T̄.* is the average weighted effect size from the random-effects model, v.* is the variance of the average weighted effect size, v_i is the conditional variance of the effect size estimate (g_U) contributed by an individual study (i, ranging from 1 to k), τ² is the estimate of the between-studies variance, n_E is the number of participants in the experimental sample, and n_C is the number of participants in the control sample.
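As a concrete illustration of this computation, a minimal Python sketch follows (a sketch only; the g_U values, sample sizes, and τ² shown are hypothetical and not drawn from the synthesized studies).

import numpy as np

def random_effects_ci(g, n_e, n_c, tau2, z=1.96):
    """95% confidence interval for the average weighted effect size under a
    random-effects model (Shadish & Haddock, 1994), following footnote 3.

    g    : unbiased effect size estimates (g_U) from the k contrasts
    n_e  : experimental-group sample size for each contrast
    n_c  : control-group sample size for each contrast
    tau2 : estimate of the between-studies variance
    """
    g = np.asarray(g, dtype=float)
    n_e = np.asarray(n_e, dtype=float)
    n_c = np.asarray(n_c, dtype=float)

    # conditional variance v_i of each effect size estimate
    v = (n_e + n_c) / (n_e * n_c) + g ** 2 / (2 * (n_e + n_c - 2))
    w = 1.0 / (v + tau2)                  # random-effects weights
    t_bar = np.sum(w * g) / np.sum(w)     # average weighted effect size
    v_star = 1.0 / np.sum(w)              # variance of the average effect size
    half_width = z * np.sqrt(v_star)
    return t_bar, (t_bar - half_width, t_bar + half_width)

# Hypothetical example with three independent contrasts
mean_g, ci = random_effects_ci(g=[0.45, 0.30, 0.60],
                               n_e=[25, 40, 18],
                               n_c=[25, 38, 20],
                               tau2=0.02)
print("weighted mean g = %.2f, 95%% CI = (%.2f, %.2f)" % (mean_g, ci[0], ci[1]))

The sketch first computes each contrast's conditional variance v_i, weights each effect by 1/(v_i + τ²), and then forms the 95% confidence interval around the weighted average, as in the formula above.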

4 k technically refers to independent treatment/control contrasts and not necessarily individual studies. Most meta-analyses, however, report results in terms of “studies,” typically extracting only a single effect size from each study or viewing each independent treatment/control contrast as a separate study in its own right. For this reason, together with concerns regarding the ease of reading, the terms independent treatment/control contrast and study will be used interchangeably.

References

(* indicates inclusion in meta-analysis)