Introduction

CHAPTER 1 - INTRODUCTION

Language is vitally important to all aspects of a child’s development. Research has shown that it is related to social (Craig, 1993; Gallagher, 1993; Rice, 1993), cognitive (Brown, 1973; Chapman, 1992; Dromi, 1993; Flavell, 1985; Schwartz & Murphy, 1975; Silva, 1980), and educational (Silva, McGee, & Williams, 1987) domains of development. In light of the importance of language to a child’s overall development, the early detection of disorders in any domain of language becomes crucial, as does early intervention (Silva et al., 1987).

1.1 Language Impairment And Development

        Language is central to all communication and provides the foundation for learning and socialisation. With respect to social development, children need to be competent language users to make and maintain friendships, and to communicate effectively with peers, parents, and teachers. Language is the primary means of establishing and maintaining social relationships (Prutting, 1982; Schwartz & Murphy, 1975). Social interaction is then a vehicle for learning and refining language skills (Craig, 1993; Gallagher, 1993; Schwartz & Murphy, 1975). The competent language user has the potential to continually improve his or her communication ability, as establishing social relationships is facilitated by strong language skills, and the social relationships themselves provide opportunities for further language learning. Language-impaired children; however, may have difficulties interacting with their peers, and consequently are at risk for social rejection (Craig, 1993; Gallagher, 1993; Rice, 1993). This rejection may then reduce their exposure to language and therefore limit their opportunities to practice and improve their conversational skills (Rice, 1993).
        A similar, reciprocal relationship exists between language development and cognitive development. The organisation of a child's experiences and the acquisition of cognitive abilities are associated with the development of language (Flavell, 1985; Kamhi & Catts, 1989; Schwartz & Murphy, 1975). In a review of studies investigating cognitive attainments in language impaired children, Rescorla and Goosens (1992) concluded that children with impaired language perform more poorly than peers with normal language skills on a wide range of cognitive tasks. In addition, once a child enters school, language is the means through which all subjects are taught (Berlin, Blank, & Rose, 1980; Schwartz & Murphy, 1975). Even subjects not obviously related to language, such as math and science, are taught through the teacher’s use of language to convey information to students. Thus, a child's language difficulties have the potential to interfere with academic success in all subject areas. Indeed, Brown and Edwards (1989) concluded that early language impairments later resurface as difficulties in reading and writing. Research has shown that a language delay identified in the preschool years has a strong predictive relationship to later learning difficulties (Silva, Williams, & McGee). Research also has shown that preschool language ability was the best single predictor of later reading ability (Aram, Ekelman, and Nation, 1984). Maxwell and Wallach (1984) reported that a persistent language delay in the preschool years was associated with risk for chronic deficits in language and academic achievement.
        In addition to language delays being predictive of later academic difficulties, research suggests that the effects of a language delay are broad and long-term. Weiner (1985) reviewed 17 longitudinal studies of children with speech and language impairments. Weiner’s review suggested that an impaired language system will have life-long negative consequences on social, academic, and vocational success. Problems in adolescence were identified by Donahue and Bryan (1984) who found that adolescents who had been identified with communication impairments as children were less popular with peers. Aram et al. (1984) also reported that adolescents who had been identified with communication impairments were described by their parents as having more behaviour problems than their peers. In addition, teachers were more likely to express concerns about their level of academic achievement. Lunday (1996) reported findings of a 1991 U.S. Department of Labor study that expressed similar concerns, concluding that communication disorders are likely to impair the ability of adolescents and young adults to acquire job skills, find employment and remain employed.

1.2 Early Intervention

In order to prevent or reduce many of the long-term consequences of language impairments, intervening with language impaired children as early as possible seems to be one solution. It has been suggested that the longer speech and language delays are left without intervention, the harder it will be to rehabilitate the language system (Eisenson & Ingram, 1972; Menyuk 1975; Semel & Wiig, 1975). Researchers have theorised that an impaired language system becomes increasingly inefficient and rigid with age and becomes less amenable to remediation. Early intervention may serve to modify and repair the language system at an age when it is less rigid. This is not to suggest that later intervention is not worthwhile and should not be undertaken. It should be noted; however, that as age increases, the difference in abilities between a child with impaired language and his or her peers widens (Wiig & Semel, 1984), possibly due to the changing language demands of the curriculum (Paul, 1991; Schwartz & Murphy, 1975). Thus, as a child gets older, he or she must work harder to catch-up to peers. Research has shown that the older a child is when a language impairment is identified, the poorer the outcome is for the child (Aram et al., 1984; Schery, 1985; Schwartz & Murphy, 1975). In summary, the age at which a child begins intervention can influence the impact of a language delay on social and cognitive skill development, educational achievement, and employment potential.
The first step in intervention with language impaired children is adequate and accurate assessment (Keith, 1994). According to Cox and Cox (1981), accurate assessment of developing language skills in preschool children should be a mandatory skill for all practising Speech-Language Pathologists. Clinicians make use of both standardised language tests and language sampling techniques, as well as informal, clinician-devised methods and their clinical judgement when assessing the language abilities of children (Allen, Bliss, & Timmons, 1981; Wilson, Blackmon, Hall, & Elcholz, 1991). The use of norm-referenced, standardised tests is generally considered appropriate and necessary for identifying speech or language impairments (Schery, 1981; Stark, Tallal, & Mellits, 1982).

1.3 Standardised Assessment

        Plante and Vance (1995) state that the primary purpose for administering a norm-referenced test is to determine if an impairment is present or not. As well as identifying if a problem exists, determining the goals of intervention and planning procedures for intervention are also objectives of assessment (McCauley & Swisher, 1984a). Carrow-Woolfolk and Lynch (1982) further state that language assessments are concerned with identifying if there is a problem, as stated above, determining the significance or severity of the problem if one exists, describing the nature of the problem, and describing the overall language profile of the person with the problem. To determine the existence and severity of a language problem, normative data must be used in order to compare the individual to his or her peers. Describing the nature of the problem requires naturalistic observation and interviews with parents or teachers (Carrow-Woolfolk, & Lynch, 1982).   Clinicians often rely on standardised tests to provide all of the necessary information about the child and the disorder. They often use tests that are familiar to them and that are easily available. Assessment batteries; however, should be determined by the need for specific information to be used for a definite purpose, not by familiarity, ease of testing, or availability (Carrow-Woolfolk & Lynch, 1982). Further, solely using available standardised tests may not provide a representative view of the child’s ability to use language across different meaning contexts (Crais, 1995).
        In the U.S., the use of standardised tests by clinicians to identify individuals with language disorders and to plan treatment has been increasing since PL 94-142 was introduced in 1975 (Danwitz, 1981; Plante & Vance, 1994), requiring the use of standardised tests during language assessments (Wilson et al., 1991). In a study by Wilson et al. (1991), the Expressive One Word Picture Vocabulary Test, Clinical Evaluation of Language Fundamentals, Structured Photographic Expressive Language Test, Test of Language Development - Primary, Illinois Test of Psycholinguistic Abilities, Preschool Language Scale, the WORD Test, and the Detroit Test of Learning Aptitude were among the top ten tests used to assess expressive language. A large majority of clinicians in the study used formal standardised tests as part of their language assessments. Many clinicians also reported using standardised tests in conjunction with informal assessments and language sampling methods. A primary constraint on the clinician’s assessments was time. Half of the respondents reported that they did not have enough time to perform adequate assessment.
        Standardised tests are frequently used by speech-language pathologists because of hospital, school, or treatment centre guidelines. Standardised test results are often required to determine a child’s eligibility for special programs and services (Schery, 1981; Wilson et al., 1991). Standardised tests with normative data allow the speech-language pathologist to compare a child's score to a group of other children who are believed to be similar to the child on important variables, such as age, ethnicity, socio-economic status (Kelly & Rice, 1986).
        Two types of tests are used to assess children: those designed to screen for potential speech and language impairments, and those designed to identify impairments (Plante & Vance, 1995). More than 150 tests have been developed to evaluate various aspects of a child's linguistic performance (Wilson et al., 1991), including over 50 standardised screening instruments (Sturner, Layton, Heller, Funk, & Machon, 1994). Standardised tests do have advantages over non-standardised assessment techniques. Standardised tests allow for increased objectivity by the examiner, the opportunity to repeat the assessment with the same child or other children, and they reduce unnecessary and uncontrolled variation during assessment (Kelly & Rice, 1986). However, the use of standardised tests is subject to limitations. The structure of standardised tests and the contrived nature of their tasks limits their ability to capture how a child uses his or her language ability in everyday situations. Many of the available tools for assessment require at least some degree of subjectivity on the part of the examiner (Cox & Cox, 1981). Finally, despite the large number of available tests, few meet the psychometric criteria for standardised tests (McCauley & Swisher, 1984a; Plante & Vance, 1994).

        1.3.1 Psychometric Characteristics of Tests

        In a review of 30 standardised preschool language tests, McCauley and Swisher (1984a), found only one test that met greater than six of the 10 psychometric criteria evaluated. Similarly, of the 21 standardised preschool language tests evaluated by Plante and Vance (1994), only four tests met six or more of the same 10 psychometric criteria. Only one of these four tests was found to provide "acceptable accuracy in discriminating between the children with normal and the children with impaired language" (Plante & Vance, 1994, p. 15). In both studies, the most frequently unmet psychometric criteria were the inclusion of empirical evidence of validity and reliability. Three kinds of validity have been acknowledged as important for tests that measure any kind of behaviour: construct, criterion-related, and content validity (McCauley & Swisher, 1984a).
        A construct is “an informed, scientific idea ‘constructed’ to describe or explain behaviour” (Cohen, Swerdlik, & Smith, 1992, p. 175). Construct validity seeks to judge the appropriateness of inferences made about an individual’s ability on a certain construct, based on test scores. It is examined by looking closely at test authors’ description of the construct they claim their test measures and comparing it to the actual test content (McCauley & Swisher, 1984a). Criterion-related validity looks at correlations between scores on the test under scrutiny and scores on another test measuring the same behaviour (McCauley & Swisher, 1984a). Convergent correlations provide evidence of criterion-related validity, as shown when test scores correlate with scores on older, more established and validated tests measuring the same or a similar construct (McCauley & Swisher, 1984a). Alternatively, divergent correlations provide evidence of criterion-related validity, as is shown when test scores do not correlate highly with other tests of behaviours with which they should not theoretically be correlated (Cohen et al., 1992). Content validity refers to how well a test samples behaviours that are representative of the whole family of behaviours, or the construct, the test was designed to measure (Cohen et al., 1992). Therefore, a general language test would be expected to sample behaviours from all of the language components.
        Factor analysis can be used to identify clusters of variables, or factors, that may be causing test scores to correlate (Cohen et al., 1992). Exploratory factor analysis (EFA) seeks to identify the sources of influence within a correlational matrix (Bryan & Yarnold, 1995). That is, the items in the matrix are grouped according to theoretical factors or similar underlying constructs, which must be identified by the researcher, that lead to the correlations between the items (Bryan & Yarnold, 1995).
        The most important psychometric property of a test, according to Plante and Vance (1995), is how accurately a test determines whether or not an impairment is present. Standardised language tests should provide evidence of their ability to distinguish children with normally developing language from children with impaired language development. Tests should also be developmentally sensitive. That is, tests should reflect the development of language ability as children grow older.

1.3.2 Developmental Sensitivity

        A language test’s sensitivity to a child’s development is very important. Evidence of changes that are associated with age for certain constructs has been accepted as evidence of construct validity. It is argued that the manifestation of the construct changes with age and the scores on the test change with age, therefore the test is measuring that construct (Cohen et al., 1992). Although developmental sensitivity is not enough to establish validity on its own, it adds to the validity and is an important aspect of developing any test for use with children.
        For tests that claim to measure constructs that are expected to change as a child matures, such as language ability, test scores should change over time as well (Cohen, et al., 1992). For example, if a three year old, a four year old, and a five year old all completed the Peabody Picture Vocabulary Test - Revised (Dunn & Dunn, 1981), one would expect that the raw scores received would be highest for the five year old and lowest for the three year old, because vocabulary increases with age.
        Unfortunately, many test manuals do not include information about how test scores relate to chronological age. The Structured Photographic Expressive Language Test-Preschool (SPELT-P) manual (Werner & Kresheck, 1983) reports that “the structures measured by the SPELT-P are developmental in nature and performance should be related to chronological age” (p. 19). Mean scores on the SPELT-P increased with age in the standardisation sample, but no correlations were reported of raw score with age were reported, and no mention was made of the significance or non-significance of the increases.
        The Renfrew Action Picture Test (Renfrew, 1993) also lists mean scores for the Information and Grammar subtests for age ranges from 3;6-3;11 to 8;0-8;5. Mean scores increased with age, but no information as to the statistical significant of the increases was mentioned.
        The Reynell Developmental Language Scales (Reynell, 1978) is based on three components of expressive language (structure, vocabulary, and content) that are developmental in nature. According to the manual, language structure develops rapidly over the first three years of life, and by the age of five, most of the basic language structure is available for use. In the development of the scales, the development of the individual components was taken into account. For example, vocabulary develops very rapidly between 1;6 and 4;6; therefore, greatest coverage was given to this age range when vocabulary items were chosen for the test.

1.3.3 Relationships between language tests

        Other than correlational information, little data exist about the relationships between language tests (Howlin & Cross, 1994). Much of the data come from standardisation samples, and often very small samples of younger age groups are all that are available (Howlin & Cross, 1994).
        A study by Howlin and Cross (1994) examined the scores of 35 three to four year olds on six different standardised language tests commonly used by speech-language pathologists in Great Britain. They found correlations between age-equivalence test scores that ranged from .32 to .82. An age-equivalence test score is the mean score obtained by a group of children at a given age. When they correlated age-equivalent test scores with chronological age, the correlations ranged from .04 to .39. Significant correlations among age equivalence scores and chronological age were expected, although not found. The restricted age range in the study was determined to be the reason for the small correlations (Howlin & Cross, 1994).
        Harwood (1997) examined correlations among eight language tests for a mixed group of children with normal language and children aged three to six with impaired language. She found correlations among raw scores on the tests that ranged from 0.16 to 0.86, and correlations among z scores that ranged from .11 to .83.
Language test manuals often report correlations between test scores and other similar tests as evidence of concurrent validity. For examples of these correlations as well as reliability correlations, please see the individual test sections in Chapter 2.

1.3.4 Ability of Tests to Distinguish Children with Normal Language from Children With Language Impairment

        Important decisions regarding the provision of clinical services are dependent on the results of language assessments. It is critical that tests meet psychometric criteria and distinguish between children with language impairment and children without language impairment children (Plante & Vance, 1994). In order to identify a child with language impairment, language performance must be assessed in more than one context, including contexts that stress the language system (Lahey, 1990). Tests designed to identify language disorders should discriminate between children with normal language and children with language impairment with a high degree of accuracy. According to Plante and Vance (1994), 90% is considered to be good accuracy, while 80% accuracy is only considered to be fair.
        The accuracy of current tests that are used to classify children as language-impaired or non-language-impaired has been called into question (Howlin & Cross, 1994; Howlin & Kendall, 1991). More information about the psychometric properties of these tests, such as the constructs these tests are actually measuring and the items that best represent the constructs, needs to be obtained for these tests.
        According to Plante and Vance (1995), the diagnostic accuracy of individual tests is dependent on an empirically derived cut-off score for that test. In their study, simply adopting a standard cut-off score, such as 1, 1.5, or 2 standard deviations below the mean, on the Clinical Evaluation of Language Fundamentals-Preschool (CELF-P) and SPELT-P resulted in reduced diagnostic accuracy of the tests. They found that the two tests were maximally effective at distinguishing between normal and abnormal language learners at different cut-off scores. The cut-off scores were -0.27 and -1.39 standard deviations for the CELF-P and SPELT-P, respectively.
        It should be noted that speech-language pathologists (SLPs) do not simply accept the results of standardised tests, but that they bring their years of experience to the assessment situation as well and examine information other than test scores, such as observations made during assessment. Experienced SLPs often report knowledge of individual tests and are aware of some test idiosyncrasies that they have observed through frequent use of the tests (Warr-Leeper, 1997, personal communication). As a result, children who fail a language test are not necessarily labelled as language impaired by clinicians, and children who pass a language test may still be diagnosed with a language impairment (Allen et al., 1981).

1.4 Language Sampling

        Standardised tests access language ability through contrived tasks that may have little resemblance to real environments (Hughes, Fey & Long, 1992). In order to gain an understanding of a child’s language ability in a more natural communicative setting, language sampling techniques have been developed (Klee, 1992). Language sampling is appealing to clinicians because it is the most ecologically valid way of assessing a child's ability to use language communicatively (Paul, 1995). Interacting and communicating with a child gives the clinician a clear impression of how that same child interacts with peers, parents and teachers. In addition, the clinician is provided with spontaneous examples of a child's language ability. Data from the language sample can then be used by clinicians to generate hypotheses about the child's language system and how it may be impaired (Lahey, 1988). McCauley and Swisher (1984b) note that the detailed description of a child’s language production that is provided by a language sample analysis is a good source of intervention goals. They also state that even though language sampling techniques may be lengthy and difficult, the information they provide is valuable enough to outweigh the difficulties. Furthermore they suggest that language sampling analysis is a preferable source for determining therapy objectives compared to standardised tests. Thus it appears that language sampling measures are considered valuable for both identification of a language impairment and selection of intervention objectives.
        A variety of language sampling methods have been developed. They include: Assigning Structural Stage (Miller, 1981), Index of Productive Syntax (IPSYN; Scarborough, 1990), Language Assessment and Remediation Procedure (LARSP; Crystal, Fletcher, & Garman, 1976), Developmental Sentence Scoring (DSS; Lee, 1974) and Systematic Analysis of Language Transcripts (SALT; Miller & Chapman, 1993).
SALT is a computerised method of calculating mean length of utterance (MLU), total number of words (TNW), number of different words (NDW), and calculating frequency counts of specific morphemes. Other computer programs designed to analyse language samples are also available on the market (e.g., Computerised Profiling: Long, 1987; Child Language Data Exchange System: MacWhinney, 1995). The output from such computer programs may result in many numbers quantifying a language sample (e.g., total number of utterances, total number of complete and intelligible utterances, total number of words) but the meaning and value of these numbers require more research (Klee, 1992).
        Klee (1992) examined nine measures and found three of the language sample measures had good developmental and diagnostic characteristics. They were Mean Sentence Length (MSL), TNW, and NDW. MSL, TNW, and NDW correlated either moderately or highly with age for both normal and specific language impaired (SLI) groups and showed potential for assessing change in language production over time. MLU was shown to have developmental and diagnostic characteristics in an earlier study (Klee, Schaffer, May, Membrino, & Mougey, 1989) and was correlated with age at r ‘ .75 for 24 normally developing children from 24 to 50 months of age, and at r ‘ .77 for 24 language impaired children from 24 to 50 months of age. In other studies, MLU was found to be correlated with age with correlations ranging from .16 (Klee & Fitzgerald, 1985) to .88 (Miller & Chapman, 1981).

1.5 Purpose of the Present Study

The purpose of the present study was to examine eight language tests commonly used by speech-language pathologists in Southern Ontario. The following questions were addressed:

1. Are the tests developmentally sensitive? Overall, do the test scores correlate with the chronological ages of the
children?

2. Does developmental sensitivity differ across various age groupings? That is, are correlations with chronological age
different for three year olds, four year olds, and five year olds?

3. What are the relationships among the tests? How do they correlate with one another?

4. What may account for the relationships among the tests?

5. Are the tests able to distinguish between children with normal language skills and children with impaired language skills?

6. Which tests best discriminate between normal language learning and language impaired children?

Hosted by www.Geocities.ws