EARLY INTERVENTION MANAGEMENT AND RESEARCH GROUP (EMRG)
  • Home
  • About EMRG
  • AEPS Information
    • History of AEPS-3
    • The Linked System
    • Benefits
    • Review of AEPS-3
  • International Use
    • Current AEPS translations
    • Translation and Adaptation Guidelines
    • Translation Inquiry
  • AEPS Research
  • Training
  • FAQ

Reviews from the Mental Measurement Yearbook of The Buros Center

Review #1

Review of the Assessment, Evaluation, and Programming System for Infants and Children, Third Edition (AEPS®-3) by CATHERINE A. FIORELLO, Professor of School Psychology, Temple University, Philadelphia, PA:

DESCRIPTION. The Assessment, Evaluation, and Programming System for Infants and Children, Third Edition (AEPS-3) is a linked system of assessment, goal/outcome development, teaching/intervention, and progress monitoring. It is the latest revision of a combined assessment and curriculum for ages birth to 6 years. It is designed to assess and intervene in eight broad developmental areas: Fine Motor, Gross Motor, Adaptive, Social-Emotional, Social-Communication, Cognitive, Literacy, and Math. The test authors note that the AEPS-3 can be used by a wide range of professionals who have training in child development and learning, such as early interventionists, special education teachers, and specialists, including psychologists. The authors recommend that a professional team implement the assessment and interventions. The measure also includes family support materials, which should be completed with assistance available from a professional.

Assessment data are collected through observation, interview, and direct testing. The test authors note that observations in the natural environment and interviews with families are the preferred methods of data gathering, with direct testing reserved for items that cannot otherwise be obtained.

The AEPS-3 Test can be used as a whole, or it can be used flexibly to target individual developmental areas, to assess at the level of goals rather than the more in-depth objectives, or to assess school readiness skills using a subset of items called Ready—Set. The test is designed to determine present levels of performance, develop meaningful goals, plan effective teaching or intervention, monitor performance over time, and determine eligibility for services. Although the assessment portion of the AEPS-3 is curriculum based, cutoff scores are provided for determining or corroborating eligibility for services.

The teaching/intervention materials are closely linked to the skills assessed by the test and are designed to be implemented within routines and activities at home or at school.

DEVELOPMENT. Development of the first edition was undertaken by a consortium of professionals seeking an alternative to standardized, norm-referenced tests for use with young children with disabilities. Initial development began in 1976 with conceptual and empirical work funded by the predecessor to the Office of Special Education Programs. This work yielded a very comprehensive but overly long assessment for ages birth to 2 years in 1980, which was revised and studied before being published as the AEPS for Birth to Three Years in 1993. The AEPS for Three to Six Years followed in 1996. Both assessment and curriculum were included.

The second edition, published in 2002, included psychometric data and cutoff scores to determine eligibility for services and added a comprehensive online system that generated a variety of reports from the individual to the state level.

Development of the third edition, which combined both age groups into one system for birth to age 6, began in 2008. Items and scoring were revised to eliminate redundancy and to clarify and streamline assessment. Additional items were added at the early and late developmental age ranges of the scale, and the Literacy and Math areas were separated out and given new emphasis. A subset of items for ages 4 and up focused on school readiness (cognitive, literacy, math, communication, and social skills) was developed as a separate abbreviated test, the Ready—Set. The curricular materials were re-organized around the three-tier systems of support: Tier 1—Universal Support, Tier 2—Focused Support, and Tier 3—Specialized Support.

TECHNICAL.
Standardization. The sample employed for development of the AEPS-3 eligibility cutoff scores consisted of 874 children ages 2 months through 6 years 11 months. The sample included 412 typically developing children and 462 children receiving services under IDEA. Programs from seven states providing childcare or educational services to young children and who were currently using the AEPS Test were contacted. Teachers or providers participated in online training and had to obtain 80% agreement to collect data. Test administrators used an online system to enter a code number, chronological age, status as typically developing or receiving services, and goal scores for the eight developmental areas.

The states chosen for field testing were primarily in the Midwest and the South. Demographic information about the children assessed was limited to age, gender, and status as typically developing/receiving services. It is not known whether the sample is representative of the U.S. population based on race, ethnicity, language background, or socioeconomic status. The test authors did not provide evidence to suggest that this measure is invariant across these groups. The number of children in each age group was not reported in the test manual. However, the test authors noted that the sizes of the 0- to 6-month and 7- to 12-month age groups were particularly small, at 16 and 39, respectively.

Reliability. Internal consistency for items in each developmental area was not reported in the test manual, but person reliability estimates resulting from IRT analyses reported in the supplemental online information from a published study (Toland et al., 2022). Coefficients ranged from .74 (Fine Motor) to .91 (Adaptive), and all except Fine Motor and Math were above .80.

Interrater reliability analyses were conducted using 37 video clips of children engaging in typical activities in home and classroom settings. Teachers or providers (N = 116) completed online training, then scored the video clips using 68 AEPS-3 test items across all eight developmental areas and developmental levels. Participants’ scores were compared to the author/expert ratings of the video clips and yielded mean interrater agreement of 89%, with a range of 66% to 100%. Interrater agreement was not assessed for the separate developmental areas.

Validity. In addition to the history of initial item development by the consortium of experts, the test authors used expert reviews to examine content validity evidence. Across developmental areas and for the overall test, they reviewed qualitative feedback and comments on individual items or perceived gaps and revised test content in author work groups by consensus.

To provide further evidence of validity, the test authors showed that AEPS-3 combined scores were positively correlated with age (r = .65 to .92). In addition, a small study in Kentucky (N = 50, 68% Caucasian) showed weak to moderate correlations between AEPS-3 area scores and area scores from the Battelle Developmental Inventory, 2nd Edition (r = .24 to .65, per Grisham et al., 2021). The full range of intercorrelations was not provided in the test manual and needed to be obtained from the published study (Grisham et al., 2021) but did not show a consistently strong pattern of scores being more highly correlated across similar areas, as would be expected.

Sensitivity and Specificity. Cutoff scores for special education eligibility were derived in 6-month intervals based on the field study data (N = 874). (Analyses were attempted with 3-month intervals, but some age groups were too small to support that.) Children were coded prior to testing as eligible or ineligible for services. Based on AEPS-3 results, children were identified as eligible if they obtained at least two area scores equal to or less than the cutoff scores for their age group. Sensitivity ranged from 57% (61- to 66-month age group) to 100% (0- to 6-month and 7- to 12-month age group), while specificity ranged from 0% (0- to 6-month and 7- to 12-month age group) to 81% (67- to 72-month age group). Areas under the receiver operating characteristic curve (AUC) were statistically significant at the p < .05 level and were considered good to fair indicators of separability, with the exception of the 0- to 6-month and 7- to 12-month age groups, which had non-significant AUC and were considered poor indicators of separability.

COMMENTARY. The test authors provide in-depth information about the curriculum and its development. The test, as a curriculum-based, criterion-referenced measure, is comprehensive and clearly developmentally organized. It leads to clear teaching and intervention recommendations.

The portion of the test designed to determine eligibility for services through the use of age-based cutoff scores is technically inadequate for the task. The normative sample is not described in enough detail to determine the extent to which it is representative of the U.S. population. The youngest age groups, 0–6 months and 7–12 months, are too small to derive reliable and valid cutoff scores. The other age groups provide scores that are reliable enough to serve as screeners, but not for making high-stakes decisions. The rates of over- and under-identification are unacceptable for individual decision making.

SUMMARY. The AEPS-3 is a combined assessment and curriculum for children ages birth to 6 years. It covers eight developmental areas: Fine Motor, Gross Motor, Adaptive, Social-Emotional, Social-Communication, Cognitive, Literacy, and Math. The assessment is designed to be completed primarily through naturalistic observation and interviews, which is appropriate for a developmental assessment for this age group. Using the assessment as a criterion-referenced tool linked to the curriculum has high utility. The test manual does not provide sufficient evidence to support the use of eligibility cutoff scores. Even after consulting studies published by the test authors, no detailed descriptive information about the normative sample was found. The cutoff scores are based on an inadequate normative sample, and there is insufficient evidence of representativeness or fairness for all children. The scores are only reliable enough to use as a screener.
Review #2

Review
 of the Assessment, Evaluation, and Programming System for Infants and Children, Third Edition (AEPS®-3) by RENÉE M. TOBIN, Professor of Counseling and School Psychology, and PAUL C. JONES, Assistant Professor of School Psychology, Temple University, Philadelphia, PA:


DESCRIPTION. The Assessment, Evaluation, and Programming System for Infants and Children, Third Edition (AEPS-3) is a comprehensive, linked system of assessment, goal development, intervention, and progress monitoring intended to be administered to infants or children (birth through 6 years of age) individually or in group settings. It is a criterion-referenced measure and a curriculum-based assessment, not a norm-referenced assessment. The complete kit includes a user’s guide, an assessment volume, and three curriculum volumes, corresponding to developmental levels: Beginning, Growing, and Ready. A USB fob includes a complete library of assessment forms and reference guides. An updated online scoring system (AEPSi) is available for progress monitoring and child outcome reports. Training in the use of the AEPS-3 is offered through print materials, videos and online materials, and live training.

The AEPS-3 assesses children across eight areas: Fine Motor, Gross Motor, Adaptive, Social-Emotional, Social-Communication, Cognitive, and two pre-academic skill areas of Literacy and Math, which are new to this edition. Each developmental area is organized according to strands, with goals and related objectives that can be assessed to guide intervention. Testing begins with developmentally more advanced or difficult objectives to facilitate administration (i.e., if a child can complete the more advanced skill, the child has attained the ability to master easier skills). Assessment data are gathered through structured and unstructured observations, interviews with caregivers, and direct testing. Items are scored on a 3-point scale corresponding to the level of mastery of a particular skill. The scale is anchored at 2 (independent performance) and 0 (the child has not yet learned or begun to demonstrate the skill). Response forms are structured for multiple assessments to facilitate progress monitoring.

The assessment protocol is flexible and is designed to be administered by a single professional or a group of collaborating professionals. When natural observations of skills are not possible, assessment activities designed to elicit specific skills are provided. Additionally, tools are available as part of the AEPS-3 to gather family input and to report results and progress to families. Although administering the full assessment could be time-consuming, individual areas can be administered pursuant to referral concerns. Additionally, the AEPS-3 includes the Ready-Set (a shortened version of the AEPS-3 Test) with forms that can be completed by both teachers and families.

The results of the assessment are linked to goals and subsequent intervention strategies. The Skills Matrix is a reference guide to assist the assessor in matching goals to curriculum strategies for intervention and suggestions for embedding interventions in daily routines, as described in the Beginning, Growing, and Ready volumes. Both the assessment and curricula are associated with childhood routines and activities that are likely targets for implementing interventions. The assessment is designed to be administered at multiple points for progress monitoring. Cutoff scores have been developed to assist in eligibility determination for special services, based on comparison to age-expected performance. Materials to facilitate the development of IFSPs/IEPs, including suggestions for goal development and planning, are also included.

DEVELOPMENT. The original AEPS was published in 1993, based on work dating to the 1970s that led to the development of the Adaptive Performance Instrument (API) and the Evaluation and Programming System (EPS). The 2002 AEPS-2 improved on the first edition by filling some of its psychometric gaps and including cutoff scores to assist in eligibility determination.

The AEPS-2 was composed of two scales based on the age of the child, whereas the AEPS-3 was designed to have a single continuous scale for all ages. Relatedly, there were repetitive and overlapping items between the two levels that were subsequently combined or otherwise revised in the AEPS-3, and items were added to extend the range of the assessment in the lower and upper age ranges. The AEPS-3 curriculum was redesigned significantly to better reflect children’s skill development as well as to suggest interventions and activities within a multi-tiered system of supports based on a child’s needs. Family support materials were developed and expanded, and forms were updated for ease of use and to reflect the changes in items and test design.

TECHNICAL. The test authors provide results of qualitative and quantitative analysis for the AEPS-3 and the briefer Ready-Set measure. Following revisions to items and design for the AEPS-3, the authors sought qualitative feedback from experts on the coverage of skills and usability of the assessment, including clarity of items, developmental sequence, and appropriateness of skills. The panel’s specific feedback was not reported in the test materials, but the test authors state that changes were minor, with the exception of the Literacy and Math areas. The Assessment Activities and Family Report components also received positive reports from users, per the authors’ report.

In terms of quantitative analyses, interrater agreement was examined by having 116 trained scorers rate video clips of children. Agreement between the trained scorers and predetermined expert ratings on 68 AEPS-3 items ranged from 66% to 100% (mean of 89%). Concurrent validity evidence was assessed via comparisons with the Battelle Developmental Inventory, 2nd Edition (BDI-2). Correlation coefficients were calculated between the BDI-2 and two sets of AEPS-3 scores: combined goals and objectives scores and total goal scores only. The total goals and objectives score from each area of the AEPS-3 was compared to analogous domains of the BDI-2. Correlations were weak to moderate (r = .31 to .65), with more similar domains demonstrating higher correlation coefficients, compared to less similar domains. Regarding age, stronger Pearson correlation coefficients were observed between age and AEPS-3 combined scores (r = .65 to .92) than between age and area goal scores (r = .34 to .84).

Rasch rating scale model analysis was used to determine cutoff scores for each developmental area within 6-month intervals. Items were removed iteratively until a unidimensional scale was established for each developmental area, although there remained high correlations between areas, particularly Cognitive and Adaptive (r = .91), and the sample was limited by variation in sample sizes within specific age bands (Toland et al., 2022). Sensitivity and specificity analyses were conducted using known eligibility status of children. Sensitivity ranged from 57% at the 61–66 month age interval to 100% at 0–6 and 7–12 month intervals. Specificity ranged from 0% at the 0–6 month and 7–12 month intervals to 81% at the 67–72 month age interval. ROC curve analyses indicated fair to good significance for most age ranges, but the 0–6 and 7–12 month intervals were not significant. The test authors attribute these findings to small sample sizes in the youngest groups. Similar to previous findings with the AEPS-2, the AEPS-3 is likely to identify those in need of services but also to overidentify children who do not require special services. Its use for eligibility determinations for children up to 1 year is questionable.

Additional technical data were provided regarding the Ready-Set abbreviated measure, indicating positive, but unreported, correlations in most areas between parent and teacher reports. Qualitative data regarding the AEPS-3 Curriculum were also reported by users, indicating support for the linked system of assessment and intervention, multi-tiered support for children with and without disabilities, and the usefulness of progress monitoring. Data regarding the utility of the Assessment Activities, Family Report, and Family Assessment of Child Skills were primarily qualitative in nature, generally indicating ease of use and understanding.

The test authors state the AEPS-3 can be used with children and families from different cultures and backgrounds with modifications made by users (e.g., use of objects and manipulatives that are culturally familiar to the child). The test manual reports race/ethnicity of the sample used in the concurrent validity study with the BDI-2. The sample was predominantly White (68%), and no research (e.g., differential item functioning analyses) has been reported to strengthen the argument for use with culturally diverse children and families. A Spanish translation is available, although no additional psychometric support was reported. A French translation of the AEPS-3 was also completed and deemed appropriate for use with French-speaking children. Importantly, some studies were completed using preliminary versions of the AEPS-3. Interrater reliability coefficients on select areas of the French version were high across several studies. Content validity evidence was presented for the Literacy area only.

COMMENTARY. The key strength of the AEPS-3 is its comprehensive design that links assessment with intervention and progress monitoring. The usability of the AEPS-3 has been established among previous users of the AEPS-2. The Ready-Set abbreviated measure is a welcome addition to improve efficiency and access to the measure in the context of limited resources. Psychometric evidence generally supports the validity of test scores and interpretation. Additional psychometric and statistical analyses would strengthen the evidence for the use of the AEPS-3. For example, structural models can provide construct evidence of validity beyond that contributed by experts. Outcome data related to the Assessment Activities, Family Report components, and Curriculum would further strengthen the existing research. The cutoff scores for the AEPS-3 should be reexamined with larger, randomized, and more varied samples. As is, support for the use of the AEPS-3 for eligibility determinations, particularly among younger groups, is limited; the measure should be used with caution in these groups. Support for the interventions and progress monitoring tools also should be examined. The test authors have improved the means by which families can collaborate and communicate with professionals about strengths and needs of their children.

SUMMARY. The AEPS-3 is a well-designed, criterion-referenced measure and comprehensive assessment of child development across broad areas that improved upon noted weaknesses of the AEPS-2. Although additional validity evidence would further support the use of the AEPS-3, the instrument has reasonable evidence to recommend its use for assessment, intervention, and progress monitoring of children with and without disabilities. Caution should be used, however, with eligibility determinations, and professionals should rely on multiple methods of assessment for decision-making regarding infants, in particular.
Powered by Create your own unique website with customizable templates.
  • Home
  • About EMRG
  • AEPS Information
    • History of AEPS-3
    • The Linked System
    • Benefits
    • Review of AEPS-3
  • International Use
    • Current AEPS translations
    • Translation and Adaptation Guidelines
    • Translation Inquiry
  • AEPS Research
  • Training
  • FAQ