Compare and contrast the general definitions of the terms reliability and validity as they apply to standardized testing. Explain the problems that would be encountered if a test were unreliable. Include in your discussion definitions of the three forms of reliability described in the text. How does a test's reliability affect its validity? Explain the relationship between reliability and
validity.
What will be an ideal response?
Reliability is defined as the similarity between two rankings of test scores obtained from the same individual. The more test scores are free of measurement error, the greater the probability that two scores from the same individual will be the same. Validity is defined as how well a test measures what it claims to measure. Reliability and validity are both critical issues in testing. In order to make meaningful interpretations of students' test scores, tests must yield consistent scores and be measuring what the user intends to measure.
Three types of reliability were discussed in the text.
Test-retest reliability: This pertains to the stability of an individual's test score when given the same test on two separate occasions. If a test does not have test-retest reliability a student may score very well on one occasion and very poorly on a second occasion. This would make it impossible to finally determine an accurate indication of the individual's level of the characteristic being measured.
Alternate-form reliability: This pertains to the equivalence of two forms of a single test. Both forms should measure the same content using items of similar difficulty. If a test does not have this characteristic student performance would be affected by the form of the test they took when they should be able to take either form and score the same.
Split-half reliability: This pertains to the homogeneity or internal consistency of a test's items. They should all be working together to measure the same characteristic. If a test does not have this quality, some of its items will be measuring the intended characteristic while others will be measuring something else. For example, math word problems should not have difficult vocabulary. This would result in items measuring language skills rather than math skills.
Tests must be reliable in order for them to be valid. If a test's scores are not consistent over time, or forms of a test are not equivalent, or its items measure the wrong characteristics, appropriate conclusions regarding student performance cannot be made. However, when a test is valid, meaning that it is measuring the right thing and appropriate conclusions regarding student performance can be made, then we know it must be reliable.
You might also like to view...
Explain why children learn science concepts best in a social milieu
What will be an ideal response?
Refer to the ways in which women and men are expected to behave and are often based on patriarchal social norms
a. Traditional social norms b. Gender norms c. Traditional gender roles d. Gender roles
Teachers who are ill should:
a. stay home whenever they are feeling "under the weather" b. follow the same exclusion guidelines set for children c. ignore minor illnesses and come to work d. not come to work if they have been exposed to communicable illness; they should wait to see if symptoms develop
____________ is a management tool or planning vehicle that ensures that children with disabilities receive an individualized education appropriate to their unique needs
a. Section 504 document b. IFSP c. IEP