Faculty Teaching
A comprehensive model of evaluating teaching advocates the use of multiple sources of information to confirm decisions. Each source–students, self-reports, colleagues and chairs, and evidence of learning–has particular strengths and limitations. The weight of the accumulated results leads to the most valid personnel decisions. Using a mixture of evaluation sources can also lead to greater improvement in teaching because different sources are helpful to different teachers and can help identify weaknesses in different areas of instruction.
Four major sources of information are used to determine the effectiveness of an individual teacher: student evaluations, teacher self-reports, colleague/department chair evaluations, and evidence of student learning. Each of these approaches can be useful in making personnel decisions (salary, promotion, and tenure) or in improving teaching. Each approach has strengths and weaknesses that must be kept in mind and that make it imperative to combine evidence for the best judgments.
Student Evaluations
Evaluations of teaching by students have become important and a frequently used method of assessment, the rationale being that students, and only students, are constant observers of what happens in the classroom. Moreover, only students can answer questions about the effects of instruction on them. Research over the past forty-five years has generally demonstrated their validity, reliability, and utility in improving instruction; this same body of research has shown that systematic course evaluations by students can provide useful information in assessing teachers for salary, promotion, and tenure decisions.
Although typically administered during the last week or two of the semester (but prior to final exams and grades), rating forms are on occasion given at midsemester so the instructors may make immediate adjustments for a particular course. Machine-scored forms are frequently used in order to process the large amounts of college and department data, but the open-ended comments solicited by most rating forms often provide teachers with more specific suggestions for improvement. The questions asked about teaching or the course commonly fall into these categories: organization or planning, teacher–student interaction or rapport, clarity or communication skill, workload and course difficulty, grading and assignments, and student self-reported learning. In addition there are always questions asking students to provide a global or overall rating of the course, the instructor, or the instruction received.
Many colleges have assembled their own student rating forms and some allow students to make their ratings by computer. Commercial forms published by the Educational Testing Service (SIR II) and Kansas State University (IDEA) have for many years provided colleges with score summaries, comparison information, and research reports.
Any use of student evaluations should take into account the vast amount of evidence from research (more than two thousand studies in the Educational Resource Information Center (ERIC) system since 1971). This evidence provides the foundation for guidelines for the proper use of student evaluations. Some of the guidelines that institutions should keep in mind follow.
1. Use several sets of evaluation results. Because an individual course may not accurately reflect a teacher’s performance, a set of results based on several classes should be used for personnel decisions. Some research suggests using at least five classes.
2. Have a sufficient number of students evaluate each course. Averaging responses from enough students will minimize the effects of a few divergent opinions. Generally, fifteen students is a sufficient number, assuming they represent at least half of the enrolled students in a class. Research also has shown that student evaluations are consistent over short periods of time.
3. Consider some course characteristic in interpreting ratings. Although any single characteristic does not have a great effect, a combination could effect a teacher’s evaluation. Research shows, for example, that small classes receive slightly higher ratings and that subject areas such as natural sciences and mathematics receive somewhat lower ratings. Courses that are college required, but not to satisfy the requirements of a major or minor, are also rated lower.
4. For personnel decisions, emphasize global ratings and estimates of learning. Research has shown that an overall or global rating correlated best with measured student achievement–more highly than ratings dealing with different teaching styles and presentation methods. Likewise, student estimates of their learning in a course are good reflections of instructional effectiveness.
5. Student evaluations can improve instruction, depending on how instructors use the results. Good evaluation forms help teachers diagnose their strengths and weaknesses. Studies indicate that some teachers can use the results directly, while others may need to discuss the results with a colleague or a professional consultant.
6. Give those being evaluated an opportunity to respond to evaluation results and to describe their teaching in writing. Teachers being evaluated for personnel purposes should have a chance to describe what they were trying to accomplish in the course and how their teaching methods fit those objectives. The self-report of teaching, which can be part of a teaching portfolio, gives teachers an opportunity to make their own best case.