Incorporating Peer Assessment in Adult Education

by Tara Fenwick and Jim Parsons
Department of Educational Policy Studies
University of Alberta, Edmonton, Alberta, CANADA T6G 2G5
tara.fenwick@ualberta.ca

This is an early version of the article "Peer Assessment Without Tears: Making it Work", appearing in the Journal of the Alberta Association of Adult and Continuing Education, May 1999. (also available in The Art of Evaluation: A Handbook for Educators and Trainers by Tara Fenwick and Jim Parsons, published by Thompson Educational Publishers, Toronto, 2000.)

I conceive the major contribution of evaluation to be a heightened awareness of the qualities of life so that people can become more intelligent within it. -- Elliot Eisner

Peer assessment of adult learning enjoys sporadic spins of popularity in the workplace, in post-secondary education, and in community-based programs (Black, 1993; Davie, 1997). Whether one is assembling ‘formative’, developmental feedback or summative ratings to an individual or group, peer assessment invariably is considered to be one of the many means available to collect, interpret, and communicate evaluative information.

We all know the reasons why peer assessment is such a great idea in theory. Peer assessment helps create more democratic spaces for work and education, because it interrupts the top-down power flow set in place when a facilitator or boss is sole evaluator. Peer assessment provides those necessary alternate perspectives which balance the evaluation’s bias. Peers often point out practical issues from their own experience that may be blind spots for another evaluator, and offer a smorgasbord of rich feedback for learners. Peer assessment continues to be an important dimension of educator development (Heimlich and Norland, 1994), leadership development programs (Congor, 1997) and learning organization initiatives (Senge, 1999). Besides the benefits to the person being evaluated, the activity of assessing our peers can help us as the peer evaluators develop a key learning ability: the skill to evaluate oneself and others regularly, critically and appreciatively, using defensible criteria, and to form follow-up action plans for further learning.

The Problem With Peer Assessment

But in our society, peer assessment is not a natural part of our everyday communication. We are usually anxious to sustain warmth and harmony in a relationship, and steer clear of saying or doing things which may create lasting tension and awkward relations. Many people struggle when they are forced to give critical feedback to others, and many suffer when they are told things they interpret as negative. Goleman (1998) claims that our emotional awareness of self and others, and our ability to articulate precisely what we feel and perceive, are critical dimensions of our relationships at work and at home that need much more serious attention. We don’t always know what to look for, or how to interpret what we see in order to create a reasonable judgment. We tend to avoid the whole situation if we possibly can.

Problems can also develop when the learning activity of peer assessment is combined with summative judgment. For example, instructors may incorporate a grade derived from peer assessment with a grade they have awarded to a student. Sometimes there is discrepancy between the peer rating and the instructor’s rating for various reasons: peers anxious to support one another may award ‘inflated’ grades, or award unfairly low grades (Marienau, 1994). But in adult contexts, learners’ perspectives are viewed as offering valuable input. An instructor must think carefully before simply adjusting a peer’s assessment to be more in line with the instructor’s judgments.

So, our skills as assessors of our peers are not naturally strong. Most of us need to consciously think about what to do and how, and to practice peer assessment before we can be effective enough to be truly helpful to others. Facilitators, supervisors, and community development leaders cannot simply expect people to be effective evaluators of their peers because they are told to do it. They often need help.

Four Ways to Improve Peer Assessment

This article is intended to provide some help. We show four main processes in peer evaluation, along with some ideas for helping people improve their ability to participate meaningfully in these four processes. The four are subsumed in two main activities of peers assessment: observing and communicating.

Observing

Communicating

These four processes usually unfold in situations of peer assessment such as these:

Individual -- One person provides face-to-face or written feedback for a peer, either anonymously or with full disclosure of evaluator

Group -- Together provide feedback for one individual through discussion.

Individual -- Evaluates a group he or she has worked with, through written feedback (Refer to the sample provided below: Evaluation Guide for Cooperative Groups).

Group -- Evaluates themselves as a group through discussion.

Group -- Evaluates another group, through discussion or written feedback.

Here are the four processes, each described through a series of suggested activities or discussion ideas to try with a group of learners.

1. Learning to recognize one’s role and limitations as a peer observer

Taste-Test: In our evaluation workshops, we sometimes ask people in small groups to taste-test different kinds of donuts or cookies. Each group must develop criteria, rate the items using this criteria, then present and defend their choice for ‘top pick’. This exercise invariably alerts people to the role played by personal preference and individual perception in any assessment.

Compare Ratings: Show a group of learners two or three short video clips of people performing activities demonstrating the kinds of skills the learners are trying to developing themselves. (For example, student teachers watch clips of teachers presenting information or working with students; managers watch chairs at meetings or staged samples of manager-employee interactions; employment counselors watch clips of staged interviews, etc.)

Ask viewers to jot down evaluative comments after the first clip, then share their comments. Most find that while there are some points of agreement, each viewer focuses on different aspects which influences their judgment. Our discussions also find that each viewer offers a different perspective which can alter the way one interprets what is seen.

After the first video, give learners a simple set of criteria to use, or have learners together develop a list of four or five benchmarks they consider important. Then watch all the clips, applying the criteria. The discussion following the viewing is usually rich with insight about the difficulties of observing, the dilemmas of comparing external criteria against the complexities of a particular incident, and the need for important information about context and the performer’s perspective which is lacking in this exercise, but crucial to understand what is being seen.

Read-Around Groups: To bring to the surface one’s inner sense of ‘what is good’, a Read-Around-Groups activity is helpful. People in groups quickly skim through a pile of writings by their peers, and rate each item. Then they examine their top picks to determine what traits made these the best, and what criteria they were using to decide. From these criteria they develop a rating scale that they apply to new cases. This can be adapted to any type of peer performance, by having people recall or observe samples of different work skills.

2. Learning how to observe, interpret and assess what one sees, using criteria

Note-Taking: A useful exercise when we are learning how to observe a peer’s performance is take longhand notes. First establish a focus for the observation, then simply record everything we see and hear. Then we should verify these notes with the person being observed, attending particularly to those ‘critical incidents’ upon which our assessment might be based. We should ask, Did you and I notice the same critical incidents? What did I miss in my observations that you as performer felt was important? Then we should check our interpretations with the person we observe. We ask, What did it mean for you when...? Why did you ....? And we share: This is what I thought was going on when you ....Here’s what I felt when you.....

Inter-Rater Reliability: Try having a group of people go through the process of establishing inter-rater reliability, using a particular rating scale. First have them become familiar with the items on the scale, and talk about their expectations for each item. Give them examples and benchmarks for the items. Then have them view and assess a few examples of performance, using the scale. Stop and compare their ratings. Raters present reasons for their ratings, and discuss discrepancies.. Then they view a few more samples, trying to make their ratings consistent with one another.

Ladder of Interpretation: Use the Ladder of Inference activity with a group, but make time to discuss the results and talk about the lessons in this activity for peer assessment. Chris Argyris (1993) created the Ladder of Inference exercise to help people uncover their interpretive assumptions. Jot down a brief description of a ‘critical’ incident with another person in which you were an observer or participant. Then focus on a small piece of that incident: Draw two columns lengthwise down a page. In the left column write down, in sequence, exactly what was said and done in the incident. In the right-hand column opposite each item, write how you interpreted the speech or action. Then -- and this is the hard part -- talk it through with the person you observed. Share your interpretations, and find out what the other person was thinking during the same incident. This activity immediately shows how much meaning we infer from very little observed evidence, ladling on our own expectations, assumptions, and personal understandings of what things mean.

3. Learning to express one’s judgment honestly and sensitively to a peer

Rate-the-Argument: To an issue posed in a group, have everyone decide their opinion then write down two arguments to support that opinion. Each argument is written on a separate yellow "stickie". Post the stickies on flipcharts. Now give everyone a marker pen or gummed coloured dots. They circulate around the flipcharts, reading the arguments. Each person marks the ‘best’ four arguments, using personal opinion, and jots their reason for choosing an argument to be a ‘best’. Groups re-congregate and examine which "stickies" received the most marks as ‘best’. Together discuss the criteria used to assess these arguments.

Ink-Shedding: Newman (1991) suggests you ask people to ‘free-write’ for ten or fifteen minutes , scribbling down honest ideas about a relevant topic. Depending on the situation, for example, people can write their arguments for a particular issue, their assessment of a problem, their recommendations for a solution, their response to an experience, and so on. At the end of ten minutes, have everyone pass their writing to someone else in the group, without signing it. That person reads the page, then ‘freewrites’ a personal response. The responses note areas where the readers agree or disagree, ideas that they appreciate as original, and ideas they would challenge for whatever reason. Papers are passed on to one more reader, who reads both the original and the first response, then writes a second response. The focus is always on honest personal response, and no one signs their names. After people are given time to re-read their original writing and the two responses, discuss insights and other learnings.

Writing Conferences: Have learners in pairs respond to each other’s written drafts of assignments or reports. First show people what to give feedback on. (See sample below: Peer Responses in One-on-One Writing Conference). Ask writers to state a few things for which they desire feedback. The responder restricts feedback to personal response: I liked this part because . . . I found this part unclear because . . .

Group Self-Assessment: Have people who have just completed a group activity each jot down their honest, private assessment of how the group worked. Provide a simple list of criteria before they write these assessments. Then invite the group to discuss together their individual assessments, noting areas of discrepancy and talking these through. A sample form for group assessment is provided here.

Bouquets and Beefs: Peers write a note, signing their name, providing personal response to a peer’s performance. One thing I appreciate . . . One thing I learned . . . One thing I wish had happened . . . Emphasize the need for specific description. General, vague responses are not helpful to a peer.

Practice Giving Critical Feedback: View a videoclip together of an unsuccessful performance. Note your responses/what you observed/your judgments (using simple criteria). Discuss in pairs, as well as strategies for delivering feedback. Make new pairs and practice delivering the critical feedback to the new partner. Partner provides a personal response to the peer assessor: his or her feelings during the feedback, and further actions he or she might take.

4. Learning how to hear, interpret, and act on feedback provided by others

Benefits of Peer Assessment

If you have difficulty ‘selling’ the idea of peer assessment to learners, managers, clients, colleagues, institutional gatekeepers, or other stakeholders in evaluation, here is a list of the benefits that a system of peer assessment can offer. The assumption is that peer assessment is a complement to other forms of evaluation, including self assessment.

  1. People sharpen their ability to observe both critically and appreciatively their own and others’ thoughts and actions.

  2. People become more aware of their own implicit personal biases. They begin to see how these biases influence their judging processes, both when they assess themselves and when they respond to others.

  3. People are opened to a wide variety of perspectives and observational styles in judging.

  4. People develop ability to judge what they observe more by using particular standards and criteria, and less by personal preference and emotional appeal.

  5. People become aware of complex issues in establishing defensible criteria for judging quality of work and thought.

  6. People learn how to express their evaluative opinions in ways that do not offend.

  7. People gain confidence and trust that presenting honest feedback to someone else will not necessarily be a negative experience. Thus, people learn to provide constructive feedback to one another as a regular routine.

  8. People learn how to accept constructive feedback from others, without viewing such assessment as a personal attack.

Potential Concerns of Peer Assessment

For the record, we have assembled here a list of the drawbacks and potential pitfalls of peer assessment. Where possible we have indicated some preventive measures or ‘solutions’.

  1. There are definitely risks in peer assessment. Some adults need to fine-tune their skills in diplomacy, and others need to learn accept feedback without viewing it as personal attack.

  2. Humiliation can occur if the dignity of a learner is not preserved throughout the assessment process. To avoid this, frame peer assessment within the framework of a caring, appreciative relationship. Emphasize the need to stick with observable behaviors in the here-and-now, and personal responses to these observations. Communication of feedback unfolds in a private dialogue, never in a public environment. And finally, peer assessment is understood to be confidential, between the observers, the observed, and the instructor where there is one.

  3. Power is always an issue. In an assessment situation, power accrues to the evaluator. A learner is in effect being asked to temporarily surrender to a peer as an object of scrutiny. This dynamic needs to be acknowledged openly and understood by all participants as a voluntary and temporary relation. It can be mitigated when the evaluator affirms his or her peer, offering feedback within the context of caring for a peer as a "critical friend". Both observed and observer enter assessment as an opportunity to dialogue about different experiences and perspectives, to learn from one another.

  4. Sometimes peers are afraid to be honest, perhaps because our society attaches many negative connotations to constructive criticism as being ‘overly critical’ and ‘judgmental’. The "critical friend" approach can help. Clear criteria and specific descriptions of indicators can help people learn to observe with attention to detail, and describe with precise phrases. This removes the focus from the ‘personal’, placing it on sharp observations and careful, valid interpretations of behaviours.

  5. Communication styles among people are very different. Tannen (1990) describes two main styles: indirect and direct. Direct communicators may appear abrupt and even rude to indirect speakers, while the indirect style uses so much nuance that a direct person doesn’t ‘get it’. People need to be aware of their own communication style and how it’s received and possibly misinterpreted by others. They also need to be resilient and allow elbow room for other communication styles. Check perceptions and interpretations!

  6. Peers may provide different feedback to instructors. This can be problematic if peer assessment is an important part of summative assessment for a program. Our contention here is that this problem should not emerge if learners have clear criteria (either provided or developed by themselves), models and specific descriptive indicators to help them visualize what these criteria look like, and practice in applying the criteria. The relationship dynamics of offering critical feedback in a particularly must also be recognized openly. If all of these elements are in place and there still are significant discrepancies between instructor and peer assessment, these should be discussed. It may be that instructors are not applying the same criteria in the same way as the learners. Or, there may be relational issues interfering with simple comparison of criteria to observations of peer performance.

Overall . . .

We are strong believers that the advantages of peer assessment outweigh the potential disadvantages. The key, we believe, is to help learners develop the skills of observing their peers and interpreting these observations, then communicating their perceptions with compassion and respect. Distinguish carefully between personal response and criteria ratings. Be absolutely clear which kind of feedback is requested in a particular situation, and help people develop the skills of preparing and presenting the feedback with specific description. We suggest the notion of "critical friend" is helpful to encourage peer assessment as a positive learning experience for all participants.

Evaluation Guide for Cooperative Groups

Rate your team’s effectiveness in each of the items below, using the following four-point scale:

Four - Outstanding: A consistent strength of the team

Three - Competent. Our team is usually effective in this area.

Two- Developing: We had some problems in this area, but we’re improving.

One- Weak: Help! We had problems that we were unable to solve.

______ 1. All members shared their ideas freely.

______ 2. We offered support and encouragement to each other.

______ 3. We asked each other questions to make sure everyone understood the ideas and information we were working with.

______ 4. Our group was energetic. We welcomed new ideas, showed enthusiasm, and laughed with each other.

______ 5. We questioned and criticized each other’s ideas, but we didn’t make personal attacks.

______ 6. We shared the workload fairly.

______ 7. We tried to make sure that everyone in the group was comfortable with our plans and decisions.

______ 8. We often probed for new ideas or deeper understanding by asking questions.

______ 9. We tried to explore alternate ideas before we settled on a solution.

______ 10. We were satisfied with the report our group shared with the class.

An area to improve in our future work together is____________________

______________________________________________________________________

Some specific things individuals are going to work on in this area are:______________

______________________________________________________________________

Sample: Peer Responses in One-on-One Writing Conference

The writer completes the top part, then asks the peer responder to read and respond to the sections below.

Writer

My central purpose in this writing is:

The audience I have in mind is:

The parts I would particularly like your feedback on are:

Some questions I have for you are:

Responder

From my reading of your piece, I find the main idea to be:

Here are some specific strengths I liked in your piece:

A part(s) I found a bit confusing or unclear are:

Here’s what I felt about the parts you wanted me to focus on:

Here are some responses to your questions:

References

Argyris, C. (1993).Overcoming organizational defences. Needham, MA: Allyn and Bacon.

Black, L. C. (1993). Portfolio assessment. In T. W. Banta and Associates (Eds.), Making a difference: Outcomes of a decade of assessment in higher education. San Francisco: Jossey Bass.

Congor, J. (1996). Can we really train leadership? Strategy and Business, Winter, 1996, 2-15.

Davie, L. E. (1997). Program evaluation for adults. In T. Barer-Stein and J. A. Draper (Eds.), The craft of teaching adults. Toronto: Culture Concepts.

Goleman, D. (1998).Working with emotional intelligence. New York: Bantam Books.

Heimlich, J. & Norland, E. (1994). Developing teaching style in adult education. San Francisco: Jossey-Bass.

Marienau, C. (1994). Self-assessment: An essential skill for learning and performance in the workplace. In Proceedings of the 35th Annual Adult Education Research Conference.. University of Tennessee, Knoxville, Tennessee.

Newman, J. (1991).Interwoven conversations: Learning and teaching through critical reflection.. Toronto: Ontario Institute for Studies in Education Press.

Palmer, P. J. (1998). The grace of great things: Reclaiming the sacred in knowing, teaching, and learning. The Sun, September, 1998, 24-28.

Rough, J. (1994). Measuring training from a new science perspective. Journal for Quality and Participation , October/November 1994, 12-16.

Schon, D. A. (1987). Educating the reflective practitioner: Toward a new design for learning and teaching in the professions. San Francisco: Jossey-Bass.

Senge, P. (1999). The dance of change: The challenges of sustaining momentum in the learning organization. New York: Doubleday.

Tannen, D. (1990). You just don’t understand: Women and men in conversation. New York: Ballantine Books.

Wheatley, M. J. (1994). Leadership and the new science: Learning about organizations from an orderly universe. San Francisco: Berrett: Koehler.

Note: This article is an excerpt from the book The Art of Evaluation: A Handbook for Educators and Trainers, published in 1999 by Thompson Educational Publishers, Toronto, Ontario.