Assessment reform and transforming higher education
Peter Knight
Educational Research
Lancaster University, UK
It is argued that the procedures that higher education institutions use to assess student learning are revealing indicators of quality, which is itself taken to depend upon the extent to which the undergraduate experience transforms students.
Authentic learning, authentic assessment
In curriculum studies, a distinction is often made between the planned curriculum, the delivered curriculum (what is offered to the students), and the received curriculum (what the students actually learn). It is well known that there are gaps between each of these (Fullan, 1991): the implementation gap between planning and delivery, and the learning gap between delivery and reception.
These gaps are caused by many things, not all of them amenable to technicist treatment. In this section, attention will be focused on the part played by the ways in which student learning is assessed.
Tutors, like schoolteachers, have an interest in seeing that students do as well on a course as possible. This is often judged in terms of student performance on graded assessments. It is therefore in their interests to gear their teaching to helping students to succeed on graded assessments. Of course, if the assessments are fair and reputable reflections of the well-conceived aims of the course, there is nothing wrong with this. However, for a variety of reasons, not least that some aims are hard to assess reputably (that is to say with reliability) and fairly (which is to say with validity), graded assessments are not normally like that. Typically, they give priority to assessing student performance against a limited - and arguably limiting - subset of course aims.
Unless the assessment of student learning is carefully considered at the time when programmes and modules are developed, there is a strong likelihood that assessment procedures will fail to represent the full range of programme and module goals. As a result, what is to be assessed is what gets priority in teaching and a planned-delivered gap is put in place.
Learners wish to get the best grades that are consistent with their understandings, motivation and the effort they think it reasonable to make. Assessment procedures will therefore signal to them what it is that they ought to concentrate upon, suggesting what the real aims of the module or programme are and how they might try to show that they are meeting those 'real' aims. Learners will tend to focus upon elements of a programme that are subject to high status, usually summative, grade-bearing assessment. In some cases, then, they will take a strategic approach, even if they appreciate that understanding the material might be better served by a different learning approach. The system of assessment can encourage surface approaches to learning (Entwistle, 1994) and be a source of the delivered-received gap.
Authentic learning needs authentic assessment. Where there is a mismatch between programme goals, teaching arrangements and assessment procedures, it will be the assessment procedures that exert the defining force. The assessment arrangements will define the curriculum in the eyes of the learners.
A view of quality in higher education
There are many views of what counts as quality in higher education. However, state-sponsored views have tended to dwell upon anything but the quality of learning. For example, when judgements about quality are based upon performance indicators there is the problem that these indicators tend, at best, to be proxy measures of student learning, raw grades being such an indicator. Moreover, state-sponsored quality measurement tends to be characterised by external quality monitoring, a high-stakes, externally-run operation. Unsurprisingly, each institution approaches the exercise preoccupied with the need to give a good account of the institution rather than by a drive for continuous quality improvement. Just as it is a commonplace that the formative and the summative purposes of assessment are not easily reconciled, so too there are tensions between the accountability function of quality measurement and the improvement goal. Trow (1993) has noted that the former runs the risk of encouraging a compliance culture, in which the institution seeks to maximise its ratings on the performance indicators, often without changing the underlying processes and sometimes at the expense of attention to the core activities of learning and teaching. Third, in many cases, attempts to improve quality have been associated with attempts to cut costs, not with a concern to improve.
Harvey and Knight (1996) argue that higher education must concern itself with transforming the life-experiences of students, by enhancing or empowering them. Only this version of quality is directly connected to the key business of higher education, which is student learning. The key question becomes one about how higher education might support the process of student transformation. The subtext, explored here, is the role that assessment plays in transformation.
Transformation and assessment
What might be the goals of a quality higher education in which transformation was the guiding principle?
There is a body of research into what stakeholders in higher education value that suggests that an emphasis on 'low level' cognitive achievements is undesirable (Harvey and Knight, 1996). Employers, to take just one group of stakeholders, are not looking for 'trainees' when they employ graduates but people equipped to learn and to deal with change. They want rounded, intelligent people, with a broad grounding in a subject and an understanding of core principles. Employers want graduate recruits who are adaptable and flexible; who can communicate well and relate to a wide range of people; who are aware of, but not indoctrinated into, the world of work and the culture of organizations; who have inquiring minds, are willing and quick to learn; and who are critical, can synthesize, and are innovative. In terms of outcomes, there is little here that is contrary to traditional academic values and expectations, nor that is conflict with a transformative view of quality.
It follows, then, that a programme that exhibited quality in the sense of being fit for employers' purposes would have a wide range of cognitive and non-cognitive goals. Student learning would be assessed according to the degree to which they met the specific criteria that were derived from such goals. Consequently,
- Assessment would become openly criterion-referenced.
- A test of the quality of a programme is the degree to which the assessment arrangements document learners' achievements on the full range of avowed goals.
- Since those goals will usually pervade the whole programme, there should be signs that they are assessed throughout the programme. In a quality programme, several assessments of group work, for example, across the duration of the programme, will tell students that skill in group work is seen as an important goal, not as a marginal concern.
- Repeated assessments of any competence enhance the reliability of the assessment.
- A range of goals implies that there will be a range of assessment techniques in place because many of the things valued by employers, for example, cannot be satisfactorily measured by paper-and-pencil assessments of individual performance. Some demand self-assessment, others co-assessment, and a set of written, oral and performance assessments will be necessary to document achievement across the range of programme goals (Brown and Knight, 1994).
Summative, criterion-referenced assessment
Few summative assessment systems attempt to assess such a breadth of goals, not least because of pervasive reliability problems (see, for example, Birenbaum, 1996). Reliability, as traditionally understood, is maximised by repeated measures of clearly-specified competences. However, learning in higher education tends to involve particular engagements with poorly-specified (and poorly-specifiable) competences. The belief that criteria are transparent, that is to say that they are plainly explicit about what constitutes satisfactory performance, is mistaken. Criterion statements tend to include words that need interpretation, such as 'understand', 'know', 'confidently' and 'greater'. It might be argued that they should not: but they do. These words need to be given a practical expression. What, in this programme is to count as understanding, knowledge or confidence? A second issue is related to the first. All criteria take on meaning in the contexts of a body of knowledge and of a set of tasks. It is well known that where a person might be able to succeed on one task, another that appears to demand the use of similar abilities proves to be too hard. What sort of tasks and what sort of knowledge are to be associated with the application of these criteria? Besides, it was remarked above that summative assessment data typically describe only a portion of the transformation that may occur in the undergraduate experience. It may not be the most important portion. So, emphasising reliability is not simply chimerical but may compromise the 'higher' in higher education. Emphasising validity, as traditionally understood, may compromise the reliability of summative grades and degree awards.
Some resolution of these problems may be achieved by saying that shared understandings of criteria have to be constructed. This has often happened tacitly, where the operational meanings of criteria have become known to faculty and students by inference from the tasks set and the standards applied. This is neither fair nor efficient. It is preferable that both staff and students analyse samples of work using the criteria and in this way construct shared understandings of what the criteria are to mean. In a quality programme, where summative assessments have high status, occasions for constructing this meaning will be inbuilt and prominent. The distinction between assessment and learning, a distinction that is both appealing and misleading, may then prove less easy to make (Birenbaum, 1996, Harvey and Knight, 1996).
However, some of the problems can be attenuated by placing formative assessment above summative assessment, thereby reducing the imperative to guarantee reliability to outsiders.
Formative assessment
At the very least, the foregoing discussion implies that learners will need to have opportunities to practise understanding and applying assessment criteria: they will need formative, ungraded assessments. This makes it clear that formative assessment is centred upon the improvement of learning. Key features are:
- it involves giving learners feedback that can be used to improve the next performance;
- it may not, and need not, involve grading;
- it is therefore directly connected to the transformational notion of quality, since it is about improving student learning;
- there is no reason why tutors have to conduct this assessment; co-, peer- and self-assessment are possible and, arguably, necessary;
- it tends not to be high stakes assessment;
- where it is associated with a dialogue between learner and teacher, the initial reliability of the assessment need not be high: what matters is that a basis is laid for a conversation;
- however, if there is to be communication between assessor and assessed, there needs to be an understanding of the meaning of the relevant criteria. This is not the same as making assessment criteria available to learners: the operational meaning of the criteria must be understood with formative assessment, as with summative assessment.
Here the main problem with formative assessment is seen. It connects directly with the idea of quality as transformation but looks to be absurdly impracticable, not least because of the demands on faculty time.
There seem to be several solutions to this problem. One is to assess a group's understanding or competence, perhaps by using a method such as Classroom Assessment. This is the term given by Angelo and Cross (1993, pp. 4-5) to a set of activities 'to help teachers find out what students are learning in the classroom and how well they are learning it ... the teacher is not obliged to share the results of Classroom Assessment with anyone outside the classroom ... Classroom Assessments ... are almost never graded and are almost always anonymous'. In other words, here there is a direct connection between one form of formative assessment and the improvement of learning quality.
Arguably more important is the idea of encouraging [formative] self-assessment (Boud, 1995) and [formative] peer-assessment (Brown and Knight, 1994). Again, there is a close link between these approaches and the enhancement of learning. Interestingly, the student who has learned something of self-assessment has not only been transformed in some measure but has also gained a powerful tool for future transformation.
Computer-based learning systems also have the power to reduce the cost of formative assessment.
Conclusion - assessment systems
A view of quality in higher education has been proposed. If it is accepted, then the quality of a programme or institution can be judged, in some measure by whether there is evidence of the presence of conditions that are necessary for successful, transformative learning (Erwin and Knight, 1995). The ways in which student learning is assessed and the purposes of assessment have been identified as powerful, indirect indicators of that quality.
These include:
- consistency between programme goals, teaching, learning and assessment procedures;
- the use of a wide range of assessment modes and methods;
- provision of opportunities for the development of shared meanings about assessment criteria;
- plentiful formative assessment, both at the classroom level and at the individual level;
- evidence of self-, peer- and co-assessment;
- a concern for depth of understanding in preference to breadth of knowledge;
- provision of opportunities for all faculty to learn about ways of teaching that are consistent with an emphasis on student learning and on the tranformative potential of higher education.
Conclusion - quality and transformation
Review of these indicators is not best done within the state-sponsored quality assurance systems that are now commonplace. Preference has been given to the development of a continuous quality improvement (CQI) approach. CQI must, it has been said, be driven from two directions: bottom-up, as with Classroom Assessment, and top-down, as with the institutional adoption of systemic approaches to teaching and learning (Biggs, 1993). The key is to encourage and ensure the former, whilst developing a sensitive but effective external monitoring process, to achieve simultaneous loose-tight coupling (Orton and Weick, 1990) in the university's organisation. This approach entails the development of a quality-improvement culture that is contingent upon trusting the professionalism of the workforce. One indicator of that professionalism is the approaches that are taken to the assessment of student learning.
How we assess students says far more about us and what we value than we might wish.
References
Angelo, T. A. and Cross, K. P. (1993). Classroom Assessment Techniques: A handbook for college teachers. Second edition. San Francisco: Jossey Bass.
Biggs, J. B. (1993). From theory to practice: A cognitive systems approach. Higher Education Research and Development, 12(1), 73-85.
Birenbaum, M. (1996). Assessment 2000. In M. Birenbaum and F. Dochy (Eds), Alternatives in Assessment of Achievement, Learning Processes and Prior Knowledge. Norwell MA: Kluwer.
Boud, D. (1995). Enhancing Learning through Self Assessment. London: Kogan Page.
Brown, S. and Knight, P. (1994). Assessing Learners in Higher Education. London: Kogan Page.
Entwistle, N. (1994). Recent Research on Student Learning and the Learning Environment. Paper presented to the International Symposium Independent Study and Flexible Learning. Cambridge, 6 September.
Erwin, T. D. and Knight, P. T. (1995). A transatlantic view of assessment and higher education. Quality in Higher Education, 1(2), 179-188.
Fullan, M. (1991). The New Meaning of Educational Change. London: Cassell.
Harvey, L. and Knight, P. (1996). Transforming Higher Education. Buckingham: Open University Press.
Orton, J. D. and Weick, D. (1990). Loosely-coupled systems: A reconceptualization. Academy of Management Review, 15(2), 203-220.
Trow, M. (1993). Managerialism and the academic profession: The case of England. Paper presented to the Quality Debate Conference, Milton Keynes, 24 September.
| Please cite as: Knight, P. (1996). Assessment reform and transforming higher education. Different Approaches: Theory and Practice in Higher Education. Proceedings HERDSA Conference 1996. Perth, Western Australia, 8-12 July.
http://www.herdsa.org.au/confs/1996/knightp.html |
[ HERDSA ]
[ Proceedings Contents ]
This URL: http://www.herdsa.org.au/confs/1996/knightp.html
Created 7 May 2002. Last revision: 24 May 2002.
© Higher Education Research and Development Society of Australasia Inc