The initiative was well received: the 339 students organised themselves within a few days into 86 working teams, most with four members. All had access to information on the tasks to be carried out and to the planning questionnaire. In addition, there were 59 entries in the Virtual Campus forum and 1909 views: 27 questions on PPT/video content, 13 on tools and resources external to Moodle, 9 on Moodle tools, 7 on timing and deadlines, and 2 on general work guidelines.
A total of 318 students (91.4%) attended the first face-to-face feedback session. After being shown how to provide quality feedback and how to evaluate the PPT documents correctly, each student performed the peer review, checking compliance with the more formal aspects, namely those related to the first three quality criteria. In this way, the students were able to get to know each other personally and establish a dialogue to exchange views on these format requirements.
During the second face-to-face feedback session attended by 314 students (90.2%), peer review was focused on information content A, B, C and D. Triangular exchanges were established between the assessor students, the assessed students and teachers aimed at reviewing the depth and thoroughness of the contents of the PPT, according to quality criteria 4, 5, 6 and 7.
Concerning RQ1 (What is the focus of the comments and/or proposals for improvement made by the students in a peer assessment process of a task?), a large majority of the students carried out the peer assessment, first of the PPT in Workshop 1 (92.9%) and then of the video created from the improved PPT (98.2%). The analysis of the comments allowed the aspects that, according to the students, should be improved to be categorised against the assessment criteria of the work. The suggestions for improvement affected all the quality indicators, but to varying degrees, as shown in Table 3.
Table 3
Percentage of responses per task and assessment criteria
Criteria | Task | Responses, N (%)
Criterion 1: Level of preparation, technical quality of the product and format of bibliographical references | Initial PPT format | 266 (81.6%)
 | 1st video version | 215 (62.0%)
Criterion 2: Attracting interest, structuring information, and length of presentation | Initial PPT format | 133 (40.8%)
 | 1st video version | 150 (43.2%)
Criterion 3: Academic style and terminology | Initial PPT format | 124 (38.0%)
 | 1st video version | 64 (18.4%)
Criteria 4 to 7: Content A, B, C and D | Initial PPT format | 185 (56.7%)
 | 1st video version | 131 (37.8%)
No comments for the improvement of the task | Initial PPT format | 22 (6.7%)
 | 1st video version | 62 (17.9%)
With regard to the peer assessment of the initial version of the PPT, the comments pointed to improvements not only in the format and structuring of the deliverables (criteria 1, 2 and 3), particularly the aspects covered by Criterion 1, but also in the information content (criteria 4 to 7).
Regarding the quality of the videos that were subsequently designed, namely the comments from Workshop 2, the same points were highlighted, although in general there were fewer of them, which could indicate that the suggested corrective actions had been undertaken.
Based on the comments on the quality of the products created, the different aspects to be improved within each established quality criterion were categorised. Figure 3 shows the results of the analysis concerning the initial PPTs and the corresponding videos, respectively.
Regarding the technical quality and format of the PPT (Criterion 1), most of the comments for improvement mentioned deficiencies in the format of the cover page (56.1%) and in the bibliographical references (53.4%). Regarding Criterion 2, reviewers noted that the structuring of the information needed improvement in 17.5% of the PPTs and that the number of slides had to be adjusted to comply with the required timing in 22.1% of cases. Spelling, grammatical and typographical errors were mentioned in 35.9% of the documents, and inappropriate or erroneous terms were detected in 5.5% of the deliverables (Criterion 3). Finally, the constructive criticism of the content (criteria 4 to 7) indicated that students had assimilated many of the concepts and exercised their learning skills. However, for a quarter of the PPTs there were comments calling for greater depth and thoroughness of the information to aid understanding.
Regarding the videos generated after the improvement of the PPTs, most of the comments on their technical quality (Criterion 1) pointed to the need to improve the sound and/or the voice-over in 46.2% of the products. Regarding Criterion 2, the need to adjust the voice-over timing and/or the number of slides was noted in 32.9% of the cases. The other quality aspects of the videos were largely improved compared with the first peer review of the PPT formats, thanks to the earlier constructive criticism. In particular, the percentage of comments concerning the format decreased markedly (from 60.0% to 8.1%). Similarly, there were fewer suggested improvements to the bibliographical references (from 55.5% to 18.2%), to the academic style (from 35.9% to 14.1%) and to all the information content (A, B, C and D).
As for RQ2 (What is the students’ perception and assessment of the evaluations given and received in a peer-feedback process?), each student was asked to write a reflection on the feedback in each of the two workshops and to grade, on a scale of 1 to 5 (1 being very low quality and 5 high quality), both the feedback given as assessor and the feedback received as the one assessed. NR (no response) was recorded when a student had not acted as assessor/assessed. A total of 326 students provided both ratings. Figures 4 and 5 show the ratings of the quality of this feedback. In general, the comments from the PPT revision were rated highly, and the scores for feedback given and received were more balanced in the second peer assessment.
The difference between the mark for the feedback given and for the feedback received was also determined separately for each student (Fig. 6). In both peer assessments, around 60% of the students thought that the quality of the feedback given and received was similar. In contrast, in the first assessment 26.0% thought they had provided poorer-quality feedback than their peers, a view shared by only 14.3% of those peers, whereas in the second assessment opinions converged on this point, at around 18.6%–18.9% of the cases.
Responding to RQ3 (Are there differences between teachers’ and peers’ assessments?), the scores with which each student rated the two products created by the peer group were analysed, using the A to D marking scale already mentioned. Table 4 shows the frequency distribution of the scores for the PPTs and for the corresponding videos. It also includes the teachers’ rating of the final versions of the videos on the same scale, but derived from scores from 0 to 10.
In the two peer assessments the products were predominantly rated between “good” and “excellent”, with more than half rated B+ (“rather high”). No product was rated “insufficient”. In the first assessment 3.4% of the students, and in the second 0.9%, preferred not to evaluate their peers.
Table 4
Frequency distribution of the products’ assessments.
| NC Not assessed | D Insufficient (< 5.0) | C Sufficient (5.0–6.4) | B- Rather low (6.5–7.4) | B Good (7.5–8.4) | B+ Rather high (8.5–8.9) | A Excellent (≥ 9.0) |
First peer assessment | 3.4% | 0% | 0.6% | 3.7% | 19.0% | 52.1% | 21.2% |
Second peer assessment | 0.9% | 0% | 0.3% | 2.0% | 20.5% | 51.3% | 25.1%
Teachers’ assessment | 0% | 0% | 0% | 0% | 33.3% | 29.2% | 37.6% |
Overall, Table 4 shows that the scores of the products improved thanks to the corrections of the previously delivered versions, to the benefit of the quality of the final product. Thus, none of the 86 final versions of the videos formally assessed by the teachers was rated below “good” (7.5 out of 10). The mean score was 8.7 ± 0.7, with partial scores of 8.7 ± 0.8, 8.9 ± 0.7, 8.8 ± 0.8 and 8.7 ± 0.7 for quality criteria 1, 2, 3 and 4 to 7, respectively. The most notable remaining deficiencies in the final versions were the incorrect use of specific terminology (41.9%), errors in galenic concepts (39.5%) and speech/locution defects (25.6%), as shown in Fig. 7.
On the other hand, after the formal assessment, 107 student assessors (31.6%), the so-called OVR (overrated) group, were found to have over-assessed the quality of the first versions of their peers’ videos by one or two grades on the A to D scale. Tables 5 and 6 compare the academic performance of the assessors/assessed of the OVR group (n = 107 students) with that of the rest of the students (REF, reference group; n = 232 students), differentiating within each population the subgroup of assessors and the subgroup of those they assessed. The tables show the results of the comparison of means for unpaired data using a parametric treatment (Student’s t-test), carried out to detect possible significant differences between the scores obtained by the students in the four subgroups; Cohen’s d was calculated to measure the effect size. This analysis did not provide conclusive results relating the academic performance of the OVR group’s assessors to their overrating of the videos they assessed.
Indeed, according to the teachers’ assessment, the quality of the final versions of the videos submitted by the assessed of the OVR group (mean mark of 8.25 ± 0.57) was significantly lower (p4 and p1 < 0.05; d4 and d1 > 0.8) than that of the products created by the assessors of the same group (8.75 ± 0.59) and by the assessed of the reference group (mean mark of 8.95 ± 0.60). These lower scores were related to serious errors in terminology and in fundamental concepts of galenic pharmacy that had not been corrected, even though these deficiencies had been mentioned in the different rounds of feedback. It is worth noting that no significant differences were found between the final marks for the subject of the assessors and the assessed in the OVR and REF groups, the average mark of the four subgroups being very similar, at around 7.1 (p5, p6, p7 and p8 > 0.05; d < 0.5).
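For readers who wish to check the effect sizes, both statistics can be recomputed from the summary data alone. The following Python sketch assumes the standard pooled-SD form of Cohen’s d and the equal-variance Student’s t statistic; applied to the fourth comparison of Table 5 (students assessed by REF vs. by OVR), it lands close to, though not exactly at, the reported d4 = 1.199, presumably because the published value was computed from unrounded data.

```python
import math

def pooled_sd(s1, n1, s2, n2):
    """Pooled standard deviation of two independent samples."""
    return math.sqrt(((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2))

def cohens_d(m1, s1, n1, m2, s2, n2):
    """Cohen's d for two independent groups (pooled-SD form)."""
    return (m1 - m2) / pooled_sd(s1, n1, s2, n2)

def students_t(m1, s1, n1, m2, s2, n2):
    """Unpaired Student's t statistic (equal-variance form)."""
    sp = pooled_sd(s1, n1, s2, n2)
    return (m1 - m2) / (sp * math.sqrt(1 / n1 + 1 / n2))

# Summary statistics from Table 5, fourth comparison:
# assessed by REF (8.95 +/- 0.60, n = 232) vs. assessed by OVR (8.25 +/- 0.57, n = 107)
d = cohens_d(8.95, 0.60, 232, 8.25, 0.57, 107)
t = students_t(8.95, 0.60, 232, 8.25, 0.57, 107)
print(f"d = {d:.3f}, t = {t:.2f}")  # large effect (d > 0.8); |t| far above the 5% critical value
```

With these rounded inputs, d comes out at about 1.18 and t at about 10, i.e., a large effect that is clearly significant at p < 0.05 for 337 degrees of freedom.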
Table 5
Differences in the videos’ formal scores (given by the teachers) between the assessors and assessed of the OVR and REF groups.
Comparison of two populations | Average of the formal scores of the videos (mean ± SD) | Student’s t (p value) | Cohen’s d |
Peer assessors of the group OVR | 8.75 ± 0.59 | p1 < 0.05 | d1 = 0.861 |
Students assessed by the group OVR | 8.25 ± 0.57 |
Peer assessors of the group REF | 8.74 ± 0.69 | p2 < 0.05 | d2 = 0.323 |
Students assessed by the group REF | 8.95 ± 0.60 |
Peer assessors of the group OVR | 8.75 ± 0.59 | p3 = 0.897 | d3 = 0.016 |
Peer assessors of the group REF | 8.74 ± 0.69 |
Students assessed by the group OVR | 8.25 ± 0.57 | p4 < 0.05 | d4 = 1.199 |
Students assessed by the group REF | 8.95 ± 0.60 |
Table 6
Differences in the final mark for the subject (given by the teachers) between the assessors and assessed of the OVR and REF groups.
Comparison of two populations | Average of the final mark for the subject (mean ± SD) | Student’s t (p value) | Cohen’s d |
Peer assessors of the group OVR | 7.10 ± 1.08 | p5 = 0.732 | d5 = 0.047 |
Students assessed by the group OVR | 7.15 ± 1.05 |
Peer assessors of the group REF | 7.00 ± 1.07 | p6 = 0.686 | d6 = 0.038 |
Students assessed by the group REF | 7.04 ± 1.06 |
Peer assessors of the group OVR | 7.10 ± 1.08 | p7 = 0.426 | d7 = 0.093 |
Peer assessors of the group REF | 7.00 ± 1.07 |
Students assessed by the group OVR | 7.15 ± 1.05 | p8 = 0.374 | d8 = 0.104 |
Students assessed by the group REF | 7.04 ± 1.06 |
In relation to RQ4 (What competencies do students feel they develop from a peer assessment experience?), a total of 138 students responded to the survey made available on the Virtual Campus at the end of the activities of the didactic sequence. The mean and standard deviation of the scores assigned by the students were determined from the responses obtained. The tables below show the values obtained from the role of assessor (Table 7) and from the role of the assessed (Table 8).
Table 7
Scores attributed to the experience from the role of assessor.
The assessment of the tasks of my colleagues (being an assessor) has allowed me to... | Mean | SD |
Rethink the objectives of the assessed task | 4.07 | 0.91 |
Have a more critical view of the work I have done | 4.24 | 0.71 |
Involve myself more in my learning process | 4.01 | 0.94 |
Realize the processes that I need to improve in my learning process | 4.23 | 0.88 |
Realize the processes that I must maintain and enhance in my learning process | 4.17 | 0.97 |
Contribute to the development of the “learning to learn” competence | 3.99 | 0.95 |
Learn how to give feedback | 4.21 | 0.84 |
Understand the evaluation criteria of the assessed task | 4.27 | 0.79 |
Global rate | 4.15 | 0.88 |
Note. The rating scale was from 1 to 5, where 1 = strongly disagree and 5 = strongly agree.
Table 8
Scores attributed to the experience from the role of assessed.
Receiving the opinions, evaluations and advice of my colleagues (being the one assessed) has allowed me to... | Mean | SD |
Rethink the objectives of the assessed task | 4.21 | 0.87 |
Have a more critical view of the work I have done | 4.30 | 0.86 |
Improve my own work based on the opinions/assessments and advice of my colleagues | 4.42 | 0.78 |
Realize the processes that I need to improve in my learning process | 4.12 | 0.90 |
Realize the processes that I must maintain and enhance in my learning process | 4.12 | 0.98 |
Contribute to the development of the “learning to learn” competence | 4.04 | 0.93 |
Learn how to give feedback | 4.08 | 0.94 |
Involve myself more in the learning process | 4.14 | 0.92 |
Global rate | 4.18 | 0.90 |
Note. The rating scale was from 1 to 5, where 1 = strongly disagree and 5 = strongly agree.
The responses of the participating students were also analysed in relation to their perception of the overall peer assessment experience. As shown in Table 9, the global mean was of the same order as in the results by assessor/assessed role (4.16 ± 1.07). Nevertheless, a notably higher score stands out for the item “I am able to self-assess the quality of my work” (4.50 ± 0.87) and a lower one (3.74 ± 1.30) for the item “I discovered strategies, competencies or skills that I could apply in other contexts”.
Table 9
Scores attributed to the peer assessment experience.
Through the peer assessment experience... | Mean | SD |
I discovered strategies, competencies or skills that I could apply in other contexts | 3.74 | 1.30 |
I have become aware of the actions and processes that can allow me to improve my learning with more autonomy, efficiency and understanding in future tasks | 4.00 | 1.25 |
I am able to represent the objectives, the evaluation criteria and the processes to plan and carry out a quality task | 4.26 | 1.06 |
I am able to self-assess the quality of my work | 4.50 | 0.87 |
Global rate | 4.16 | 1.07 |
Note. The rating scale was from 1 to 5, where 1 = strongly disagree and 5 = strongly agree.
Finally, in relation to RQ5 (What is the students’ perception of the benefits and difficulties that a peer assessment process may have, before and after participating in a peer-feedback-based experience?), the responses of the participating students were analysed at two different times: i) before starting the experience (PRE): What benefits and difficulties do you think the peer assessment process may have?; and ii) at the end of the experience (POST): Now that you have participated in a peer assessment process, what benefits and difficulties do you think this type of process may have? As shown in Table 10, the following verbs were extracted from the first quartile of responses (N = 308) to the PRE question: learn, assess, have, improve, do, believe and see. It is worth mentioning that the verb “to have” is used in many of the responses to express the benefits or difficulties that the experience may entail; it appears linked to ideas such as “…learn to have an assessment criterion”, “have a better internalisation of information” and “have an external view of the work”, among others. The verb “believe”, by contrast, should be disregarded, given that it is used in all cases simply to introduce an opinion or perception about the experience (“I believe that”).
In the first quartile of the responses (N = 164) to the POST question, the following verbs were extracted: improve, believe, learn, see, do, have and know. For the same reasons as at the PRE moment, the verb “believe” should be disregarded. The verb “to know” is new to this list; it is linked to ideas such as “to know how to identify weak points”, “to know the opinion of my colleagues”, “to know the assessment criteria of the task”, “to know how to admit mistakes in one’s own work”, “to know how to evaluate other people’s work in a tactful way” and “to know how to rectify and improve”, among others.
As for the verb “to see”, at both the initial (PRE) and final (POST) moments it is used to express ideas linked mainly to the ability to consider other points of view and to identify errors.
Table 10
First quartile of verbs present in the responses before the start of the experience
Verb | Frequency | Percentage |
learn | 71 | 5.80 |
assess | 66 | 5.39 |
have | 43 | 3.51 |
improve | 42 | 3.43 |
do | 32 | 2.61 |
believe | 29 | 2.37 |
see | 26 | 2.12 |
Comparing Tables 10 and 11, the verb “to improve” has a greater presence after the experience. In this respect, one student responded at the POST moment:
"Thanks to peer assessment, it can allow one to know how to identify weak points in order to work to improve them. Some difficulties that may arise is knowing how to identify the mistakes made.”[1]
Before starting the experience, this same student had responded: “Knowing how to value the work done by one’s companions.” [2]
In the case of another student, this change is also evident. At the POST moment, he/she mentioned:
“As benefits I can highlight the improvement of critical thinking, the self-assessment of the work done, accepting and increasing the external work proposals of one’s companions. Although sometimes being objective and "hard" is difficult.”[3]
While at the PRE moment, this same student had mentioned: “Difficulties: Not having a good criterion to evaluate or being subjective. Benefits: Being more permissive and scoring better.” [4]
Table 11
First quartile of verbs present in the responses at the end of the experience
Verb | Frequency | Percentage |
improve | 53 | 5.38 |
believe | 33 | 3.35 |
assess | 33 | 3.35 |
learn | 30 | 3.04 |
see | 29 | 2.94 |
do | 27 | 2.74 |
have | 27 | 2.74 |
know | 20 | 2.03 |
Another skill considered relevant to the development of critical thinking is the ability to “be aware” and the capacity to consider other points of view. We therefore searched for the word “aware” and found that it did not appear at all in the responses at the PRE moment, whereas three such responses were found at the POST moment. Regarding this capacity to “be aware” and to consider other points of view, the following response from one of the participating students after the end of the experience illustrates how the peer-feedback process contributed to these aspects:
These types of processes are very beneficial, as they allow you to gain new insight and a fresh perspective on the work. I believe that this allows us to perfect the work done as well as improve our abilities when doing an assignment, since we can identify small details that perhaps we would never think of looking at. On the other hand, I also think that it is very important to evaluate our companions because it allows us to look at the work done with different eyes. In other words, before evaluating my peers, I wasn't fully aware of what was being evaluated with this work. However, after looking at his work and checking that all the requirements have been met properly, I have been able to identify some errors in my own work. However, while I have been able to observe the multiple benefits related to peer review, I consider it to be a rather tedious process and can be very cumbersome at times. That is why I think it should be a shorter process, rather than such an elaborate one.[5]
[1] The quote has been translated from the following original: “Gràcies a la avaluació entre iguals et pot permetre saber identificar els punts febles per tal de treballar per millorar-los. Algunes dificultats que es poden presentar és saber identificar els errors comesos.”
[2] The quote has been translated from the following original: “Saber valorar el treball realitzat pels teus companys.”
[3] The quote has been translated from the following original: “Com a beneficis puc destacar la millora de l’esperít crític, l’autoavaluació de la feina realitzada, aceptar i incrementar al treball propostes externes d’altres companys… tot i que de vegades ser objectiu i “dur” és difícil.”
[4] The quote has been translated from the following original: “Dificultats: No tenir un bon criteri per avaluar o ser subjectiu. Beneficis: Ser més permissius i puntuar millor.”
[5] The quote has been translated from the following original: “Aquest tipus de processos són molt beneficiosos, ja que permeten obtenir una nova visió i una nova perspectiva del treball. Crec que això ens permet perfeccionar el treball realitzat, així com millorar les nostres capacitats quan fem un treball, ja que podem identificar petits detalls que potser mai pensaríem a mirar. D'altra banda, també crec que és molt important avaluar als nostres col·legues perquè ens permet mirar el treball amb diferents ulls. En altres paraules, abans d'avaluar els meus companys, no era plenament conscient del que s'estava avaluant amb aquest treball. No obstant això, després d'examinar el seu treball i comprovar que s'han complert degudament tots els requisits, he pogut identificar alguns errors en el meu treball. No obstant això, encara que he pogut observar els múltiples beneficis relacionats amb la revisió per homòlegs, considero que es tracta d'un procés bastant tediós i a vegades pot resultar molt enutjós. Per això crec que hauria de ser un procés més curt, en lloc d'un procés tan elaborat.”