Help end medical misogyny. Sign our petition.

Help end medical misogyny.
Sign our petition.

Sign the petition

Please or to access all these features

Higher education

Talk to other parents whose children are preparing for university on our Higher Education forum.

Should universities bring back in-person exams to tackle AI?

83 replies

Jaxx · 07/06/2026 12:26

From the Speccie:
Why isn’t Durham University taking AI cheating seriously?

At Durham University, I have been Chair of the Board of Examiners for Philosophy since 2016. Last week I resigned, because I feel that it is my responsibility to raise a vital issue in higher education, one whose true significance is not understood. The existence of a crisis requiring immediate action is not generally recognised. I am not blaming the deans and pro-vice-chancellors. I want to hold the appropriate figures to account: the Vice-Chancellors.

Durham University is a beacon of excellence in the UK university sector. I wish to maintain standards in its top-rated Philosophy department. The issue I am addressing affects students past, present and future, in many leading British universities. It involves a crime that is not victimless. It is this: the lazy student cheats with professional-grade versions of AI chatbots such as ChatGPT or Claude, and gets a first-class result. The hard-working student uses no bots, or a non-professional bot honestly, thinking and writing for themselves, but gets a 2:1. That’s not fair. Yet Vice-Chancellors – to mix metaphors – are sitting on their lavishly-remunerated backsides and adopting the ostrich position.

Let me explain how the university marking system works, at least in the UK. Until Covid, there was a mix of assessment by sit-down exam and continuous assessment. During Covid, this was replaced, for understandable reasons, by “at home” exams done on computer – to the relief of academics who no longer had to decipher student handwriting. After Covid this continued, and worked satisfactorily. A system called Turnitin would check for plagiarism by scanning essays for text compiled from published sources, and it was hard to cheat.

That system has been subverted by ChatGPT and similar AI tools. These apps now work very well for producing university-style essays and essay components, especially in the professional version which many Durham students can afford. (You see that there is discrimination already against working-class students. What a surprise.)

A return to sit-down exams is the obvious solution. Universities say that these are financially impractical for various reasons. I doubt this, but in any case it is too late at Durham and elsewhere to implement sit-downs exams this year. (They should have been implemented a year ago.) So in response to the immediate crisis I must suggest sticking plaster solutions. I am old enough to recall the introduction of anonymous marking, which happened because research showed that female students were discriminated against. Now, when the alternative is what I call a chaotic system of guesswork, it may be our least-worst immediate alternative.

AI, in contrast, is now often impractical to detect with enough reliability to meet the high standards of proof required for accusations of cheating. Perhaps there is no ultimate alternative to sit-down exams – other options such as vivas are very time-consuming. My suggestion of abandoning anonymous marking seems a reasonable sticking plaster. We need an immediate response to the unfairness in the coming exam season. Yet most of my colleagues don’t seem to understand how it is fairer.

The buck stops with VCs, who should be exercising leadership. Obviously students who cheat are behaving badly – but the point is that the system allows them to do so with ease. Students have always cheated, but now they know they can do so with impunity.

I have been an academic since 1988, and worked at Durham University since 1991. I’ve been very lucky to get a job at Durham – and privileged. The students are excellent, the university is well-known and generally well-run. Any student who gets a place here will be both excellent academically, and well-set for future employment. Academic work is the only full-time employment I’ve had, after a series of temporary jobs as a student, and taking a PGCE course at Moray House Edinburgh (Primary). But now, shortly before retirement, I find that the skills I’ve developed as a marker – essential to my teaching role – aren’t able to be put to their proper use. Much effort is being expended by academics in working out how to identify the improper use of AI. But this vain pursuit is becoming very difficult and may soon be impossible. AI “tells” that remain are being aggressively stamped out by AI companies. One cannot mark essays under the Covid system, in the era of ChatGPT. Yet the university leadership is still in the ostrich position.

I was on research leave for the first term this academic year; Durham still operates a term system. Since then I have chaired one disciplinary panel for AI cheating, detected by the junior colleague who marked it. He noticed that the referencing contained tell-tale signs of AI “hallucinations”. This AI “tell” can prove misuse, as has happened in a number of high-profile cases. But AI makers have reduced its frequency, and cheaters can cover their tracks by minimally checking or omitting references. With existing tools, AI cheating is almost impossible to prove except by student confession. Stylistic cues, incongruous sophistication or maturity of writing for an undergrad cannot reliably show that AI has been misused, even when the marker is suspicious. Only with the last batch of essays which I started marking in April, did the true horrors of the situation become apparent to me. The system was broken and needed immediate emergency repair.

It should be clear that my complaint is not against Durham University. Few British universities have re-introduced sit-down exams extensively. Many departments have never had anonymisation; music, for obvious reasons. I know from bitter experience that one cannot play the piano with a paper bag over one’s head. Universities seem unable to respond in a timely manner to the current crisis. Academics are grumbling. Someone surely has to blow the whistle?

I guess that few whistle-blowers are enthusiastic. I have never engaged in a lawsuit, or been the victim of one, on any matter. A cursory Google search reveals that UK whistleblower laws protect my salary and pension, and I’ll get a hundred quid for this article. So I’m prepared to light the blue touchpaper and retire, in both senses.

It has been a very difficult decision to get this article published. Colleagues will be upset, and students disconcerted. But should they continue in ignorance? There needs to be quick action – something that large institutions such as universities have never been known for. Think of Air Chief Marshal Dowding in 1940, confronted with massive losses of planes in France. He goes to Winston Churchill and says that we cannot send any more Hurricanes and Spitfires there. But instead of reluctantly agreeing, Churchill sets up a cabinet committee to investigate, and committees in parliament. By the time they report, Britain is being invaded. I think I would be on Dowding’s side in that debate.

The imperative is the wellbeing of students who are suffering under the present system, though they don’t realise it. Staff in UK universities, and elsewhere, are not able to mark with the integrity the matter demands, because there is no sufficiently reliable way of detecting or preventing improper use of AI. Academics are doing their best, but as one young colleague said to me, “We are fighting with our hands tied behind our backs”.

Cromwell’s immortal words to the Rump Parliament apply to the current generation of VCs: “You have sat too long here for any good you have been doing lately…Depart, I say, and let us have done with you. In the name of God, go!” We need an equivalent of the mani pulite movement in Italian politics – one that does not end with a Berlusconi.

by Andy Hamilton, Professor of Philosophy and former Chair of Board of Examiners at Durham University

Full disclosure, my son is at Durham doing an humanities degree and I totally agree there should be a return to in person degrees for all exams. He doesn’t use AI to write essays tbf, but does use it to help plan, find sources and check his work acting on recommendations he agrees with.

OP posts:
fortyfifty · Yesterday 10:17

Notanorthener · Yesterday 10:05

There is a danger here in spending money (& academic hours) policing AI use, which is just addressing the symptoms.

Instead, resources shld be focussed on the causes and devising AI proof and/or AI incorporated assessments.

Investigating and then punishing students after the event is very very stressful for all involved.

If there has been an uptick in disciplinary action as PP have stated, then this is the canary in the coal mine for universities to restructure assessments, not devote extra time and money to unproductive disciplinary processes.

Given the weighting of the degree is majority 3rd year work, they could at least focus attention on redesigning 3rd year assessments in the short term.

Elbowpatch · Yesterday 11:24

Instead, resources shld be focussed on the causes and devising AI proof and/or AI incorporated assessments

I agree, and that is happening. However, until AI proof (or tolerant) assessments can be devised, AI abuse with current assessment methods needs to be policed.

poetryandwine · Yesterday 11:40

Elbowpatch · Yesterday 08:57

Some universities have two stage process where AI abuse is strongly suspected. If suspected, the student has what is effectively a preliminary viva. The outcome of that determines whether or not the case is passed on to an academic conduct panel.

The viva does not affect the mark.

True, but it is not clear to me whether a viva given on these grounds may in itself be cause for appeal. Might it be said to be discriminatory?

I am not sure on what grounds the OIA has upheld student complaints involving vivas required on the grounds of these suspicions. I rather doubt they can be assigned a very high weighting in the course unit assessment, for the reasons discussed above by @Owlbookend

greenlampern · Yesterday 11:43

I think this shows the gaping generational divide. If you're AI literature, in particular genAI, there is no way you will not use it. It's like asking people not to use the internet for emails and to go back to handwritten letters. Students will role their eyes an ignore you. Not because they're lazy and want to cheat, younger people seem to be much more ambitious and savvy than my generation, but because they'll use the best tools out there to get to their point. Look at the innovation they're producing at such a young age.

It's nuts that people think that it should be policed. The whole system of how education is given and assessed needs be dismantled and rebuilt to fit within AI innovation. Not to lock it out. Students know it's not about passing exams and getting a 1st, it's about coming out the other end having learned and innovated.

This rhetoric reminds me of the people who voted leave.

Badbadbunny · Yesterday 11:46

greenlampern · Yesterday 11:43

I think this shows the gaping generational divide. If you're AI literature, in particular genAI, there is no way you will not use it. It's like asking people not to use the internet for emails and to go back to handwritten letters. Students will role their eyes an ignore you. Not because they're lazy and want to cheat, younger people seem to be much more ambitious and savvy than my generation, but because they'll use the best tools out there to get to their point. Look at the innovation they're producing at such a young age.

It's nuts that people think that it should be policed. The whole system of how education is given and assessed needs be dismantled and rebuilt to fit within AI innovation. Not to lock it out. Students know it's not about passing exams and getting a 1st, it's about coming out the other end having learned and innovated.

This rhetoric reminds me of the people who voted leave.

I agree with all that. AI is here to stay and is being integrated into workplaces etc., so it makes more sense to accept it and integrate it into education rather than stick your fingers in your ears and chant "la la la" trying to pretend it doesn't exist. That basically means re-modelling education to accept the fact that AI WILL be used, in the same way that workplaces are being remodelled to use AI. Instead of thinking of AI as the enemy, we need to embrace it and work with it.

poetryandwine · Yesterday 11:51

PS I agree with you that we need continued, and improved, monitoring for AI. My understanding is that privacy protections and, probably, insufficient ability to classify the work at present, prevent the inclusion of papers from UK, EU and US universities in Turnitin’s internal AI detection training database. Millions of these papers are housed elsewhere on Turnitin.

Does anyone know more about this?

It ought to be possible to access this database to improve AI detection.

Agree that getting it wrong is very serious.

greenlampern · Yesterday 11:54

Badbadbunny · Yesterday 11:46

I agree with all that. AI is here to stay and is being integrated into workplaces etc., so it makes more sense to accept it and integrate it into education rather than stick your fingers in your ears and chant "la la la" trying to pretend it doesn't exist. That basically means re-modelling education to accept the fact that AI WILL be used, in the same way that workplaces are being remodelled to use AI. Instead of thinking of AI as the enemy, we need to embrace it and work with it.

So I do data analysis. The amount of data I can analyse with genAI and get better results from (better use of public money and time funding my research) its astronomical its scary. I can do 4/5 years work in 1 year now.

My issue is, stuff is advancing faster than the scientific method can keep up, so I'm having to build methodology- validate it, and then go on. Another issues is that suddenly we're being expected to produce much more work, much faster than before. The idea that AI can be used in workplace stuff (creating documents etc) is acceptable, which means you're expected to develop/submit/apply for things much faster.

Older academic administrators understanding of AI being used to cheat on essays is so limited and banal it doesn't warrant addressing and younger people are just ignoring them.

DelurkingAJ · Yesterday 11:56

Badbadbunny · Yesterday 11:46

I agree with all that. AI is here to stay and is being integrated into workplaces etc., so it makes more sense to accept it and integrate it into education rather than stick your fingers in your ears and chant "la la la" trying to pretend it doesn't exist. That basically means re-modelling education to accept the fact that AI WILL be used, in the same way that workplaces are being remodelled to use AI. Instead of thinking of AI as the enemy, we need to embrace it and work with it.

Hard disagree. You need to understand a subject before you can check AI properly. We’re seeing issues already at work (accounting) where people are asking AI and the answers are simply wrong and they don’t spot it because they never had the knowledge in the first place.

Doesn’t mean that AI isn’t a useful tool but you can’t use a spade if you don’t understand what you’re planting where.

greenlampern · Yesterday 11:56

Oh and many universities are giving students free AI platforms. Then using that to 'detect cheating'. This just comes across as antagonistic and pointless. This aggression is just widening the generational divide.

poetryandwine · Yesterday 11:57

greenlampern · Yesterday 11:43

I think this shows the gaping generational divide. If you're AI literature, in particular genAI, there is no way you will not use it. It's like asking people not to use the internet for emails and to go back to handwritten letters. Students will role their eyes an ignore you. Not because they're lazy and want to cheat, younger people seem to be much more ambitious and savvy than my generation, but because they'll use the best tools out there to get to their point. Look at the innovation they're producing at such a young age.

It's nuts that people think that it should be policed. The whole system of how education is given and assessed needs be dismantled and rebuilt to fit within AI innovation. Not to lock it out. Students know it's not about passing exams and getting a 1st, it's about coming out the other end having learned and innovated.

This rhetoric reminds me of the people who voted leave.

Yes and no. Analysis shows that a lot of AI generated literature is form over content. That is true from suboptimal vibe coding to musical composition. Of course this stuff has its place. But we need to identify it accurately, just as we should not confuse ready meals with properly cooked food if they happen to be served in a nice gastropub.

I fully agree that appropriate use of AI can be a great thing. But it is very difficult to persuade students not to take too many shortcuts.

greenlampern · Yesterday 11:57

DelurkingAJ · Yesterday 11:56

Hard disagree. You need to understand a subject before you can check AI properly. We’re seeing issues already at work (accounting) where people are asking AI and the answers are simply wrong and they don’t spot it because they never had the knowledge in the first place.

Doesn’t mean that AI isn’t a useful tool but you can’t use a spade if you don’t understand what you’re planting where.

Yes, which is why it needs to be incorporated into education as a tool. Its doesn't replace the act of teaching...

greenlampern · Yesterday 11:59

poetryandwine · Yesterday 11:57

Yes and no. Analysis shows that a lot of AI generated literature is form over content. That is true from suboptimal vibe coding to musical composition. Of course this stuff has its place. But we need to identify it accurately, just as we should not confuse ready meals with properly cooked food if they happen to be served in a nice gastropub.

I fully agree that appropriate use of AI can be a great thing. But it is very difficult to persuade students not to take too many shortcuts.

You cant identify it accurately, and too much energy and aggression is wasted on this.

Badbadbunny · Yesterday 12:07

DelurkingAJ · Yesterday 11:56

Hard disagree. You need to understand a subject before you can check AI properly. We’re seeing issues already at work (accounting) where people are asking AI and the answers are simply wrong and they don’t spot it because they never had the knowledge in the first place.

Doesn’t mean that AI isn’t a useful tool but you can’t use a spade if you don’t understand what you’re planting where.

I never said to replace knowledge with AI, I said to integrate AI into education, accept it WILL be used, etc.

And yes, I'm an accountant so know exactly the limitations of AI as I use AI on a daily basis myself and can spot a client email that has been generated by AI a mile off. I never ask AI for specific questions as it's already well known that it doesn't "do" Maths very well, but it's utterly brilliant when writing databases or complex spreadsheets when you ask it to create a formula to do "x, y, and z" or how to link several different fields in a database etc. It's also incredibly good to do "sanity checks" of advice/letters/reports I've drafted before sending to clients.

It's exactly why I think we need to incorporate AI into education to "teach" students what it can do, what it can't do, how to check it, etc., rather than pretend it doesn't exist and expect students not to use it.

DelurkingAJ · Yesterday 12:11

@Badbadbunny I agree completely.

I do think it has no place in exams, however. It feels like an excellent coursework module.

JulietteHasAGun · Yesterday 12:11

greenlampern · Yesterday 11:59

You cant identify it accurately, and too much energy and aggression is wasted on this.

Totally agree. I’ve put in an assignment I’ve written without AI and it flagged it as possible AI. I also got ChatGPT to write me an essay mapped to specific learning outcomes and then ran that through CoPilot which confidently said it wasn’t AI.

As an academic I’m told if I suspect AI use all I can do is ask the student. If they don’t confess we can’t even send them to an academic offences panel. I had one student whose reference list was full of ghost references. Journals and articles which didn’t exist. She just said she must have written them down incorrectly 🤷🏻‍♀️🙈

poetryandwine · Yesterday 12:11

greenlampern · Yesterday 11:59

You cant identify it accurately, and too much energy and aggression is wasted on this.

Students need to know the difference, and they need an understanding of workplace integrity.

The details of good practice constantly evolve. I do agree with you that we need to teach proper -intellectually productive and ethical - use of AI rather than burying heads in the sand.

But I wondered what you meant in your post of 11.54: ‘stuff is advancing faster than the scientific method can keep up’? So you’re inventing new methodology. Why are you not following the scientific method?

Elbowpatch · Yesterday 12:16

poetryandwine · Yesterday 11:40

True, but it is not clear to me whether a viva given on these grounds may in itself be cause for appeal. Might it be said to be discriminatory?

I am not sure on what grounds the OIA has upheld student complaints involving vivas required on the grounds of these suspicions. I rather doubt they can be assigned a very high weighting in the course unit assessment, for the reasons discussed above by @Owlbookend

On what grounds? Apologies if that has already been covered.

poetryandwine · Yesterday 12:20

Elbowpatch · Yesterday 12:16

On what grounds? Apologies if that has already been covered.

It is generally recognised that AI detection is far from perfect. Thus using it to choose which students to viva is problematic.

Elbowpatch · Yesterday 12:22

poetryandwine · Yesterday 12:20

It is generally recognised that AI detection is far from perfect. Thus using it to choose which students to viva is problematic.

Even when that AI detection is done by a human?

poetryandwine · Yesterday 12:25

PS @Elbowpatch and there are those who perform poorly on vivas for all sorts of reasons. Particularly some SEND students, but others also.

One of my best ever UG project students gave a dreadful viva. Luckily for her this was before AI. Also, she had worked hard and enthusiastically all year and done a lot of high quality thinking in my presence. I can easily imagine things going badly wrong for her nowadays, under different circumstances.

greenlampern · Yesterday 12:33

poetryandwine · Yesterday 12:11

Students need to know the difference, and they need an understanding of workplace integrity.

The details of good practice constantly evolve. I do agree with you that we need to teach proper -intellectually productive and ethical - use of AI rather than burying heads in the sand.

But I wondered what you meant in your post of 11.54: ‘stuff is advancing faster than the scientific method can keep up’? So you’re inventing new methodology. Why are you not following the scientific method?

But I wondered what you meant in your post of 11.54: ‘stuff is advancing faster than the scientific method can keep up’? So you’re inventing new methodology. Why are you not following the scientific method?

Because the scientific method continuously evolves- but never at this scale. Let's say I'm studying recycling policy development in Scotland. I'd use a method called policy analysis. Because of how time consuming and detailed this work is, I'd most likely pick one city and a timeline. So my findings will tell us about recycling policy in Aberdeen between 2000 and 2025. With genAI I can teach a model my research protocol, and then all I have to do is feed it the policies. I can run analysis on the whole of Scotland in a week.

But, this is scientific research- not a hobby. You have to use the scientific method because it has been validated. Usually policy analysis requires human decisions and judgments. Each decision is logged, then cross checked by 2 or more other researchers. This takes time, money, and expensive softwares. Then every decision is published along with my findings- where we got this information, how this decision was made etc.

Thats the check we traditionally have to ensure that what the research is saying is sound. Essentially with genAI, I can run a project with 5 or as many as I want elite research assistants in my computer. But how do we validate what the genAI model I created is sound?

I have to design a control, or test and run that first. I have to pull up every possible hidden drawback and unpack it and interrogate it. Usually controls like this are part of the scientific method, there is a checking system. You just prove you used it. Usually the scientific method advances in such a slow pace, this work would be done by lots of people in lots of countries (which is why academic conferences are useful). Now I (and other people in my position) have to waste time and effort proving this. It's double the work. Usually innovation is so slow, this kind of testing is done before it reaches the user.

We all know its sound because the tools is that good. We've tested it through our own use. Everyone using it knows it's correct. Advance models don't hallucinate, and they can be closed off so no worries about dating leaking etc.

Another example of this is the use of AI in diagnostics. It can detect breast cancer five years before a human can detect it on a mammogram. It can detect gender on retina scans, and no one seems to understand why. But we cant use it for breast cancer diagnosis- beaucse it's not been verified. What if it's wrong? It's not, but how can we prove it's not?

Sorry this is long and rambly- I hope it's been helpful!

greenlampern · Yesterday 12:40

I also do think that there is a huge issue with companies selling at exorbitant costs and controlling access to knowledge (that they didn't create...). A lot of this is tied to government tech development, so what the public has access to very much controlled by the neoliberal military industrial complex.

This is like when people didn't want electrical wiring in their homes, or wifi. I'd love to go back to the days when computers were kept on a desk at home or at work. I think the internet has messed up all out brains. But it's here now, you can't work, learn, or even be without it anymore.

So universities focusing on student using AI is such a limited and pointless waste of energy. This is going to revolutionise the world, not necessarily for the better, but it is.

poetryandwine · Yesterday 12:51

greenlampern · Yesterday 12:33

But I wondered what you meant in your post of 11.54: ‘stuff is advancing faster than the scientific method can keep up’? So you’re inventing new methodology. Why are you not following the scientific method?

Because the scientific method continuously evolves- but never at this scale. Let's say I'm studying recycling policy development in Scotland. I'd use a method called policy analysis. Because of how time consuming and detailed this work is, I'd most likely pick one city and a timeline. So my findings will tell us about recycling policy in Aberdeen between 2000 and 2025. With genAI I can teach a model my research protocol, and then all I have to do is feed it the policies. I can run analysis on the whole of Scotland in a week.

But, this is scientific research- not a hobby. You have to use the scientific method because it has been validated. Usually policy analysis requires human decisions and judgments. Each decision is logged, then cross checked by 2 or more other researchers. This takes time, money, and expensive softwares. Then every decision is published along with my findings- where we got this information, how this decision was made etc.

Thats the check we traditionally have to ensure that what the research is saying is sound. Essentially with genAI, I can run a project with 5 or as many as I want elite research assistants in my computer. But how do we validate what the genAI model I created is sound?

I have to design a control, or test and run that first. I have to pull up every possible hidden drawback and unpack it and interrogate it. Usually controls like this are part of the scientific method, there is a checking system. You just prove you used it. Usually the scientific method advances in such a slow pace, this work would be done by lots of people in lots of countries (which is why academic conferences are useful). Now I (and other people in my position) have to waste time and effort proving this. It's double the work. Usually innovation is so slow, this kind of testing is done before it reaches the user.

We all know its sound because the tools is that good. We've tested it through our own use. Everyone using it knows it's correct. Advance models don't hallucinate, and they can be closed off so no worries about dating leaking etc.

Another example of this is the use of AI in diagnostics. It can detect breast cancer five years before a human can detect it on a mammogram. It can detect gender on retina scans, and no one seems to understand why. But we cant use it for breast cancer diagnosis- beaucse it's not been verified. What if it's wrong? It's not, but how can we prove it's not?

Sorry this is long and rambly- I hope it's been helpful!

Yes, I am a STEM academic. I don’t specialise in anything medical or biological, but I regard the use of LLMs in medical imaging and diagnostics as one of the unmitigated successes to date. I think shortly LLMs will be able to take over a great deal here and I hope that will relieve a burden in the NHS.

I think you’re talking about a new scale of workings rather than refuting the scientific methodology, but I daresay scientists and engineers felt the same way during the Industrial Revolution! In 200 years the AI paradigm shift will seem slow paced.

poetryandwine · Yesterday 12:53

Elbowpatch · Yesterday 12:22

Even when that AI detection is done by a human?

Oh, humans are worse than machines! We have intuition but that is quite variable and doesn’t stand up well to challenge.

Elbowpatch · Yesterday 13:11

poetryandwine · Yesterday 12:25

PS @Elbowpatch and there are those who perform poorly on vivas for all sorts of reasons. Particularly some SEND students, but others also.

One of my best ever UG project students gave a dreadful viva. Luckily for her this was before AI. Also, she had worked hard and enthusiastically all year and done a lot of high quality thinking in my presence. I can easily imagine things going badly wrong for her nowadays, under different circumstances.

Well, we’ll see. Vivas have been part and parcel of assessment for many years and not given rise to any issues. Not that I am aware of anyway. Appropriate adjustments can be made for SEND students, just like for any other form of assessment.