Automating assessments with AI

Dawid Naude, Director, Pathfindr

We recently reflected on a common theme we’re noticing in the work we’re delivering. The projects had different names like “accreditation certifier”, “clause validator” or “skilled migrant assessment”, but they were all doing the same thing, assessing something.

What we also realised is that AI is often great at a first draft, or a last pass. In this case, an initial assessment.

Assessing contracts

These were tasks that in the past were hard for computers to assess because of the text content. Take a contract - in the screenshot below is an example from a recent projects where AI has been told to find discrpencies from rules in a contract. If you look at the rule and relevant exerpt columns, you’ll notice that they are completely different in language but the same in meaning, and the tool recognises this. With rule number 4, it’s picked up that the minimum liability should be $2 million not $1 million, flags the discrepancy and then allows you to view the source.



In this example the problem was that for non-disclosure agreements, most lawyers were typically looking for the same things. IP protection, liability and a few others.

Video and course content assessments

Another example was assessing regulated training content to see if it meets professional standards. In the past someone would manually review every minute of every course to make sure that it met professional standards, but what we discovered is that what they were looking for was very achievable to instruct to an LLM agent.

So now a solution reviews all the training content for markers on professional standards, readability, quality. This process used to take thousands of hours per year, now reduced by 90%.

Another example is automating the review of the content provided by a candidate for a skills assessment. What often happens in a complex assessment, like the skilled migrant assessment performed by many industry associations, is that most of the time is taken up by reviewing academic certificates to see if they meet what was mentioned in the application, or that both sides of the driver’s licence are scanned, or that they’ve provided evidence of employment.

The skills assessment is still performed by a specialist, but all the initial prep work validation is done by the bot.

10 ideas for automating assessment

  1. Tender response meets the requirements
  2. User story quality
  3. Contract criteria
  4. Job application with job description
  5. Tests
  6. Complaints submissions (assess severity and type)
  7. Compliance audits
  8. Customer feedback
  9. Supplier performance
  10. Statements of work


Go through your current process and see what takes you long to review. Find where the bottlenecks are - who are you waiting on, and could this just initially be reviewed by AI.

Other Blogs from Dawid


5 No Brainers

For all the hype around AI, it’s often not clear to business where they should get started. Some convince themselves they have a plan, you’re in this category if you say “We’ve given some people access to MS Copilot”. Let me guess - they seem to like the MS Teams meeting summarisation right?

AI Poor Data

I don’t know if it’s simply a convenient answer whilst there are hundreds of other competing priorities, or if it’s a misinformed opinion, but “our data isn’t ready yet to take advantage of AI” is something we hear regularly. Or similar variants like needing to spend 2024 getting data ready and then they’ll reassess AI in 2025.

How to think about copilots

Bots, like people, have two types of usefulness - knowledge and skills. Things it knows, and work it can do. In this Blog, Dawid Naude of Pathfindr provides practical advice to help you roll out or review your deployment of co-pilots and chatbots within your organisation, to generate both a short time impact and longer term strategic value.

Use whats in front of you

The first step your company should take is learning how to use ChatGPT properly. Very few do. There’s a lot of talk of AI, and businesses need to start tinkering right away. There is a huge opportunity accessible right now, right in front of you. You can completely automate processes with autonomous agents, have all your content created automatically for your learning management platform, and even enable the equivalent of a data science team with a few clicks.

AI Value not in Chat

Chat is only incremental The biggest mistake you’re making right now is thinking about AI in terms of ChatGPT, or chat interfaces. These assistant tools are great generalist support applications and a demonstration of the incredible power of LLM’s. But. They are a single step in a process, typically involving a lot of Ctrl+c, Ctrl+v, a lot of prompting, trial and error, to get a consistent outcome.

10 Boring Problems

Whilst we talk about ethics, avatars, deep fakes and other magical technology, the reality is that for most businesses, the value in AI will come from solving very boring problems. The average desk worker has a lot of boring tasks. There’s a substantial amount of download, copy, paste, save-as, etc. There’s a lot of responding to messages that could be found on the website. There’s a lot of vlookups and pivottables, meeting minutes and deal updates. We love solving boring problems, and get less excited about deep fakes and avatars.

How accurate is AI

Is it accurate? Along with “Is my data safe”, this is the most common question about AI. We’ve all experienced mild embraces with inaccuracy, possibly even hallucination of AI models. So how accurate is AI, let’s get a bit more specific.

Early AI pioneers

Our clients are trailblazers of AI, and it’s clear what the leaders are doing.