Read The Times Australia

Daily Bulletin

Errors in Centrelink's debt recovery system were inevitable, as in all complex systems

  • Written by: Simon Williams, Lecturer in Mathematics, Flinders University

Since it announced a crackdown on outstanding debt in June last year, Centrelink has sent debt recovery letters to thousands of Australian welfare recipients. Early reports indicated that around 20% were issued in error, although the true number may be substantially higher.

This led to the appointment last week of a Senate committee to investigate why so many erroneous letters were sent.

While there has been some controversy over the government’s reliance on big data to save taxpayers’ money, none of the commentary has come close to understanding the complexities hidden in the notion of the error rate.

The system that Centrelink employs is an example of artificial intelligence, and the problems it faces are intrinsic to all decision systems.

Working with error

Mathematicians, computer scientists and engineers have long grappled with these problems and developed ways of quantifying the errors, as well as determining the point that balances their relative importance.

The same mathematics is used to describe radar systems, disease screening programs, and speech recognition systems such as Siri and Google Cloud Speech.

But is it an error in itself to talk of a single error rate for such systems. That assumes there is only one way to make an error, or that all of the different types of errors are equally bad, which is almost never the case.

For example, there are at least two ways the Centrelink system can make a mistake in its job of sending letters to debtors. It can send a letter to someone without a debt or it can overlook sending a letter to someone who does have a debt.

The former error is a false positive, and the rate at which a system avoids this type of error is called the sensitivity. The latter error is a false negative, and the associated rate is called the specificity.

image How many of those radar blips are not really aircraft? Flickr/NATS Press Office, CC BY-NC

Radar systems, which use radio waves to detect aircraft, are where this notion of multiple sources of error was first formalised. It is important not to miss a detection of aircraft (false negative) but the avoidance of mistaken detections is equally important.

Having to constantly deal with incorrect reports is fatiguing for the operator and may cause them to discount valid detections – like a modern version of The Boy Who Cried Wolf.

Screening errors

Another example of the importance of multiple error mechanisms is in breast cancer screening programs.

While it is good to catch cancers early, it is also important to correctly clear women without breast cancer due to the psychological harm caused by the stress of an incorrect diagnosis.

These systems use a type of machine learning called classification. A classification system uses all of the available data to decide into which class a particular case falls. We have already seen that for a two-class problem (Centrelink: letter or no letter) there are two ways to make an error.

But it gets worse: for a three-class problem (a cancer test that decides between clear, benign and malignant) that number rises to six. The more decisions a system makes, the more ways it can fail.

Nobody’s perfect

It is not possible to make a perfect system. Errors in data entry, natural variability and human error all play their part in blurring the line between debt and no debt, malignant or benign, detection or false alarm.

The following sequence shows how a mathematician approaches the decision process using a simplified model of the Centrelink system, where we cross-reference someone’s declared income against their actual income. The following sequence uses simulated data that assumes only that debtors’ actual income is higher than their declared income.

image An animated explanation of how mathematicians understand how to measure the performance of a simplified version of the Centrelink robot-debt system.

But the intrinsic uncertainty in the data, and the freedom we have to choose the decision line, shows the impossibility of simply defining the error rate.

What we can do is summarise the performance of the system using a method known as receiver operating characteristic (ROC) where we move the decision line and record the true and false positive rates for each position.

image This chart shows that as the number of debtors correctly identified increases, so too does the number of non-debtors who are incorrectly identified as debtors. Author created

The true positive rate is the percentage of debtors sent a letter, and the false positive rate is the percentage of people incorrectly sent a debt letter. This graph is generated by marking all true positive/false positive pairs for each possible decision line against the same simulated data used earlier.

There are two things we can see from this graph: if you want to identify all of the debtors (100% true positive) you will also catch 90% of the debt-free in your net. This point is marked A on the curve. Furthermore, if you do not want to write any unnecessary letters (0% false positive) then you will only find 38% of those who should pay back a debt (marked B).

As usual, a trade off is unavoidable, and the sensible way lies somewhere in between. That’s why you have to accept that there will always be some errors in your system. The challenge is how to balance them, and how to deal with them whey they are occur.

Why now?

One puzzle remains for the Centrelink system. The data matching algorithm has been in use for several years in a manual form, but only now is it making headlines.

The answer lies in the expansion of its application. The automated system sends out 20,000 letters a week, a 50-fold increase on the the manual system. Broadening the population to which a system is applied has diminished its effectiveness.

Imagine we have a test for breast cancer that is 99% accurate at identifying cancer, and clears those without cancer 99% of the time. This sounds like a great test, but when we use it for mass screening, such as applying it to every Australian, things will go wrong.

The prevalence of breast cancer in Australia is 61 per 100,000 people. This means the test will correctly identify all but 14 of the approximately 1,400 new cases in the country each year.

But by the same token, 230,000 people will be incorrectly diagnosed with cancer and recalled unnecessarily for further expensive and invasive testing to clear them completely. That’s not to mention the stress and worry these incorrect diagnoses would cause. It would rightly be denounced as a disaster.

The actual screening program avoids this trap by narrowing its application to women over 50, which is a small enough population for whom the prevalence is high enough to make screening useful.

Lessons to learn

The Human Services Minister, Alan Tudge, says that sending a letter to a non-debtor is not an error.

But, as we have seen, downplaying these errors and concentrating only on identifying more and more debtors magnifies the number of this second type of error.

Also, expanding the data-matching system and removing the human element from the case-selection process has undermined the system’s performance even further.

By underestimating both the number of errors possible and the effects of their interaction, Centrelink is left dusting itself off after having been hit by a big steaming pile of data.

Authors: Simon Williams, Lecturer in Mathematics, Flinders University

Read more http://theconversation.com/errors-in-centrelinks-debt-recovery-system-were-inevitable-as-in-all-complex-systems-71409

Business News

Is Your Brand Showing Up in AI Search? Most Melbourne Brands Aren't.

The New Front Door Nobody Told You About Something changed. Quietly. Without a press release. The way buyers find businesses in Australia has been rewired. Not replaced, rewired. Google isn't dead...

Daily Bulletin - avatar Daily Bulletin

How Australian Businesses Can Measure SEO ROI

SEO can feel vague when you are staring at a dashboard full of numbers that do not clearly connect to revenue. The key is to measure the right signals in the right order, then tie them back to outcome...

Daily Bulletin - avatar Daily Bulletin

How Commercial Roller Shutters Improve Site Security Without Slowing Operations

Security upgrades can be frustrating when they make everyday work harder. A door that takes too long to open, creates bottlenecks at shift change, or fails at the worst time can turn “better protectio...

Daily Bulletin - avatar Daily Bulletin

Why a Document Destruction Service Still Matters for Modern Businesses

Businesses generate large volumes of information every day, from staff records and contracts to invoices, reports and customer files. While attention often focuses on how documents are stored, the way...

Daily Bulletin - avatar Daily Bulletin

Bicycle Rack Safety and Space-Smart Storage

Bike storage problems usually show up as small annoyances first: tangled handlebars, scratched frames, and bikes that topple when you pull one out. Over time, those issues become safety risks, especia...

Daily Bulletin - avatar Daily Bulletin

How to Tell if a Childcare Centre Is a Good Fit for Your Child

Choosing childcare can feel like you’re making a huge decision with limited information. Tours are short, centres are often on their best behaviour, and your child might act differently in a new space...

Daily Bulletin - avatar Daily Bulletin

Car Import Timeline: What Usually Happens at Each Stage

Importing a car into Australia can feel confusing because multiple agencies and checkpoints are involved, and the timeline is shaped as much by paperwork quality as it is by shipping speed. The most u...

Daily Bulletin - avatar Daily Bulletin

Portable Toilet Hygiene Standards Explained: Clean vs Sanitised vs Disinfected

In portable toilet servicing, the words clean, sanitised, and disinfected often get used as if they mean the same thing. They don’t. And that difference matters because a unit can look tidy and still ...

Daily Bulletin - avatar Daily Bulletin

Options Available When a Company Faces Financial Distress

Financial distress can develop gradually or arrive suddenly, and when it does, the decisions made in the early stages often determine what options remain available later. Directors who act promptly ...

Daily Bulletin - avatar Daily Bulletin

The Daily Magazine

What Actually Makes a Good Criminal Lawyer in Melbourne

Most people only think about this question once. That is usually too late. Most people charged wi...

Why Working With A Chatswood Tutor Can Improve Academic Performance

Academic expectations continue increasing for students across primary school, high school, and senio...

Is It Worth Getting Solar Panels in Melbourne?

The real question is not whether solar works in Melbourne. It works. The question is what it is co...

How A Diploma Of Project Management Builds Practical Skills For Modern Work Environments

Developing the ability to plan, execute, and deliver outcomes efficiently is a key requirement in to...

How to Choose the Right Football for Every Level

Choosing a football may seem straightforward, but the right option depends on who will be using it a...

What to Ask a Wedding Photographer Before You Book

Booking a wedding photographer can feel deceptively simple: you like the photos, you like the vibe...

Why Stress Relief For Dogs Is Essential For Emotional Balance And Long-Term Wellbeing

Managing emotional health is just as important as physical care when it comes to pets, which is why ...

Australia’s Best Walking Trails and the Shoes You Need to Tackle Them

Australia is not short on spectacular walks. You can follow ocean cliffs in Victoria, cross ancien...

Why Pre-Purchase Building Inspections Are Essential Before Buying a Home in Australia

source Have you ever walked through an open home and started picturing your furniture, family d...