Western researchers use math to decipher how machine learning works
Team makes breakthrough in understanding how computer models can perform brain-like activities
Western researchers have developed a novel technique using math to understand exactly how neural networks make decisions – a widely recognized but poorly understood process in the field of machine learning.
Many of today’s technologies, from digital assistants like Siri and ChatGPT to medical imaging and self-driving cars, are powered by machine learning. However, the neural networks – computer models inspired by the human brain – behind these machine learning systems have been difficult to understand, sometimes earning them the nickname “black boxes” among researchers.
“We create neural networks that can perform specific tasks, while also allowing us to solve the equations that govern the networks’ activity,” said Lyle Muller, mathematics professor and director of Western’s Fields Lab for Network Science, part of the newly created Fields-Western Collaboration Centre. “This mathematical solution lets us ‘open the black box’ to understand precisely how the network does what it does.”
The findings were published in the high impact journal PNAS, in collaboration with international researchers including University of Amsterdam’s machine learning research chair Max Welling.
‘Seeing things’ by segmenting images into parts
The Western team, which included Muller, post-doctoral scholars Luisa Liboni and Roberto Budzinski and graduate student Alex Busch, first demonstrated this new advancement on a task called image segmentation – a fundamental process in computer vision where machine learning systems divide images into distinct parts, like separating objects in an image from the background.
Starting with simple geometric shapes like squares and triangles, they created a neural network that could segment these basic images.
Muller and his collaborators next used a mathematical approach, which they previously developed to study other networks, to investigate how the new network performed this segmentation task when analyzing these simple images.
The mathematical approach allowed the team to understand precisely how each step of the computation occurred. Somewhat surprisingly, the team then found the network could also segment – or see and interpret – a handful of natural images, like photographs of a polar bear walking through the snow or a bird in the wild.
“By simplifying the process to gain mathematical insight, we were able to construct a network that was more flexible than previous approaches and also performed well on new inputs it had never seen,” said Muller, a member of the Western Institute for Neuroscience.