Learning what machine learning can achieve
At the 4th GovTech STACK-X Meetup, two experts showcased some of the practical uses and latest advances in machine learning.
In simpler times, the word âtrainingâ applied exclusively to living thingsâfor instance, you could train a dog to perform a trick or a person to excel at a sport. But nowadays, even machines can be trained to carry out specific tasks.
For something to be trainable, however, it must first be embedded with some form of capacity for learning, and this is indeed what computer scientists have enabled machines to do. This is the field of machine learning, an exciting and rapidly expanding frontier.
Dr Tok Wee Hyong, Principal Data Science Manager at Microsoft, and Mr Chong Jia Yi, a distinguished engineer at the Government Technology Agency of Singapore (GovTech), gave audiences a flavour of the practical uses of machine learning. They were speaking at the 4th GovTech STACK-X Meetup, a regular series of talks to engage the tech community.
Counting on AI
âToday, developments in machine learning and artificial intelligence (AI) are happening at an accelerated pace, but what is really important is that these are not just innovations in academia. A lot of the advancements are immediately applicable to real-world problems,â said Dr Tok.
For example, Project Natick at Microsoft seeks to understand the challenges of deploying subsea data centres worldwide; it is the worldâs first underwater data centre, and it was designed with an emphasis on sustainability. Together with the Project Natick team, Dr Tok and his team of data scientists used deep learning-based object detection techniques to automate the monitoring and counting of marine lifeforms and deeply understand the impact of deploying subsea data centres.
Dr Tokâs team also look at how they can solve some of the worldâs toughest challenges as part of the AI for Earth programme. For example, the team worked on training segmentation models to perform land cover mapping for the whole of the United States.
Today, a large segment of the worldâs population lacks access to trained radiologists, even when imaging equipment is readily available. His team investigated various approaches to using deep learning to predict diseases from chest X-ray images, providing additional inputs to radiologists and helping them make better decisions. To aid interpretability of the results, his team used Class Activation Maps (CAM) to understand and explain the prediction results from the deep learning models,
Painting a graphic picture
Speaking of complex problems, creating lifelike simulations of objects and living things, such as those portrayed in video games or animated movies, used to be a manual and painstaking process. Now, with the help of machine learning, animators can achieve their vision of realistic digital characters with less hassle.
Mr Chong, a distinguished engineer at GovTech, knows very well the potential of machine learning in graphics. Having worked with Pixar for close to ten years prior to joining GovTech, he was responsible for developing physics simulation and animation tools used in popular films such as WALL-E, Brave, Toy Story 3 and Up.
âFor instance, I worked on a skin simulation for digital characters which was used on all the animals of the movie Brave,â he said, adding that mimicking how the skin slides across the body during movement is actually a pretty complicated problem to solve mathematically.
He stated that in recent years, Machine Learning, in particular in the field of Deep Learning, has enabled the generation of smooth dynamic motion via trained Neural Networks. Essentially, we can now train machines to figure out the algorithms/methods to animate or simulate dynamic characters. Deep Learning methods for computer graphics and animation allow the automatic generation of complex motions like fluids, skin/flesh/muscle and character run cycles. This is now an additional tool that visual effects, animation studios, and game companies use to improve the overall expressiveness of their characters.
But the holy grail of machine learning for graphics, according to Mr Chong, is â2D RGB to 3D human pose estimationâ. Basically, this means taking a two-dimensional image (or video) of a person and then correctly identifying and assigning the positions of the torso and limbs as if they were in three-dimensional space. Existing machine learning algorithms still have difficulty performing this task in a robust manner, but Mr Chong thinks that whoever can solve this problem well âwill change the face of the motion capture industry as we know it.â
Clearly, the potential of machine learning is still largely untapped. Novel and useful applications of the technology are still waiting to emerge, limited only by our imagination.