Ep. 279: Tineke Distelmans - From Data to Decisions: AI in Business
Welcome to Count Me In. I'm your host Adam Larson and today we're exploring artificial intelligence and machine learning with our special guest Tineke Distelmans, an assistant professor at Vrayeruniversity. Amsterdam. Tanike will break down the differences between these often discussed terms and show how they impact our daily lives. We'll also delve into a case study she developed, which used machine learning to predict patient satisfaction at a hospital setting.
Adam Larson:From data quality to ethical considerations, Tanike shares invaluable insights. So join us as we uncover practical tips, discuss the challenges of AI implementation, and explore the evolving world of generative AI, all aimed at making your jobs easier and more efficient. Let's get started. Well, Tanika, welcome to the Count Me In podcast. I'm so excited to have you here, and we're gonna be talking a lot about artificial intelligence, machine learning, and some case studies that you've, you've been a part of.
Adam Larson:And so maybe we can start off just at the top level, you know, artificial intelligence, machine learning. Maybe we can talk about the differences in the in the, and because these aren't new these aren't new concepts. Everybody's hearing these terms, but we can break down the difference between the 2 to start off.
Tineke Distelmans:First of all, thank you for the invite. I'm, I'm very happy to be here. But first, indeed, maybe let's set it clear what we're, what we're talking about today. Now if we're talking about artificial intelligence, that's actually quite a broad field because that refers to the development of computer systems that traditionally would have required human intelligence to perform tasks such as, for example, learning, problem solving, language understanding, all of these things. So that's very broad field where we refer to as artificial intelligence.
Tineke Distelmans:Now if we're talking about machine learning, then we're talking about a subfield of artificial intelligence. And when we're talking about machine learning, we refer to the use of statistical programs or algorithms that enable computers to learn from existing data without being explicitly preprogrammed. So without getting very explicit instructions on what to do, machine learning enables the computer to learn from its experience.
Adam Larson:Okay. So it it sounds like we encounter this every day in our lives. Maybe, are there some examples you can give that you can say you're not even realizing that you're you're encountering either machine learning or AI?
Tineke Distelmans:Yeah. That's actually true. I think nowadays, we can say that it's it's everywhere around us in every little corner. If I'm taking my smartphone, I have my virtual assistant, Siri, who can understand and respond to my voice commands. If I go to my mailbox, I have an algorithm behind my spam filter determining what is a a spam mail and and what is not.
Tineke Distelmans:If I go to my social media, it's an algorithm that is determining what kind of content I see on my feeds based on previous interactions that I had so so that I get very personalized content. But, also, if I go to my Spotify, I wanna listen to some music. I get very personal recommendations based on what I have been previously listening, and the same goes for Netflix. There's an entire algorithm behind it determining what are recommendations based on my personal taste. And even my supermarket is sending me personal, recommendations, let's say, or or sales that might be interesting for me based on my previous buying behavior.
Tineke Distelmans:So it's in every little corner in our daily lives, I would say.
Adam Larson:Now there's one thing that you hear a lot too, especially when talking about things like Netflix algorithms and stuff like that that they're learning and they improve their performance. Now are they are they learning in our traditional sense? Because when you think about learning, you think, oh, humans learn things, but can the machines learn as well?
Tineke Distelmans:The machines learn indeed as well, and and this happens through a kind of iterative learning process. So what is actually happening is that we feed algorithms with data. We provide them the data, and then the algorithm will do the job. It will optimize and fine tune its parameters to make sure that it will better be able to make a prediction or to make a decision or to recognize certain patterns. And so the more data we feed these algorithms, the more opportunities these algorithms have to learn from and the better they will become and the better generalizable they will be.
Adam Larson:So one of the when you and I were chatting before, you mentioned something about supervised and unsupervised learning algorithms. Can you explain the difference between those and what those are?
Tineke Distelmans:Yeah. So within machine learning, there are basically 2 main streams. Let's say we have unsupervised machine learning, supervised machine learning. Now when we're talking about supervised machine learning, then what we actually need during the training process is we need a labeled training dataset. So in the first place, we need to tell the algorithm the inputs, but also the outputs.
Tineke Distelmans:And by providing the algorithm the correct outputs associated with the inputs, then the algorithm can learn and to make the associations itself. So, for example, if we want to detect fraudulent cases, then in the first place, we need to provide the algorithm with some cases where there was fraud or fraudulent transactions and no fraudulent transactions. And we need to tell the algorithm this is a transaction that was fraudulent. This was a transaction that was not fraudulent and so on. And then the learn through the learning process, the algorithm will learn that association and will be able to predict it itself.
Tineke Distelmans:Now when we're talking about unsupervised machine learning, there, we don't need this labeled training data. We just provide the algorithm with the data, and then it's up to the algorithm to find the right structure to recognize certain patterns within, that dataset.
Adam Larson:Now is one better than the other, or it just depends on the application?
Tineke Distelmans:It depends on the application, and it it depends on on the type of problem you're trying to solve. Let's say, that that you decide to either go for a supervised or an unsupervised approach.
Adam Larson:Gotcha. Okay. So and when we first started talking, I mentioned the case study. Maybe you can just give us an overview of the case study. Obviously, maybe everybody it probably isn't gonna read the case study, but maybe we can give an overview and kinda give an understanding of what of what you researched there.
Tineke Distelmans:Yeah. So, we're talking about a case study that, I developed together with some of my former colleagues at the at Lyric Business School. And so the case study is about predicting patient satisfaction in a hospital setting. Now long story short, the CFO wants to challenge the performance of the hospital. They know that patient centricity is key and everything, that they want to challenge their performance.
Tineke Distelmans:But also very important, they want to optimize their budget allocation. And so in order to do that, we, we developed a machine learning algorithm, let's say, that, predicts whether in the end, the patient will be or not satisfied in, or about the hospital.
Adam Larson:Wow. Wow. Well, did it work? Did it work? I guess that's jumping to the end of the case study.
Tineke Distelmans:It it did work, and it it gave actually some, some some very nice insights also for the budget allocation, because it's it's one thing to train the model, let's say, and and and therefore, we used, let's say, survey data that the hospital was collecting from their patients. So they started with, with putting iPads in in the rooms, and then, patients were asked, like, all kinds of questions related to different types of aspects in the hospital about the room, but also the security within the hospital, the food and the beverages that they got, the nurses, the doctors. So every little aspect was questions. And so we used that then to train the model and to predict whether in the end the patient was overall satisfied, yes or not. And so once we got that, model, once we got that algorithm, which was performing really well, we looked at feature importance.
Tineke Distelmans:Now what do we mean with feature importance? So we had all these input questions that we were looking at or that we were, using to predict whether a patient is satisfied or not. And if we look at feature importance, then we look at what is now or what are the most important features for the algorithm to make that prediction of whether the patient will be satisfied, yes or no. And if you know which features or which aspects within your hospital are driving patient satisfaction, are making that your patient will be satisfied in the end, then, of course, you can, you can adapt your your budget allocation in in line with that. For example, if it would come out that, the security in the hospital is one of the most important drivers for patient satisfaction, then you know that security within your hospital should be at all times at 100%.
Adam Larson:Wow. Wow. So you're able to take the data and make actionable insights. Now let's say somebody's listening to this, and they're like, oh oh my gosh. How do I can I do this in my own organization?
Adam Larson:Like, what what tips would you give them to try to say, hey. I wanna do my own study within my organization to make better better decisions.
Tineke Distelmans:Well, I think I mean, if if you wanna implement it it yourself, I I would always recommend to have, like, a very good understanding of of what these techniques are doing yourself. So getting very familiar with artificial intelligence, with machine learning. And I think, nowadays, you don't even need to have the coding skills to be able to do so. There are plenty of websites online where you can just play around and train your own model without the need to code because that's that's all done for you in the background. But like that, you get familiar with, like, the training process and everything.
Tineke Distelmans:And and, I mean, if you would like to to learn to code, let's say, there are plenty of opportunities of online courses that you can you can follow, for example, by Coursera or by DataCamp. And then it's a matter of just translating this into your own into your own setting, into your own profession, and and and how it can help you over there. But also nowadays, they're online. You find a lot of things, for every industry. You have a lot of blogs.
Tineke Distelmans:And, for example, the website towards data science, there is a platform where a lot of articles appear, but also very industry specific applications that appear, on there. But I think also nowadays, like, organizations that are overseeing certain industries, they're all concerned with this matter, and they're all publishing reports about it, like how machine learning can be used in that particular industry, what are the benefits, but but also what are the challenges, what are the difficulties maybe, what are the risks of implementing these things? Because these things are also very important to be aware about. But I think there are plenty of opportunities, let's say, and plenty of resources nowadays if people want to familiarize themselves with the with the techniques.
Adam Larson:Definitely. Well, with any technology, there's nothing's, gonna be a perfect solution. So maybe could you talk about some of those challenges or limitations you could should consider when trying to implement or trying to do your own type of study?
Tineke Distelmans:Yeah. I think there are, of course, multiple challenges and and limitations and trade offs that you need to make, but I think the very first important, thing to realize is, the thing that you need a lot of data. If you wanna train a good machine learning model, you need a lot of data. I explained it before. The more data you feed the algorithm, the better your algorithm will become.
Tineke Distelmans:But not only data, quantity matters, it's also data quality that matters, because there, we have this famous principle, what we call garbage in, garbage out. If you feed your algorithm with very low quality data, then you cannot expect your algorithm to perform well. And and so that's the first important thing is that it's not just these big volumes or big chunks of data that you need to arrive at a good algorithm, but at the same time, your data should also be of of sufficient, of sufficient quality, let's say. And I think the second important challenge is also with the trade off, let's say, that you need to make on, like, how complex do I want my algorithm to become. I mean, it's quite impressive what is possible nowadays and and how complex if you look at neural networks and the patterns that they can find within the data.
Tineke Distelmans:It gets super complex. It's it's way too complex for the human brain even to see these relationships between variables and so on. And it's super impressive. But at the same time, you lose some interpretability, let's say, of of your algorithm. Because even the people that are coding, let's say, or that are trading these algorithms do not really have a very clear understanding anymore of of how the algorithm is making that decision or is making that prediction.
Tineke Distelmans:And I think especially when when you're using, these algorithms, let's say, to drive your decision making, for example, within your own organization, I think it's at least, yeah, important to have to have some feeling about how the algorithm, let's say, is is is making its its decisions.
Adam Larson:It's like we still need the human intelligence side of artificial intelligence. You need both, especially when you're making strategic decisions. You can't, you know, like there's these science fiction science fiction novels out there where the whole society is run by a massive artificial intelligence. And in I don't think we want to go that way. We still want to have the human side of things.
Adam Larson:And so when you're looking at, you know, artificial intelligence machine learning, you know, some of the things that come up are things like biases or, you know, being ethical and responsible with the data. You know, what are some ways to avoid, getting into some of those, holes that we talked about?
Tineke Distelmans:Yeah. Yeah. I think that's that's indeed super important to to make sure that we use these type of things in an in an ethical and and and in a responsible way. But I think there are just some if if you would say, okay. Let's implement this within my organization, and and let's use this to drive our decision making and to help us, well, great.
Tineke Distelmans:But I think there are just a few important principles that you always need to adhere to, let's say. And the first one I would say is transparency. And this goes back to my my previous point, let's say, of of this model interpretability and and and complexity. I think whenever you're using these type of algorithms, for making decisions, I think at least the stakeholders, let's say, of the algorithm should have at least an an understanding and and get how and why certain decisions are made by the algorithm. Suppose that in in the banking industry, let's say, they will use an algorithm to determine whether a customer is creditworthy or not, and the algorithm at a certain point determines that a particular customer is not creditworthy, then I guess it's very important to have at least an an understanding of why the algorithm is arriving at this decision.
Tineke Distelmans:So this transparency around the model is, I think, very, very important. Now besides transparency, I would also say that fairness is, is super important. And this actually also goes back to to a point that I that I previously made about the garbage in, garbage out principle. If you use very poor quality data, then you will get poor results as well, and that holds as well for biases. If there are biases within your training data, then you will get a bias in your model and in your algorithm as well.
Tineke Distelmans:And I think that's also something that we wanna avoid at all times that we create biases in our algorithm against a certain gender, against a certain race, against certain age categories. So, therefore, it's super important to really look at at the quality of your input data and make sure that there are no, no biases in there, that at least in that way, we can guarantee that we can create a fair algorithm. And I think a third important, thing to make sure that we use it in in a responsible way, let's say, is also, data protection and and privacy. Because we're often using very sensitive information in in these type of algorithms. So making sure that the data is very properly protected, that the data is anonymized, making sure that nothing can leak, that nothing that there are no cyber threats or anything.
Tineke Distelmans:I think that's also super important, if we wanna or if we wanna use and implement these, these models, in in our decision making.
Adam Larson:It sounds like if if someone is looking to implement some sort of machine learning or AI within the organization, they have a lot of prep work to do based on what you're saying what you're saying.
Tineke Distelmans:Yeah. That's that's true. But I I would also say I mean, if I go back to to the case study that we developed, most of the work is actually in the preparation. Also, if if you just purely look at, like, training the model and everything, most of the work goes into the data preparation. The the the actual training part and developing part of your algorithm does not take that long.
Tineke Distelmans:It's all the preparation part that that takes up most time. And I think it's it's exactly the same whenever you start implementing that within your organization. There are a lot of things that you need to be, or that you need to think about, let's say, that you need to take into account. And, indeed, I would say that that the prep work is, is more work than actually implementing the algorithm.
Adam Larson:Oh, yeah. Because the machines can move a lot faster once we give them the data. They're just itching to have it.
Tineke Distelmans:Yeah. That's true.
Adam Larson:So when thinking about machine learning and AI, you know, are there things that within our daily jobs that we can say, hey, this will help improve my performance in doing things. And I know there's lots of technology. Every every 2 minutes you there's a dotaior.io, new website popping up with some new feature. You know, are there things that you've seen that work what really well?
Tineke Distelmans:Actually, yeah. And I I think it's it's quite impressive, like, particularly the very recent years, like generative AI, how it has evolved, but mainly how quickly this has evolved. But it's actually but as you said, there are quite some tools. I mean, I I almost have my virtual assistant on my laptop that can summarize email conversations for me. It can then generate or write an email for me or do at least a suggestion of a reply that that I can give.
Tineke Distelmans:It can help me to schedule meetings, but also just in generally for for writing. I think there are a lot of, tools based on AI where it can really work, let's say, and help you, yeah, for spelling checks and and grammar checks and everything when it's a very important text that that you're writing. But not just text, also PowerPoint slides and visualizations. And I think it's just these these small things here and there maybe. But if you would add that up, then it would make your life a little bit easier, let's say, and and and your job a little bit more fun because these are often, like, the tinier tasks, but that sometimes can take up a little bit more time than expected, which can be a little bit annoying from time to time.
Tineke Distelmans:And there, I think it can really support you and and and help you in a little in being a little bit more efficient, actually. Actually.
Adam Larson:Well, I I think that's some great recommendations. And if you're not using things, please get out there and try these new tools out. A lot of them are free, at least, to start with. And you you can really, you know, you can really do a lot, and and it can help make things easier for you.
Tineke Distelmans:Yeah. Exactly. And it's just very fun to to play around with it. And and even though you're familiar already and and you have the coding skills, I even from time to time also just check out some websites, and I then bump into, like, hey. Here, you can train your own algorithm.
Tineke Distelmans:And then I'm just playing around with it. And then half an hour later, I'm realizing, okay. I've been just playing around with it for, like, 30 minutes. Mhmm. But it's just super fun to do.
Tineke Distelmans:And and, yeah, it it makes yourself so familiar with with how it works, but it's also, yeah, quite impressive to see, let's say, nowadays, what what is possible and and how it can support us.
Adam Larson:Well, Tanike, thank you so much for coming on the podcast. This has been a great conversation, and I encourage everybody to check out Tanike on LinkedIn and connect with her. And, we just, thanks so much for coming on.
Tineke Distelmans:Thank you for the invitation.
Announcer:This has been Count Me In, IMA's podcast providing you with the latest perspectives of thought leaders from the accounting and finance profession. If you like what you heard and you'd like to be counted in for more relevant accounting and finance education, visit IMA's website at www.ima net.org.