Managing Machine Learning Projects

Simon Thompson

Another great question!
You need to do some hard yards :

Undertake organizational analysis
Understand the data as best you can
Understand the system architecture
Understand the application
Understand the ethics and security/privacy implications
Understand the path to production
By getting this knowledge you get yourself into a position where you can make sensible judgements about what the “real” requirement of the project are. Of course this is very hard to do, so commonly people run lighthouse/pathfinder projects and also do a “sprint 0” to try to get a better insight into what the actual blockers and path to value is.
Having said that an ML project is a journey of discovery - my view is that the whole process is about finding and eliminating risk - right up until the last pull request is signed off and the code is all pushed into prod.
Actually - quite a long time after that!!!!

Muhammad Awon

Hi Simon Thompson,
Thank you for sharing the book presentation and allowing us to learn from your invaluable knowledge and experience. Your presentation gives a holistic perspective on how to think about machine learning projects.
My question is, what would you advise a sole engineer who has minimal experience with real-world projects, but he is asked by the stakeholder to evaluate the initial cost of the project, from developing the model to put into production (at least for the first iteration)? What are the determining factors one should consider to evaluate the initial cost?
Thanks for giving us this opportunity!!

Simon Thompson

Hello Muhammad,
That’s a really tough ask, but the questions I would ask are:

is it small enough that you can do it yourself in a few weeks? In this case I would recommend doubling the amount of time that you think it will take you and offering that as an estimate because for sure you will discover things are more complex than you think.
If it’s something that will take you more than 20 days of work, I would suggest that this is an indicator that it’s a lot bigger than that!
Some factors to consider:
◦ is the data simple or complex - if it’s complex and hard to understand then double your estimate
◦ is the data small or big - if it’s big then… double your estimate
◦ is the processing/compute platform required complex/distributed… if it’s complex then (you guessed it) double your estimate
◦ is the data clean?
◦ are there ethical / security/ privacy issues
◦ how many models will need to be developed
◦ unstructured data?
◦ is evaluation complicated?
◦ what sort of model documentation is required…

Simon Thompson

hope that helps?

Muhammad Awon

That’s a big help. Thank you!

Simon Thompson

Another thought - if you are able to split the project into tasks then than helps the estimation alot, and it can give stakeholders insight as to how you came up with your estimate. I understand that that’s very difficult if you are inexperienced

Muhammad Awon

Indeed, it is going to challenging but getting answers from you is a big step forward to lay down the initial plan.

Muhammad Awon

One last thing, how can I maintain good relationship with the stakeholder so to speak? Just in general.

Simon Thompson

First try to understand the other person - what do they want, why do they want it? What sort of standards or behaviours can you expect from them - ask some other people who work with them about what their hot buttons are and what they learned about working with them.
Second, respect their time. Prepare for your first meetings, treat their time as precious to you - use it well.
Third be positive and clear - don’t be negative, don’t be evasive. If you don’t understand something say so, try to get misunderstandings out of the way, try to clarify things at the beginning.
Win their trust - you can’t be bringing failure at the beginning, but be honest and open. Concealed problems are your problem. Shared issues at the beginning are everyone’s problem.
The cliche is underpromise, over deliver. It’s a good mantra. Don’t promise things you can’t give - if you bend to the pressure things will get worse and worse until something snaps. Honesty and clarity are your friend. Show your workings, show early results, get feedback early.
Good luck!

Muhammad Awon

Thank you for your time. I really appreciate!!

Dr Abdulrahman Baqais

Hi Simon Thompson. Thank you for being here.
Do you believe that deep learning projects should be handled differently than traditional ML projects?
Probably In NLP or computer vision due to labeling, annotation reviwer and required infrastructure.
Also do you recommend assigning managing the project to a manager rather than a technical. Probably for his soft skills in leadership, conflicts handling, non-perfectiones mentality.

Simon Thompson

Hello Abdulrahman,
there are different types of deep learning projects. Some are not that different because they are dealing with relatively manageable data resources, and managable training resources. To be honest training a model using an isolation forest using data that’s squeezed out of a spark query that takes 5hrs to run isn’t that different from running a model on an A100… However, once we get into big scale training with a bunch of GPUs and training runs lasting days then things are very different and a very different management problem is on the table.
Deep learning projects do have a big issue about artefact management and tracking in comparison to statistical ML projects as well. Managing transfer models, training regimes, hyper parameters and evaluation results can become non-trivial. It’s easy to get into a place where you have a model but can’t rebuild it or reproduce it. This is a bad place.
You are right about handling training data resources and acquiring training data as well. Work on vision and text projects now includes creating data augmentations as well as the labelling and annotation steps - all of this has to be accounted for and executed.

Simon Thompson

> Also do you recommend assigning managing the project to a manager rather than a technical. Probably for his soft skills in leadership, conflicts handling, non-perfectiones mentality.
Ideally you need a technical person to lead the project; project led by people who don’t understand them can come apart really suddenly. However, project managers and people manager are really, really valuable and wise leaders will lean on their skills to organise and support the team.
Send your technical leaders on a people management course and a project management course - “T” shaped people are really valuable!

Simon Thompson

Thanks to everyone for the questions. Please feel free to email me at <mailto:simon.2.thompson@gmail.com

simon.2.thompson@gmail.com> if you want to talk further. Or follow @AiSimonThompson on Twitter

Rohit

HI Simon Thompson, one of the challenges we face is how we can deploy and monitor our model after the model is deployed into on-prem of our customer or their own cloud platform which is different from ours. How can we tackle this?

Simon Thompson

I think that you need to make sure that appropriate provisions and arrangements are in the contract (and if they aren’t then you aren’t doing it!) and that they are paid for as part of the bill of work… If you are liable for looking after it you need to be able to look after it!

Alexey Grigorev

Is there a better methodology for managing ML projects then CRISP-DM?

Simon Thompson

At the top level the CRISP-DM process covers everything - apart from in life management probably! But I think things have moved on a lot since CRISP-DM came along. I think in particular the tooling that teams need to manage the development of large complex models consisting of many artefacts and developed with very large complex data sets using many transforms needs to be emphasised. I also think that we now live in a world where our models get evaluated online and not just with a few statistical measures. Finally - ethics… you’ve got to think about the ethics all the way along, and that carrys with it the need to support reproducability and accountability. So - it’s not that there is anything wrong with CRISP-DM, it’s just that there are concerns in each part of the process that need to be addressed in a very different way from the way we used to work in the 1990’s!

Tim Becker

Hi Simon Thompson, thank you for being here and answering our questions. I have a couple of questions I would like to ask you:

What are the main considerations when resourcing a team for a ML project?
What are the main difference between a ML project and a software development project that does not include ML?
How to best estimate the time that is needed for monitoring and troubleshooting after the model has been put in production? How can this time be minimised?

Simon Thompson

> What are the main considerations when resourcing a team for a ML project?
For me I think you need to think about this :

Simon Thompson

So you need people who can do all of these tasks - but different project require different balances according to the challenges you see in front of you. If the data infrastructure is tricky you will need more data engineering. If the modelling is going to be demanding you need more data scientists - and so on.

Simon Thompson

Also a blend of people is important - think about how people will work together. The team must become more than the sum of the individuals.

Simon Thompson

> What are the main difference between a ML project and a software development project that does not include ML?
I think that the main thing is managing and understanding the models and the modelling process. You have huge uncertainties in the fact that model building with ML is a discovery process. You have to have a scaffold for that, and then 100’s of models come out of the process which you have to handle.

Simon Thompson

> How to best estimate the time that is needed for monitoring and troubleshooting after the model has been put in production? How can this time be minimised?
This is a great question - I have had models that have worked for years and then exploded over night. I think that they important thing is to build a capability that allows them to be looked after for as long as they will be needed and to make sure that there are appropriate commercial arrangements in place to cover that. For instance in the telecoms industry it’s quite common for experts in a particular technology to be on on a long term retainer which is called on when something goes wrong. The expert does no work whatsoever normally - but if there is an incident then they are obliged to come and help immediately (and are handsomly compensated as well). I saw a guy get $10k for 20 minutes of work because of this once… but he saved us much much much much more!

Tim Becker

thank you!

Simon Thompson

Hi there

Simon Thompson

Thank you to everyone for writing questions so far - I hope the answers are interesting and I will check back tomorrow ! Apologies for not checking the discussion until now but WORK IS CRAZY at the moment!

Simon Thompson

If anyone would like to listen to a presentation about the book there is one online here : https://www.brighttalk.com/webcast/9059/553113?utm_source=brighttalk-portal&utm_medium[…]m=search-result-1&utm_campaign=webcasts-search-results-feed

Alexey Grigorev

> Understanding an ML project’s requirements
Can you summarize the process of doing it? Let’s say I want to start a new project - what are the steps I need to finish to understand how to approach it?

Surnjani Djoko

Any suggestions how to link the ML performance measurements to the business impact when the stakeholders can’t provide KPI?

Simon Thompson

I think you have to understand the priorities of your business stakeholders - if they are interested in revenue then your system needs to impact revenue.
Typical things to try to impact :

revenue
cost saving
cycle time (time to get things done)
people intensity (free up people to do stuff)
customer experience

Simon Thompson

If you find what the stakeholders want to improve and figure out the mechanism that your system will use to deliver that then performance measures can be found quite directly (often)

Vladimir Finkelshtein

ML projects are well-known for the iterations and feedback loops. While iterating over the training data or model is understandable, how to avoid iterating over the infrastructure setup?

Simon Thompson

I think that you should accept that when /if the team find that they need to improve their infrastructure that will need to be done. It’s only justifiable if the improvement is going to enable the team to move faster and do more - but if it is going to do that then you will find that it’s an investment that really pays.

Vladimir Finkelshtein

Managing projects usually involves planning/estimating execution times. Is there a reason that your book doesn’t include a chapter on those?

Simon Thompson

…Simon looks in book TOC…

Simon Thompson

So - there isn’t an entire chapter - but in chapter 3 there is a long section about estimation. I agree it’s a very important and hard issue that’s got to be worked through. Many teams and people see it as a bad thing to do now as it’s linked to waterfalls and people want to work in an agile way - but I think a good estimate is a really important way to prevent waterfall thinking because it can give you the time and resources to enable an agile team to become really productive and effective and to start delivering. Once that happens your stakeholders become confident and start to give you the space and time to really do good work.

luckylittle

Hi Simon Thompson, thanks for coming. Q: How did you enjoy the process of writing this book with Manning Publications?

Simon Thompson

Manning were and are brilliant. I can’t recommend them highly enough. They provided lots of support and advice and I found the reviewing process that they offered incredibly good. I got 12 anonymous reviewers to read the book and make comments - and that really changed the book because… well they had some great points.
I have to say though that “enjoy” is not the right way to describe what it’s like to write a book like this - more like cope!

luckylittle

Second question I have Simon Thompson - I see a lot of good topics and I am particularly interested in 3.2.1 Time and Effort estimates . Q: How is estimating time & effort different in ML project different from traditional software projects? I find any estimation to be quite difficult with so many unknowns.

Simon Thompson

I think you have 1/2 the picture - there are a lot of unknowns in typical ML projects. For example typically you won’t have the full picture about the data you are working with at the beginning of the project. More importantly though the properties of the models that you can build using the data you get will only become clear when you build those models…. and that has some big implications for how you are going to structure your solution

Hareesh

Q: What should SME concentrate on (people - working in-house for the company OR technology - managed services via external contracts ?) to make most out of a ML project initiative ?

Simon Thompson

I am not sure what you mean… do you mean SME: Subject Matter Expert. or SME : Small Medium Enterprise?
I think you mean small/medium enterprise. If so I think that a small company will typically not have the resources to sustain an in house ML team and so I think that long term investment for the company should go into the companies core operations and concerns, and then something like an ML project should be bought from an external supplier. I think that an external supplier should be able to offer a fixed price whereas developing the skills and capabilities internally might look cheaper on paper - but in reality it might turn out to run out of control. Neither the in house or external team are going to be able to eliminate the project risk…

Hareesh

Thank you for the response Simon Thompson.
SME - Small Medium Enterprise. Sorry for not mentioning that in my question.

Simon Thompson

No problem Hareesh! Did my answer make sense to you?

Hareesh

Ohh.. yes!!👍 Your reply makes complete sense.
ML Projects that SMEs envision should add business value (irrespective if they avoid project risk ). There is not point managing a ML project without the business aspect. Having said that, i would argue that having 1-2 Data Specialist internally will help SMEs better manage (both technical, financials and risk related ) their ML projects handled by external providers.

Simon Thompson

I think you are right - SME’s can use a Data Specialist to run their ops with or without an ML project going on. Someone who understand where the data is buried (!) and how it’s used in the business will be valuable all the time - but especially if an ML team is engaged to do a project. Being a smart buyer of services requires some insight into the services and what’s involved in delivering them.
I just think ML expertise is too heavy a burden for a normal SME to carry.

DataTalks.Club

Managing Machine Learning Projects

by Simon Thompson

The book of the week from 10 Oct 2022 to 14 Oct 2022

Questions and Answers