The road so far
This is the second post regarding data analysis, focused on goal setting.
Figure 1: Data Analysis main steps.
So far, so good
So, you have a question. It can be a business or a technical problem, does not matter as long as you can identify it. Actually, here is the 80% of the problem: you need to state the problem very clearly and without ambiguities, if you really want to solve it.
A list of questions can help you in finding the way:
- What do I want to accomplish?
- For what reasons?
Up to here, we are in a “qualitative area”. These questions are fundamental and are the basis for the building, but you need to add the roof: that is the “quantitative area” questions:
- Can I state it as a numerical question?
- How can I calculate it? Do we already have a function, a KPI, a variable addressing the numerical question? If we do not have it, can we design it? Also, you can change it while going on. The point is to try to define it (at some time, you will also succeed ?!).
- What is the accuracy and/or the tolerance I envisage for that numerical result? Can I use standard statistics such as relative errors, absolute errors, RMSE, NMAE… Also, you could be the first to try to assess the accuracy, or that, at first, you do not have a clear idea on how to assess it. That’s not the point, the focus is to keep it in mind (and have it ready) by the end of the activity: if your result turns out to be 5 with an error of 200%, you should not feel very proud of your approach…
- Once I have the final result, how can I validate it? Do I have some hints looking at literature? Do we have in our team similar activities so to compare results?
- Do I have the technical tools to calculate it?
- Do I have the time to meet expectations? Deadlines are always coming!
- Can I ask for a double-check to someone I trust? (please, do not re-invent the wheel: they already did it some time ago!). If possible, ask someone you trust to double-check even your initial assumptions, not to fail just at the beginning.
Let’s give some examples on solar activities
- basic example: calculate your plant performance using the PR
- “advanced” example: power forecast of your plant
Figure 2: example 1: do I improve my plant performance?
Figure 3: example 1: do I forecast my plant production?
If you want me to use some fancy words, I can say that we are essentially rephrasing the SMART goal setting approach (specific, measurable, agreed, realistic, time-phased) in the data science context.
Now, you are ready to go: what is your question…? Do you already have a KPI answering the question…? How do you calculate it?
At i-EM S.r.l., we think that a long journey starts from a single and smart step; also, we know that the devil is in the details, and a good procedure can help to take them into account. Try us!