Part 5: The Fourth Step of Data Analysis: Data Pre-processing