DS Assignment (1)
DS Assignment (1)
DS Assignment (1)
Your task is to write a small Python or R script that predicts the engine rating based on the
inspection parameters using only the provided dataset. You need to find all the cases/outliers where
the rating has been given incorrectly as compared to current condition of the engine.
This task is designed to test your Python or R ability, your knowledge of Data Science techniques,
your ability to find trends, outliers, relative importance of variables with deviation in target variable
and your ability to work effectively, efficiently and independently within a commercial setting.
This task is designed as well to test your hyper-tuning abilities or lateral thinking.
Deliverables:
· One Python or R script
· One requirements text file including an exhaustive list of packages and version numbers used
in your solution
· Summary of your insights
· List of cases which are outliers/incorrectly rated as high or low and it should be backed with
analysis/reasons.
· model object files for reproducibility.
1. Briefly describe your approach to this problem and the steps you took
2. Basics:
c. What stats did you use to prove its predictive performance and why?
e. What insights did you obtain from this data? For example: What features are
important? Why? What visualizations help you understand the data?
3. Next steps:
b. What are some other things you would have done if you had more time?
Please send in all your code, model (jar, pickle, etc) and a documentation answering the above
questions.