Job Description
Job Responsibilities
Architect a data-based solution for the business problem presented. Develop contextual automotive domain knowledge around the problem. Collaborate with the domain knowledge expert if required.
Understand the IoT device (Telematics Control Unit, TCU) of the vehicle. Develop an understanding of the time-series data originating from the TCU.
Understand the data landscape and establish data adequacy beforehand.
Clean up and prepare datasets for modeling; get involved in the ETL process if required. Apply data transformation techniques such as resampling, filtering, and encoding.
Perform exploratory data analysis and extract insights. Present descriptive statistics and insights to the domain experts. Find meaningful patterns in the data, detect seasonality and trend, and establish cause-and-effect relationships. Develop and test hypotheses in collaboration with the domain experts.
Design and shortlist features, study feature importance, and decide the ML strategy.
Data modelling: select an appropriate machine learning / deep learning model, set up the data pipeline for model training, and perform hyper-parameter tuning, validation, and testing. Apply ensemble model techniques if required.
Reporting & Visualization: Prepare reports and visualize data in the form of plots. Create heat maps using the Google Maps API.
Mentor junior team members and lead a team of data scientists and analysts.
Essential
Technical Skills / Experience
Must have a minimum of 3 years of industry experience in developing data science models.
Experience using machine learning algorithms (e.g. Generalized Linear Models, Boosting, Decision Trees, Neural Networks, SVM, Bayesian Methods, time series models, etc.)
Hands-on experience in using machine learning models for regression and classification problems.
Hands-on experience in unsupervised machine learning algorithms. Working knowledge of clustering techniques such as centroid-based clustering.
Working experience with Cloud Computing Platforms such as AWS / IBM
Strong programming skills in Python are a must. Working experience with pandas, NumPy, Matplotlib, and scikit-learn (sklearn).
At least 3 years of industry experience in one or more full-time Data Science/ML roles
Knowledge of data visualization techniques and tools.
Desirable
Experience in Spark or other distributed computing frameworks.
Understanding of AWS SageMaker / Google AutoML / IBM AutoAI
Experience in time-series/IoT data analytics, e.g. data streamed from a vehicle's on-board IoT device in time-series format, which may comprise GPS data and readings from on-board sensors of the vehicle powertrain, body, and chassis systems.
Exposure to automotive systems, automobile basics, Controller Area Network protocol (CAN protocol) etc.
Experience working with remote team members
Experience with data visualization tools such as Tableau, Power BI, etc.
Working experience with Google Maps API
Experience or academic knowledge of Market Research data analysis
Educational Qualification
Essential: B.E. / B.Tech. / M.Tech. / MCA / B.Sc. / M.Sc.
Desirable: College graduate in Statistics
Specialization in data science or statistics
Electronics / Computer Science or relevant stream