We are looking for an AssociateLead Software Engineer to join our Content Tech Big Data Engineering Team in India. This is an amazing opportunity to work on Real World Data using big data technologies.
nWe would love to speak with you if you have skills in Python, Spark and have experience on building big data platforms.
nAbout You experience, education, skills, and accomplishments
- nBachelors Degree or equivalent in computer science, software engineering, or a related field
- nMinimum 4 years of relevant experience.
- nGood experience working with Python, PySpark, AWS, AWS Glue, EMR and Delta Lake.
- nGood knowledge of ETL, including the ability to read and write efficient, robust code, follow or implement best practices and coding standards, design/implement common ETL strategies (CDC, SCD, etc.), and create reusable/maintainable jobs.
- nSolid background in database systems (such as Postgres, Oracle, Snowflake/Databricks) along with strong knowledge of PL/SQL and SQL.
- nExperience in handling large volume of data and building data pipelines.
nIt would be great if you also had . . .
- nFamiliarity with Airflow, Snowflake, Databricks would be added advantage.
- nExperience in building big data platforms.
- nUnderstanding on healthcare data.
- nKnowledge of Agile/other SDLC methodologies.
nWhat will you be doing in this role
nAs a member of Data Engineering Team, youll Step into a key role on an expanding data engineering team to build our data platforms, data pipelines, and data transformation capabilities.
- nDefine and implement our data platform strategy on Cloud, have a meaningful impact on our customers, and working in our high energy, innovative, fast-paced Agile culture.
- nDrive rapid prototyping and development with Product and Technical teams in building and scaling high-value medical data capabilities.
- nInterface with other technology teams to extract, transform, and load data from a wide variety of data sources using Apache suite (airflow, spark), SQL, Python, ETL, and AWS big data technologies.
- nCreation and support of batch and real-time data pipelines and ongoing data monitoring and validation built on AWS/Snowflake/Apache technologies for medical data from many different sources.
- nConduct functional and non-functional testing, writing test scenarios and test scripts.
- nEvaluate existing applications to update and add new features to meet business requirements.
nProduct you will be developing
nBig Data Platforms
nAbout the Team
nThe stakeholders for the role are Analytics team, Application Teams on the business side, Enterprise Solutions Teams and other Cross-Functional Internal IT Teams, External Vendors and Partners. The team consists of 20+ engineers and are reporting to the Director of technology.
nHours of Work
- nFulltime
- n45 hrs/week
- nHybrid working model
nAt Clarivate, we are committed to providing equal employment opportunities for all persons with respect to hiring, compensation, promotion, training, and other terms, conditions, and privileges of employment. We comply with applicable laws and regulations governing non-discrimination in all locations.