- This role contributes to the construction of the development data fabric and data strategy.
- This role will interact with architects, engineers, data modelers, and product owners, as well as other team members in Clinical Solutions and R&D.
- This role will actively participate in creating technical solutions, designs, and implementations, and in the relentless improvement of R&D Tech systems in alignment with agile and DevOps principles.
- The Data Engineer demonstrates both depth and breadth across key data engineering competencies, e.g. Software Development, Testing, DevOps, Data Science/Analytics, and Cloud, and can collaborate with experts from other subject domains.
- Primary responsibilities include using Azure cloud services and GSK data platform tools to ingest, egress, and transform data from multiple sources.
- In addition, the role will demonstrate core engineering knowledge/experience of industry technologies, practices, and frameworks such as data fabric and scaling data platforms, containerization, cloud-based platforms, data analytics, machine learning, and data streaming.
- Examples of technologies include Java/C#/Python, Denodo, Git, Azure DevOps, Databricks, Presto, Spark, Azure Data Factory, ADLS Gen2, Kafka, Selenium, JUnit/NUnit, SAFe, Kanban, Docker, AI/ML, and Azure/GCP Cloud Architecture, including networking principles and scaling applications.
The Data Engineer, Clinical Solutions role is a senior technical role and will provide you the opportunity to lead key activities to progress your career.
These responsibilities include the following:
- Working with other teams that are defining DevOps and data platform practices to meet the requirements of Clinical Solutions.
- Supporting engineering teams in the adoption and creation of data fabric best practices.
- Conducting PoCs of new technologies and helping to embed them in product teams
- Being part of a cutting-edge team creating the Development Data Fabric
- Ensuring that technical delivery is fully compliant with GSK Security, Quality and Regulatory standards
- Ensuring use of relevant R&D Tech / central services and collaborating with service partners to identify and deliver service improvements
- Maintaining best practices for engineering and architecture on our Confluence site. This requires hands-on experience with cutting-edge technology.
- Proactively engaging in experimentation and innovation to drive relentless improvement
- Providing leadership, technical direction and GSK expertise to architecture and engineering teams composed of GSK FTEs, strategic partners and software vendors.
We are looking for professionals with these required skills to achieve our goals:
A total of 15+ years of experience, proficiency in at least 3 of the skills below, and demonstrable knowledge and value, backed by relevant experience, across the following competencies:
- Must have experience in Spark, Python and Databricks
- Software development, architecture design & technology platforms/frameworks
- Data Platforms and Domain-driven design
- Agile, DevOps & Automation (of testing, build, deployment, CI/CD, etc.)
- Data science (e.g. AI/ML), data analytics & data quality/integrity
- Testing strategies & frameworks
Role requires:
- Demonstrated skill in delivering high-quality engineered data products
- Knowledge of industry standards and technology platforms aligned to GSK and R&D roadmaps
- Excellent communication, negotiation, influencing and stakeholder management skills
- Customer focus and excellent problem-solving skills
- Bachelor's degree in Computer Science or a related field; an MS in Computer Science is preferred
- Familiarity with and use of various open-source ecosystems, including JavaScript, big data, Java, Python, etc.
- Good understanding of various software paradigms: domain-driven, procedural, data-driven, object-oriented, functional
- Familiarity with .NET Core (C#), Java, Python
- Demonstrable knowledge depth in more than one area of software engineering and technology