Overview:
Data Engineer - AWS
India Remote/Ahmedabad/Bengaluru/New Delhi
Emmes Group: Building a better future for us all.
Emmes Group is transforming the future of clinical research, bringing the promise of new medical discovery closer within reach for patients. Emmes Group was founded as Emmes more than 47 years ago, becoming one of the primary clinical research providers to the US government before expanding into public-private partnerships and commercial biopharma. Emmes has built industry-leading capabilities in cell and gene therapy, vaccines and infectious diseases, ophthalmology, rare diseases, and neuroscience.
We believe the work we do will have a direct impact on patients' lives and act accordingly. We strive to build a collaborative culture at the intersection of being a performance-driven and people-driven company. We're looking for talented professionals eager to help advance clinical research as we work to embed innovation into the fabric of our company. If you share our motivations and passion for research, come join us!
Primary Purpose
We are seeking a talented Data Engineer to join our team. The ideal candidate will have a strong background in designing, building, and optimizing data pipelines and infrastructure to support our data analytics and machine learning initiatives.
Responsibilities:
Design, develop, and maintain scalable and efficient data pipelines to process large volumes of structured and unstructured data from various sources.
Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and translate them into technical solutions.
Implement data integration and ETL processes to extract, transform, and load data into data warehouses, data lakes, and other storage systems.
Optimize data pipeline performance for speed, reliability, and cost-effectiveness, leveraging technologies such as Apache Spark, Apache Flink, and AWS Glue.
Ensure data quality and integrity throughout the data lifecycle, implementing data validation and monitoring processes.
Design and maintain data models, schemas, and metadata to support data analysis and reporting requirements.
Stay up to date with emerging technologies and best practices in data engineering, recommending and implementing innovative solutions.
Collaborate with cross-functional teams to drive data-driven decision-making and deliver actionable insights.
Qualifications:
BE, B-Tech, BCA, MCA, M-Tech, or Bachelor's degree in Computer Science, Information Technology, or a related field.
Minimum 5 years of experience as a Data Engineer or in a similar role, with a focus on building and optimizing data pipelines and infrastructure.
Strong programming skills in languages such as Python, Scala, or Java.
Experience with big data technologies such as Hadoop, Spark, Kafka, and HDFS.
Proficiency in SQL and relational databases (e.g., PostgreSQL, MySQL) as well as NoSQL databases (e.g., MongoDB, Cassandra).
Familiarity with cloud platforms such as AWS, Azure, or Google Cloud Platform.
Excellent problem-solving skills and attention to detail.
Strong communication and collaboration skills, with the ability to work effectively in a team environment.
Experience with data visualization tools (e.g., Tableau, Power BI) is a plus.
Knowledge of machine learning concepts and techniques is a plus.
CONNECT WITH US!
Follow us on Twitter - @EmmesCRO
Find us on LinkedIn - Emmes