We areseeking a seasoned and proficient Senior Python Data Engineer with substantialexperience in cloud technologies.
As a pivotal member of our data engineeringteam, you will play a crucial role in designing, implementing, and optimizingdata pipelines, ensuring seamless integration with cloud platforms.
The idealcandidate will possess a strong command of Python, data engineering principles,and a proven track record of successful implementation of scalable solutions incloud environments.
Responsibilities:
1. Data Pipeline Development:
Design, develop, and maintain scalable and efficient data pipelinesusing Python and cloud-based technologies.
Implement Extract, Transform, Load (ETL) processes to seamlessly movedata from diverse sources into our cloud-based data warehouse.
2. Cloud Integration:
Utilize cloud platforms (e.g., Google Cloud, AWS, Azure) to deploy,manage, and optimize data engineering solutions.
Leverage cloud-native services for storage, processing, and analysis oflarge datasets.
3. Data Modelling and Architecture:
Collaborate with data scientists, analysts, and other stakeholders todesign effective data models that align with business requirements.
Ensure the scalability, reliability, and performance of the overall datainfrastructure on cloud platforms.
4. Optimization and Performance:
Continuously optimize data processes for improved performance,scalability, and cost-effectiveness in a cloud environment.
Monitor and troubleshoot issues, ensuring timely resolution and minimalimpact on data availability.
5. Quality Assurance:
Implement data quality checks and validation processes to ensure theaccuracy and completeness of data in the cloud-based data warehouse.
Collaborate with cross-functional teams to identify and address dataquality issues.
6. Collaboration and Communication:
Work closely with data scientists, analysts, and other teams tounderstand data requirements and provide technical support.
Collaborate with other engineering teams to seamlessly integrate dataengineering solutions into larger cloud-based systems.
7. Documentation:
Create and maintain comprehensive documentation for data engineeringprocesses, cloud architecture, and pipelines.
Technical Skills:
1. Programming Languages: Proficiencyin Python for data engineering tasks, scripting, and automation.
2. Data Engineering Technologies:
Extensive experience with data engineering frameworks like distributeddata processing.
Understanding and hands-on experience with workflow management toolslike Apache Airflow.