AI/ML Engineer

PloPdo

Early Applicant

4 months ago
Be among the first 50 applicants

Exp: 4-6 Years

Full time

Bengaluru / Bangalore, India

Job Description

We are seeking a talented and experienced AI/ML Engineer with domain-specific

knowledge in multilingual audio signal and text processing, coupled with expertise in

data engineering and data processing pipelines. The ideal candidate will have a

strong background in artificial intelligence (AI) and machine learning (ML), with a

focus on audio signal processing, natural language processing (NLP), and

multilingual applications. This role requires proficiency in designing and

implementing robust data pipelines to handle large-scale multilingual datasets

effectively.

Responsibilities:

Design and develop AI/ML algorithms and models for multilingual audio signal and text processing tasks, including speech recognition, language identification, translation, sentiment analysis, and text-to-speech synthesis.
Implement advanced signal processing techniques for noise reduction, feature extraction, speaker diarization, and other audio preprocessing tasks to enhance the quality of audio data.
Develop NLP models and techniques for text preprocessing, tokenization, language detection, named entity recognition, and semantic analysis across multiple languages.
Design and implement scalable data engineering and processing pipelines to ingest, preprocess, and transform large-scale multilingual datasets for AI/ML model training and evaluation.
Collaborate with cross-functional teams, including data scientists, software engineers, and domain experts, to define project requirements, data schemas, and performance metrics for AI/ML models.
Optimize data pipelines for efficiency, scalability, and reliability, leveraging parallel processing, distributed computing, and cloud-based technologies as necessary.
Conduct exploratory data analysis (EDA) and feature engineering to extract relevant insights and patterns from multilingual audio and text data.
Evaluate and benchmark AI/ML models using appropriate metrics and evaluation methodologies, iterating on model design and hyperparameters to improve performance.
Document research findings, experimental results, and software development processes to contribute to project documentation and knowledge sharing within the team.
Stay updated with the latest advancements in AI/ML, signal processing, NLP, and data engineering through self-study, literature review, and participation in conferences and workshops.

Requirements:

Bachelor's or Master's degree in Computer Science, Electrical Engineering,
Linguistics, or a related field with a focus on AI/ML, signal processing, or NLP.
4+ years of industry experience in AI/ML engineering, with a proven track record of designing and deploying AI/ML solutions in real-world applications.
Expertise in multilingual audio signal processing techniques, including noise reduction, feature extraction, speaker diarization, and audio preprocessing.
Strong proficiency in NLP techniques and algorithms for text preprocessing, tokenization, language detection, named entity recognition, and semantic analysis.
Experience in designing and building scalable data engineering and processing pipelines using tools and frameworks such as Apache Spark, Apache Beam, TensorFlow Extended (TFX), or similar.
Proficiency in programming languages such as Python FastAPI and experience with relevant libraries and frameworks for AI/ML development (e.g., TensorFlow, PyTorch, scikit-learn).
Solid understanding of distributed computing principles and experience with cloud-based platforms such as AWS, Azure, or Google Cloud Platform (GCP).
Strong analytical and problem-solving skills, with the ability to analyze complex multilingual audio and text datasets and derive actionable insights.
Excellent communication skills and the ability to collaborate effectively withcross-functional teams to drive project success.
A passion for innovation and a drive to push the boundaries of AI/ML technology in multilingual audio and text processing applications.