Job Title - AI Engineer - (Artificial Intelligence) - Model Development & Research
Job Location - Pune, Maharashtra (offers Remote friendly work option)
Company Description
Our client is a product engineering company working with innovative startups and enterprises. They have provided core product development for 110+ startups across the globe building products in the cloud-native, data engineering, B2B SaaS & Machine Learning space. Their team of 400+ elite software & DevOps engineers solve hard technical problems while transforming customer ideas into successful products.
Skills Required -
- 5+ years of hands on experience in AI model development and deployment with a focus on edge computing and local LLM inference.
- Strong programming skills in languages such as Python and C++.
- Proficiency in LLM frameworks (e.g. vLLM, text generation inference, Open LLM, Ray Serve, and HuggingFace transformers) and deep learning libraries.
- Extensive experience with GPU/TPU acceleration for AI inference, including optimization techniques (tensor, pipeline, data, shared data parallelism) and performance tuning.
- Hands on experience with one or more GPU frameworks: CUDA, Vulkan, OpenCL
- Deep knowledge of GPU memory layout, familiarity with Nvidia Jatison, ARM Mali or relevant SoC configurations
- Knowledge of Parallel computation, memory scheduling, and structural optimization
- Excellent problem-solving and analytical skills, with a passion for innovation and continuous learning.
- Bachelors Degree in Computer Science, Engineering, or a related field; Master's degree preferred