About Us:
We're passionate about connecting top-tier remote developers with handpicked Silicon Valley companies. Our goal is to provide developers with competitive and stable income and access to the best companies in the industry, all while offering them a flexible work environment and a host of amazing benefits. We're proud to work with trusted partners such as Webflow, IMX, Deel, Immutable, O'Gara, LegalSoft, and more.
What we offer:
- Lots of PTO
- Monthly analysis for raises
- Healthcare insurance
- Wi-Fi costs covered
- Unlimited Udemy courses (and books)
- and much more
About the Role: We are looking for an experienced Senior Large Language Model (LLM) Engineer to join our team. You will be responsible for developing, deploying, and optimizing large language models in production environments. This role is ideal for someone with a strong background in both software engineering and machine learning, who thrives on building scalable solutions that deliver real-world impact.
Key Responsibilities:
- Lead the design and development of large language models for diverse applications.
- Optimize and scale GenAI applications to ensure high performance under heavy user loads.
- Develop advanced strategies for effective LLM utilization, including prompt engineering, context-based techniques, and RAG-based (Retrieval-Augmented Generation) bot development.
- Establish and manage end-to-end data pipelines, incorporating vector databases, caching layers, and embeddings.
- Work within containerized environments, utilizing Kubernetes to create efficient, scalable API endpoints.
Required Qualifications:
- 4-8 years of experience in software engineering, with a strong focus on LLMs in the last 2 years.
- Proven experience in Python development, with expertise in API creation using FastAPI.
- Deep understanding of production deployment processes, including API integration and WebSockets.
- Extensive experience with AWS services, particularly in deploying machine learning solutions on platforms like SageMaker and EC2.
- Hands-on experience with LLM frameworks such as Langchain, LlamaIndex, and tools like OpenAI, with a strong focus on real-world applications.
- Demonstrated experience in RAG-based bot development, integrating retrieval-augmented techniques with LLMs.
Preferred Qualifications:
- Familiarity with Azure services and cloud computing environments.
- Demonstrated experience in fine-tuning and deploying open-source models in production.