**ONLY FOR CANDIDATES WHO HAVE WORKED AT TURING.COM**
**Other candidates should apply to a different job listing posted by me**
**RLHF for LLMs**
Type: Part-Time, Remote
Perks: US organisation; can offer compensation competitive with Turing's
Compensation: Starting at $15/hour (Rs. 1200+ per hour)
Minimum Commitment: 10 hours/week
Signing Bonus: $300 for qualified candidates who onboard within the next week and stay for a month
About Us:
We are at the forefront of AI and machine learning, and we're looking for motivated individuals to contribute to the next generation of intelligent models. The ideal candidate will have experience working with Turing.com and a strong background in data annotation, prompt engineering, and model fine-tuning. You will play a critical role in refining AI systems, providing essential human feedback, and enhancing overall model performance.
Key Qualifications:
Experience in RLHF:
Data Expertise:
Technical Proficiency:
Turing.com Experience:
Preferred Qualifications:
What You'll Do:
You will play a key role in annotating and curating data for the training and fine-tuning of large language models (LLMs), ensuring annotations are accurate, consistent, and project-aligned. You'll implement Reinforcement Learning from Human Feedback (RLHF) techniques, providing structured human feedback to guide model outputs and continuously fine-tune models to improve performance.
Why You Should Apply:
Flexible hours with no restrictions: work whenever it fits your schedule!
Remote work from anywhere in India!
Competitive pay starting from $15/hour, based on experience and performance
How to Apply:
Fill out the Google Form
Wait for shortlisting email
Receive offer letter
Take onboarding seriously
Follow for more AI Jobs + Entrepreneurship
Ayyush Sharma (Chhotapreneur)
Growth, Strategy & Revenue Operations | A+ track record in scaling startups.
Growth @ Outlier AI
Date Posted: 20/10/2024
Job ID: 97111731