- We are seeking talented and driven Data Scientists to join our growing team at Diligent. Reporting to the Applied Science Manager, this exciting role involves working on state-of-the-art projects in Natural Language Processing (NLP), Large Language Models (LLMs), and other advanced AI methodologies.
- Your contributions will be crucial in developing platform-level AI capabilities, such as document set comprehension, longitudinal insight extraction, semantic search, and structured retrieval-augmented generation, to accelerate AI adoption across Diligent's diverse product suite.
- Diligent is a global leader in modern governance, delivering innovative SaaS solutions that address governance, risk, and compliance (GRC) needs. Our mission is to empower leaders with cutting-edge technology, actionable insights, and vital connections to drive impactful decisions and lead with purpose.
Key Responsibilities:
- Design and develop machine learning models and algorithms for various applications.
- Test and evaluate the performance of these models and algorithms.
- Collaborate with the data science teams to develop and implement NLP solutions for processing large document sets.
- Collaborate with machine learning engineers and software engineers to productize AI-based capabilities.
- Work on projects involving sensitive customer content used for Diligent customers' board communications, which requires stringent privacy and security measures.
- Develop and support semantic search, structured retrieval, and RAG use cases across various Diligent products.
- Utilize LLMs, knowledge graphs, traditional NLP methods, and custom models to provide document set comprehension capabilities and facilitate insight extraction.
- Focus on complex legal and regulatory language within the GRC industry, ensuring accurate and relevant model outputs.
- Implement and fine-tune models using Python and PyTorch, leveraging AWS infrastructure.
- Document and communicate your approach, progress, and results to the broader team.
- Stay up to date with the latest research and developments in the field of machine learning and AI.
- [Staff/Senior] Lead and oversee research projects, design and analyze experiments, and build evaluation frameworks to measure the quality of our solutions.
Basic Qualifications:
- Proficiency in Python, NumPy, scikit-learn, Pandas, NLTK, spaCy, PyTorch, or similar.
- Experience with model deployment and scaling.
- Strong foundational knowledge in NLP, machine learning, and statistical modeling.
- Proficiency in building ML/AI pipelines and in using relevant tools such as MLflow, AWS SageMaker, or similar.
Preferred Qualifications:
- Hands-on experience with LLMs including prompt engineering, fine-tuning, model evaluation, and building RAG-based functionality.
- Experience with semantic search and information retrieval systems.
- Familiarity with reinforcement learning methods.
Educational Requirements:
- Bachelor's degree in Computer Science, Machine Learning, Mathematics, or a related field.
- Master's or PhD preferred.