Agilite Global Solutions

NLP Data Scientist - Real World Data (RWD)

  • 4 months ago
  • Over 50 applicants

Job Description

  • We are seeking a skilled NLP data scientist with a focus on language models to join our AI and Life Sciences Solutions team.
  • Your expertise in processing and understanding natural language data, along with your knowledge of Electronic Health Records (EHR) and laboratory report analysis, will be instrumental in driving our data science initiatives and innovations, particularly in developing rich multimodal real-world datasets to expedite RWD-driven drug development in pharma.

Responsibilities:
  1. Apply NLP and open-source Large Language Models (LLMs) such as Llama 2, Mixtral, and BERT to extract, process, and interpret unstructured medical data from diverse sources like EHRs, medical notes, and laboratory reports.
  2. Collaborate with clinical scientists and data scientists to create efficient NLP models for healthcare, exhibiting an understanding of both the technical and medical aspects of the data.
  3. Conduct data cleaning, preprocessing, and validation to maintain the accuracy and reliability of insights gathered from NLP processes.
  4. Validate and present data findings to stakeholders, exhibiting clear and effective communication skills.

Required Skills/Qualifications:
  • Master's or Ph.D. degree in Computer Science, Data Science, Computational Linguistics, or a related analytical field.
  • Deep understanding and direct experience (2+ years) in handling and interpreting electronic health records (EHR) and laboratory test results are a must.
  • Proven experience (2+ years) in NLP, with strong knowledge of techniques such as Named Entity Recognition (NER), text summarization, and topic modelling, and of their applied use in healthcare.
  • Expert-level understanding and practical experience (1+ years) with Large Language Models (LLMs), e.g., inference and fine-tuning.
  • Proficiency in Python and SQL, with strong experience in NLP libraries such as NLTK, spaCy, and Hugging Face Transformers, and deep learning libraries such as PyTorch and TensorFlow.
  • Familiarity with common data science and ML practices, e.g., version control systems, agile methodologies, and documentation.
  • Experience working with the AWS cloud environment and large databases (e.g., AWS Redshift).
  • Experience in managing the ML lifecycle using open-source tools (e.g., MLflow).
  • Detail-oriented with strong analytical and problem-solving abilities.
  • Excellent verbal and written communication skills, with the ability to present complex data to a non-technical audience.

Preferred Qualifications:
  • Experience dealing with protected health information (PHI) and familiarity with healthcare-related data privacy laws such as HIPAA.
  • Familiarity with standard healthcare codes and terminologies such as ICD-10, CPT, LOINC, and SNOMED CT.
  • Experience with Retrieval-Augmented Generation (RAG) and vector storage for storing and querying large volumes of unstructured healthcare documents.

More Info

Industry: Other

Function: Healthcare

Job Type: Permanent Job

Date Posted: 17/07/2024

Job ID: 85073793

Last Updated: 17-07-2024 09:22:52 AM