Data Scientist

Virtual Req #621
Tuesday, April 29, 2025
 
 

TITLE: Data Scientist 

LOCATION: Remote/Alexandria VA 

DATE PREPARED:January2025 

FLSA: Exempt 

DIVISION: Technology Division 

DEPARTMENT: Operations 

 

TRAVEL REQUIREMENT:  

Less than 10% to fulfill job requirements.  
 

HOURS and SCHEDULE:  

Generally, Monday– Friday, 9:00am to 5:30pm (minimum 37.5 hours within five (5) days per week); unless otherwise required/or approved by management.  
 

REPORTS TO:  

This position reports to the Data Science Manager.  
 

SUPERVISION EXERCISED:  

Mentorship of junior staff.  
 

RESPONSIBILITY FOR PUBLIC CONTACT:  

Daily contact requiring courtesy, discretion, and sound judgment.  
 

LICENSING AND CERTIFICATION:  

None  
 

GENERAL DESCRIPTION:  

The Data Scientist (DS) is responsible for building and maintaining statistical, machine learning (ML), and artificial intelligence (AI)models and pipelines across the organization. The DS utilizes a range of technical tools to automate data processes, support complex analysis, and develops new methods of leveraginglarge-scale data.  

 

ESSENTIAL DUTIES AND RESPONSIBILITIES:  

·            Own the entire lifecycle of ML/AItechnologies to ensure continuous high-quality performance of classifiers and models.  

· Automatecollection and aggregation of NCMEC data into new formats and technologies to enable improved analysis.  

·       Build and maintainnew data pipelines to support data science initiatives.  

·       Build new tools, such as machine learning models, for use by users or other applications to evolve and enhance the usefulness of NCMEC data.  

·       Apply large language models to operational use-cases, including prompt engineering, model evaluation, workflow design, and model fine-tuning 

·        Partner with software engineering and dev-ops teams to implement solutions created by the Data Science team.  

·    Build and engineer statistical models based on NCMEC data.  

 

   

EDUCATION AND EXPERIENCE:  

·        Master's in data science, mathematics, or another relevant field and 3 - 4 years of relevant work experience in Data Science.  

·        Or 4 - 8 years of relevant work experience in Data Science.  

·        Or other equivalent education and experience in data science fields.  

   

KNOWLEDGE, SKILLSAND ABILITIES:  

   

·        Strong communicator of requirements and solutions with stakeholders.  

·        Excellent organizational, interpersonal, communication, problem solving and analytical skills.  

·        Experience with industry standard data science toolkits.  

·        Software engineering ability in languages such as Python, Java,etc.  

·        Experience training, evaluating ,and deploying machine learning models like convolutional neural networks (CNNs). 

·        Proficiency in natural language processing (NLP) tasks such as text preprocessing, text classification, sentiment analysis, named entity recognition (NER), and using NLP libraries like NLTK and spaCy. 

·       Experience developing, fine-tuning, and deploying large language models (LLMs) with a focus on prompt engineering, data preprocessing, and model evaluation. 

·        Ability to perform exploratory statistical analysis on various data sets.  

·        Experience developing and implementing data structures/architectures using multiple data sets (structured/unstructured).  

·        Ability to learn new technologies and frameworks while keeping up to date with technology developments relevant to NCMEC.  

·        Ability to handle and maintain the integrity and confidentiality of highly sensitive material and information.  

·        Knowledge of machine learning technologies such as Pandas, Tensor Flow, Spacy etc.  

·       Ability to manage and mentor team staff  

 

PREFERED SKILLS AND EXPERIENCE:  

· Experience with AWS, Azure, and/or Palantir Foundry 

· Experience developing and deploying computer vision models using deep learning frameworks (e.g. image processing, object detection, and optical character recognition (OCR) techniques). 

 

Content Exposure Warning: 

·       As part of regular responsibilities, the data scientist may be exposed to graphic content that depicts/describes child sexual abuse. The data scientist will be enrolled in the NCMEC SafeGuard program to minimize effects of content exposure. 

 

Other details

  • Pay Type Salary