Data Scientist
TITLE: Data Scientist
LOCATION: Remote/Alexandria VA
DATE PREPARED:January2025
FLSA: Exempt
DIVISION: Technology Division
DEPARTMENT: Operations
TRAVEL REQUIREMENT:
Less than 10% to fulfill job requirements.
HOURS and SCHEDULE:
Generally, Monday– Friday, 9:00am to 5:30pm (minimum 37.5 hours within five (5) days per week); unless otherwise required/or approved by management.
REPORTS TO:
This position reports to the Data Science Manager.
SUPERVISION EXERCISED:
Mentorship of junior staff.
RESPONSIBILITY FOR PUBLIC CONTACT:
Daily contact requiring courtesy, discretion, and sound judgment.
LICENSING AND CERTIFICATION:
None
GENERAL DESCRIPTION:
The Data Scientist (DS) is responsible for building and maintaining statistical, machine learning (ML), and artificial intelligence (AI)models and pipelines across the organization. The DS utilizes a range of technical tools to automate data processes, support complex analysis, and develops new methods of leveraginglarge-scale data.
ESSENTIAL DUTIES AND RESPONSIBILITIES:
· Own the entire lifecycle of ML/AItechnologies to ensure continuous high-quality performance of classifiers and models.
· Automatecollection and aggregation of NCMEC data into new formats and technologies to enable improved analysis.
· Build and maintainnew data pipelines to support data science initiatives.
· Build new tools, such as machine learning models, for use by users or other applications to evolve and enhance the usefulness of NCMEC data.
· Apply large language models to operational use-cases, including prompt engineering, model evaluation, workflow design, and model fine-tuning
· Partner with software engineering and dev-ops teams to implement solutions created by the Data Science team.
· Build and engineer statistical models based on NCMEC data.
EDUCATION AND EXPERIENCE:
· Master's in data science, mathematics, or another relevant field and 3 - 4 years of relevant work experience in Data Science.
· Or 4 - 8 years of relevant work experience in Data Science.
· Or other equivalent education and experience in data science fields.
KNOWLEDGE, SKILLSAND ABILITIES:
· Strong communicator of requirements and solutions with stakeholders.
· Excellent organizational, interpersonal, communication, problem solving and analytical skills.
· Experience with industry standard data science toolkits.
· Software engineering ability in languages such as Python, Java,etc.
· Experience training, evaluating ,and deploying machine learning models like convolutional neural networks (CNNs).
· Proficiency in natural language processing (NLP) tasks such as text preprocessing, text classification, sentiment analysis, named entity recognition (NER), and using NLP libraries like NLTK and spaCy.
· Experience developing, fine-tuning, and deploying large language models (LLMs) with a focus on prompt engineering, data preprocessing, and model evaluation.
· Ability to perform exploratory statistical analysis on various data sets.
· Experience developing and implementing data structures/architectures using multiple data sets (structured/unstructured).
· Ability to learn new technologies and frameworks while keeping up to date with technology developments relevant to NCMEC.
· Ability to handle and maintain the integrity and confidentiality of highly sensitive material and information.
· Knowledge of machine learning technologies such as Pandas, Tensor Flow, Spacy etc.
· Ability to manage and mentor team staff
PREFERED SKILLS AND EXPERIENCE:
· Experience with AWS, Azure, and/or Palantir Foundry
· Experience developing and deploying computer vision models using deep learning frameworks (e.g. image processing, object detection, and optical character recognition (OCR) techniques).
Content Exposure Warning:
· As part of regular responsibilities, the data scientist may be exposed to graphic content that depicts/describes child sexual abuse. The data scientist will be enrolled in the NCMEC SafeGuard program to minimize effects of content exposure.
Other details
- Pay Type Salary