AI/ML Engineer (Computer Vision)

ChatGPT Jobs • Full-time • Herndon, VA, US • 1d ago

Job Description

Job Title: (Not explicitly stated; inferred as Machine Learning Engineer specializing in VLMs)

Locations: Herndon, VA (On-site, Remote)

Responsibilities

Design and execute fine-tuning pipelines for Vision-Language Models (VLMs) on domain-specific imagery datasets, including data preprocessing, training orchestration, and hyperparameter optimization.
Develop and implement evaluation frameworks for multimodal model performance, including task-specific metrics for image understanding, visual question answering, and spatial reasoning.
Build scalable training infrastructure on AWS (SageMaker, EC2 GPU instances) for distributed fine-tuning of large multimodal models.
Engineer data pipelines for curating, annotating, and transforming geospatial imagery datasets into model-ready formats for supervised and instruction-tuning workflows.
Collaborate with applied scientists and solutions architects to iterate on model architectures, adapter strategies (LoRA/QLoRA), and inference optimization techniques.

Basic Requirements

TS/SCI with CI Poly required.
5+ years of professional machine learning engineering experience with a focus on deep learning.
1+ years of hands-on experience fine-tuning large foundation models (LLMs or VLMs).
Experience with parameter-efficient fine-tuning methods (LoRA, QLoRA, adapters).
Familiarity with supervised fine-tuning, instruction tuning, and RLHF/DPO alignment techniques.
4+ years of advanced Python development for ML workloads.
Strong proficiency with PyTorch and the HuggingFace ecosystem (Transformers, PEFT, Datasets, Accelerate).
Experience with distributed training frameworks (DeepSpeed, FSDP, or Megatron).
3+ years of experience with computer vision or multimodal models.
Understanding of vision transformer architectures (ViT, CLIP, LLaVA-family models, or similar).
Experience processing and augmenting image datasets at scale.
3+ years of experience with AWS ML infrastructure: SageMaker Training jobs, Processing jobs, and endpoint deployment; GPU instance selection, multi-node training, and cost optimization on EC2 (P4/P5/G5/G6e); S3 data management for large-scale training datasets.
2+ years of experience building ML evaluation pipelines: automated benchmarking, metric computation, and result analysis; experience with both quantitative metrics and qualitative/human evaluation approaches.
Strong software engineering fundamentals (version control, testing, CI/CD for ML workflows).

Preferred Qualifications

2+ years of experience with geospatial or remote sensing imagery.
Familiarity with electro-optical and SAR satellite imagery formats and characteristics.
Understanding of geospatial metadata, coordinate systems, and imagery preprocessing.
Experience with model quantization and inference optimization (vLLM, TensorRT, ONNX).
Experience with MLOps and experiment tracking tools (MLflow, Weights & Biases, SageMaker Experiments).
Familiarity with data annotation platforms and active learning workflows for imagery.
Experience with containerized ML workflows (Docker, ECR, ECS/EKS).
2+ years of experience with Authority to Operate (ATO) processes in government environments; implementation of NIST 800-53 controls and security compliance for ML systems; experience deploying models in air-gapped or disconnected environments.
Familiarity with multimodal evaluation benchmarks (MMMU, MMBench, GQA, or domain-specific equivalents).
Publications or demonstrated contributions in computer vision, VLMs, or multimodal AI.
Experience with synthetic data generation for training data augmentation.