Research Scientist - Vision Data Infrastructure Job at Storm3, San Francisco, CA

T0tkeVJ5TmQ4cTN6Y1NOL242bXdCZ0pwaUE9PQ==
  • Storm3
  • San Francisco, CA

Job Description

Research Scientists/Engineers (all levels)

🔍 Focus on Vision Data Infrastructure

🤖 Fundamental AI Research Institute

🌎 San Francisco Bay Area, USA

💸 $250,000 - $600,000 salary + annual bonus

Come join one of the only research institutions globally with resources to compete with top AI companies =>10s of 1000s of GPUs to explore state-of-the-art research in LLMs, Multimodal and Agentic AI.

Currently seeking AI talent with expertise in building scalable pipelines for vision data to support both image/video generative training and multi-modal alignment. You’ll design high-performance pipelines for large-scale image and video datasets , enabling efficient pretraining, alignment, and simulation-based data generation.

Responsibilities:

Vision Data Sourcing & Curation

  • Collect and organize image and video data from open datasets and the web.
  • Handle data cleaning, filtering, deduplication, and metadata generation.
  • Ensure ethical and compliant data collection at scale.

Processing & Augmentation

  • Build high-throughput pipelines for vision data preprocessing (frame extraction, resolution normalization, format conversion, latent caching).
  • Implement GPU-accelerated augmentation and distributed data loading (WebDataset, TFRecords, Parquet).

Synthetic & Simulation-Based Data Generation

  • Use simulation tools (e.g., Unreal Engine 5 , Isaac Sim, Unity) to generate high-quality synthetic vision data .
  • Create specialized datasets for VLM training , visual reasoning , and agent interaction .

Requirements:

  • Strong experience with data engineering , computer vision , or machine learning infrastructure .
  • Expertise in building and scaling ETL/data pipelines for large unstructured datasets.
  • Proficiency with Python , PyTorch , and distributed data frameworks (e.g., Ray , Spark , Dask ).
  • Experience with WebDataset , TFRecords , Parquet , or similar high-throughput data formats.
  • Familiarity with GPU-accelerated preprocessing , NVIDIA DALI , or equivalent systems.
  • Understanding of image/video codecs , data compression , and cloud storage optimization .

Preferred Experience:

  • Prior work with simulation-based or synthetic data generation using Unreal Engine , Isaac Sim , or Unity .
  • Experience curating datasets for multimodal or vision-language model training.
  • Knowledge of data ethics , privacy , and compliance frameworks for large-scale AI datasets.
  • Experience contributing to open datasets or data-centric AI research .

Why apply:

  • Opportunity to join a fast-growing core team that are already pushing AI breakthroughs
  • Highly competitive salary package
  • Work alongside ambitious and bright superstars from tech and academia
  • Medical, Dental and Vision Insurance
  • Relocation package available

🌎 San Francisco Bay Area, USA

📧 Interested in applying? Please click on the ‘Easy Apply’ button or alternatively email me your resume at stefani.lukic@storm3.com

Job Tags

Relocation package,

Similar Jobs

Propy Inc.

Applied AI Engineer Job at Propy Inc.

Who We Are Propy is revolutionizing the real estate industry by building the world's first AI-powered Title and Escrow platform onchain. We have processed over $5B in transactions, and we are on a mission to make closing on a home as easy as buying a stock. We combine...

Arrayo

Full Stack Software Engineer (Python / React) Job at Arrayo

 ...Role Overview Were seeking a Full Stack Software Engineer with strong backend development skills in Python and frontend expertise...  ...mentorship where appropriate Required Qualifications ~5+ years of experience in full stack development ~ M.S. degree in relevant domain... 

Midtown Athletic Clubs

Assistant General Manager Job at Midtown Athletic Clubs

Midtown Athletic Club is searching for an Assistant General Manager to join our world class team at our flagship club in Chicago, IL (2444 N Elston Ave, Chicago, IL 60647). Check out our beautiful club here: Midtown Athletic Club Chicago - Health Club and Gym Chicago...

Medical Services of America

Licensed Practical Nurse Hospice - Weekend Job at Medical Services of America

 ...Join a Team Where Compassion Meets Purpose Licensed Practical Nurse Hospice Care | Chesapeake, VA Employment Type: Full-Time, Weekend Hourly Range: $28-$30 At Medi Home Health & Hospice , part of the Medical Services of America family, we believe hospice... 

Compassus

Hospice RN - 24 Hrs/Week Job at Compassus

 ...compensation and benefits. Your position perks as a Hospice Registered Nurse / RN Case Manager ~ Competitive pay ~ Comprehensive onboarding ~ Health, dental, vision for part & full-time positions ~ Generous Paid Time Off plan that increases with tenure ~...