← all jobs

Member of Engineering – Pre-training, Data Engineering

Work from home Full-time role Hiring

Job Description:

  • Build and maintain high-performance pipelines for trillions of tokens.
  • Deliver diverse and high quality datasets for pre-training foundation models.
  • Closely work with other teams such as Pretraining, Posttraining, Evals and Product to to ensure alignment on the quality of the models delivered.

Requirements:

  • Strong background in building production-grade, distributed data systems for machine learning, with experience in:
  • Orchestration: Slurm, Airflow, or Dagster
  • Observability & Reliability: CI/CD, Grafana, Prometheus, etc.
  • Infra: Git, Docker, k8s, cloud managed services
  • Batched inference (ex: vLLM)
  • Performance obsession, especially with large-scale GPU clusters and distributed pipelines
  • Expert-level python knowledge and ability to write clean and maintainable code
  • Strong algorithmic foundations
  • Proficiency with libraries like Polars, Dask, or PySpark
  • Nice to have:
  • Experience in building trillion-scale SOTA pretraining datasets
  • Experience translating research to production at scale
  • Experience with OCR, web crawling, or evals
  • Prior experience pre-training LLMs

Benefits:

  • Fully remote work & flexible hours
  • 37 days/year of vacation & holidays
  • Health insurance allowance for you and dependents
  • Company-provided equipment
  • Wellbeing, always-be-learning and home office allowances
  • Frequent team get togethers
  • Great diverse & inclusive people-first culture

Apply To This Job

More open positions

Contract - REMOTE - Data Engineer/Modeler- $60-$65hr

Work from home Full-time role

Remote Data Engineering Specialist – Big Data Pipelines & Cloud Infrastructure | $28/Hour

Work from home Full-time role

Senior Distinguished Data Engineer; Remote-Eligible

Work from home Full-time role

Senior Data Engineer – Databricks - 1613

Work from home Full-time role

(REMOTE) Revenue Cycle - Sr. Business Intelligence Developer

Work from home Full-time role

[Remote] Director of Product Marketing

Work from home Full-time role

Experienced Medical Assistant Career Development Opportunities with careerzynith

Work from home Full-time role

Remote Customer Service Representative – Pet‑Lovers Support Specialist (Work‑From‑Home) at careerzynith

Work from home Full-time role

Franchise Business Consultant - Dunkin'(West Coast Florida Remote)

Work from home Full-time role

Patient Access Scheduler 1, BHMG Cardiology Scheduling, FT 8:30A-5P

Work from home Full-time role

[Remote] Staff Software Engineer - Search / AI

Work from home Full-time role

Remote Data Entry Specialist – Flexible Side‑Hustle Support for Entrepreneurial Communities

Work from home Full-time role

Senior Client Executive

Work from home Full-time role

Mechanical Engineer-Part Time Remote / Telecommute Jobs

Work from home Full-time role

Windchill Integration Engineer

Work from home Full-time role

Salesforce Developer

Work from home Full-time role

Remote Executive Assistant to Chief Digital Media and Marketing Officer

Work from home Full-time role

Senior Voice‑of‑Customer Data Engineer – Remote Live‑Chat Analytics, $35/hr, 2024

Work from home Full-time role

Technical Consultant - Patient Monitoring - Indiana

Work from home Full-time role

[Remote] Account Manager

Work from home Full-time role

Commercial E&S Underwriter (100% work-from-home)

Work from home Full-time role