RL Environment Engineer (ML Engineer)

Remote Full-time
Requirements • Master’s degree in Computer Science, AI, ML, or a related technical field, • (Desirable) Deep knowledge of transformer internals or LLM training/inference, • Strong Python skills with production-quality engineering standards, • (Desirable) Experience with inference libraries such as vLLM or SGLang, • Experience designing or working with RL environments or training pipelines, • (Desirable) CUDA or custom kernel optimization experience (e.g. Pallas), • Solid understanding of modern LLMs and their limitations, • (Desirable) Research experience with publications or high-quality open-source work, • Ability to work quickly, iterate reliably, and respond to feedback, • (Desirable) Experience building complex or open-ended RL-based learning systems, • Advanced English proficiency (C1/C2) What the job involves • Design and build reinforcement learning environments for training and evaluating LLMs, • Translate modern ML and AI research into structured RL problems, • Implement reliable, debuggable, and scalable training environments in Python, • Collaborate with researchers and engineers to improve model learning quality, • Complete an average of two well-scoped tasks per week, • Iterate quickly based on feedback and evaluation results Apply tot his job
Apply Now →

Similar Jobs

[Remote] Multi‑Target Tracking & Sensor Fusion Engineer (R4172)

Remote Full-time

Research Scientist - Algorithms Engineering

Remote Full-time

Data and AI Analyst

Remote Full-time

[Remote] TDAC Analyst

Remote Full-time

Enterprise Imaging / Artificial Intelligence (AI) Analyst (Remote)

Remote Full-time

Junior AI Analyst - Back Office Operations

Remote Full-time

[Remote] Experienced Senior, AI Senior Business Analyst

Remote Full-time

Sr Director Analyst - AI's Impact on HR and the Workforce (Remote - U.S.)

Remote Full-time

[Remote] Part Time Search Analysts

Remote Full-time

Lead AI and Automation Analyst (Remote)

Remote Full-time

iOS Developer (Swift) – Fully Remote

Remote Full-time

Metadata Operations Specialist

Remote Full-time

Ancillary Contracting Specialist I job at Horizon Blue Cross Blue Shield of New Jersey - Horizon BCBSNJ in NJ, NY, PA, CT, DE

Remote Full-time

**Experienced Data Entry Clerk – Remote Opportunity for High-Speed Typists**

Remote Full-time

Internal Audit and Risk Management - Data Analytics & AI (Summer 2026 Internship)

Remote Full-time

Experienced Remote Product Tester and Customer Service Representative – Flexible Work-from-Home Opportunity with arenaflex

Remote Full-time

Student Advisor/ DNP /Remote/

Remote Full-time

Experienced Data Entry Specialist – Remote Part-Time Opportunity for Career Growth and Development at arenaflex

Remote Full-time

Data Engineer - Not an Active Opening, Building Talent Pipeline

Remote Full-time

Senior Director - Safety Scientist, Pharmacovigilance Operations

Remote Full-time
← Back to Home