[Remote] Senior or Staff MLOps Engineer – LLMOps

Remote Full-time
Note:The job is a remote job and is open to candidates in USA. TRM Labs is a blockchain intelligence company committed to fighting crime and creating a safer world. As aSenior or Staff MLOps Engineer with a focus in LLMOps, you'll be at the core of building and scaling the technical infrastructure for AI/ML systems, developing robust pipelines and operational tooling for AI applications. Responsibilities• Build reusable CI/CD workflows for model training, evaluation, and deployment — integrating Langfuse, GitHub Actions, and experiment tracking, etc• Automate model versioning, approval workflows, and compliance checks across environments• Build out a modular and scalable AI infrastructure stack — including vector databases, feature stores, model registries, and observability tooling• Partner with engineering and data science to embed AI models and agents into real-time applications and workflows• Continuously evaluate and integrate state-of-the-art AI tools (e.g.LangChain, LlamaIndex, vLLM, MLflow, BentoML, etc.)• Drive AI reliability and governance, enabling experimentation while ensuring compliance, security, and uptime• Build and enhance AI/ML Model Performance• Ensure data accuracy, consistency and reliability, leading to better model training and inferencing• Deploy infrastructure to support offline and online evaluation of LLMs and agents — including regression testing, cost monitoring, and human-in-the-loop workflows• Enable researchers to iterate quickly by providing sandboxes, dashboards, and reproducible environmentsSkills• Write high-quality, maintainable software — primarily in Python, but we value engineering ability over language familiarity• Have a strong background in scalable infrastructure, including: Containerization and orchestration (e.g.Docker, Kubernetes), Infrastructure-as-code and deployment (e.g. Terraform, CI/CD pipelines), Monitoring and logging frameworks (e.g. Datadog, Prometheus, OpenTelemetry)• Understand and implement ML Ops best practices, including: Model versioning and rollback strategies, Automated evaluation and drift detection, Scalable model and agent serving infrastructure (e.g. vLLM, Triton, BentoML)• Deploy and maintain LLM and agentic workflows in production, including: Monitoring cost, latency, and performance, Capturing traces for analysis and debugging, Optimizing prompt/response flows with real-time data access• Demonstrate strong ownership and pragmatism, balancing infrastructure elegance with iterative delivery and measurable impactBenefits• PTO• Holidays• Parental LeaveCompany Overview• TRM Labs is a software company that offers blockchain, transaction monitoring, and analytics to help financial institutions and governments.It was founded in 2018, and is headquartered in San Francisco, California, USA, with a workforce of 201-500 employees. Its website isCompany H1B Sponsorship• TRM Labs has a track record of offering H1B sponsorships, with 1 in 2025, 4 in 2024, 3 in 2023, 3 in 2022, 1 in 2021. Please note that this does not guarantee sponsorship for this specific role. Apply tot his job
Apply Now →

Similar Jobs

Senior Machine Learning Operations (MLOps) Engineer

Remote Full-time

Principal MLOps Engineer (Remote)

Remote Full-time

[Remote] Cloud/MLOps Engineer — Secure Analytics Platforms

Remote Full-time

GenAI ML/MLOps Engineering Lead (Remote or Hybrid)

Remote Full-time

MLOps Engineer - 4 Day Week / Remote

Remote Full-time

MLOps Engineer / Thousand Oaks, CA / Remote

Remote Full-time

Experienced Machine Learning Engineering Manager for New Business Verticals – Lead High-Performing Teams and Drive Innovation in AI and ML Solutions

Remote Full-time

Senior Engineering Manager, Model Inference & Serving, Machine Learning Platform [Remote]

Remote Full-time

[Remote] Senior Manager, Software Engineering - Personalization and ML Enablement

Remote Full-time

Experienced Engineering Manager – AI/ML Technology for Disruptive Product Innovation (Fully Remote – USA)

Remote Full-time

Mortgage Loan Originator | Retail - Remote New York

Remote Full-time

Corp Social Responsibility Mgr

Remote Full-time

National Director, Facilities Management

Remote Full-time

Equity Research Associate – Alternative Data and Analytics

Remote Full-time

Advanced Analytics - Data Engineer

Remote Full-time

Hiring Now: Talent Acquisition - People Operations Generalist

Remote Full-time

Manager of Marketing

Remote Full-time

Blog Writer - Virtual Position / Native English

Remote Full-time

Talent Operations Specialist | Consensys | $75k-$120k | Remote (USA)

Remote Full-time

FSP Principal Biostatistician- Early Phase Clinical Development(PK)

Remote Full-time
← Back to Home