Senior Software Engineer – LLM Evaluation (Remote, Contractor)

Remote Full-time

I’m looking for aSenior Software Engineer to collaborate with a global tech company on advanced AI development, specifically evaluating and improving large language models for coding. If you enjoy deep technical work, reviewing complex code, and shaping the future of AI-assisted engineering, this project is a great fit. What you’ll doCurate high-quality code examples and write precise solutions across multiple languages: Python, JavaScript/React, C/C++, Java, Rust, Go. Review and refine AI-generated code to ensure efficiency, scalability, and reliability.Build tools and agents that detect code quality issues and common error patterns. Evaluate model capabilities across the full engineering cycle: prototyping, architecture, API design, implementation, launch, experimentation, and maintenance. Design mechanisms to automatically verify solutions to software engineering tasks. Work closely with research and engineering teams to elevate AI-powered development tools. RequirementsSeveral years of professional software engineering experience. At least 2+ years full-time at a top-tier product company(e.g., bolthires, bolthires, bolthires, Meta, bolthires, bolthires, Stripe, Shopify, Datadog, Dropbox, PayPal, IBM Research).Strong experience building and deploying scalable, production-grade systems. Deep knowledge of software design, architecture, debugging, and code review best practices. Excellent written and verbal communication to produce clear, structured evaluations. Offer DetailsFlexible contractor engagement: 10 to 40 hrs/week (partial PST overlap required). Duration: 1 month, with potential extensions based on performance. Competitive compensation. Fully remote. Apply tot his job

Apply Now →

Senior Software Engineer – LLM Evaluation (Remote, Contractor)

Similar Jobs

Senior Machine Learning Engineer, LLM Compressor and Quantization

Staff Applied Machine Learning Engineer, LLM Applications

Sr Applied LLM Engineer

Full Stack Engineer — AI & LLM Systems

AI/ML Engineer (LLM + Python) / Full-Stack Developer

AI LLM Engineer

Entry-Level Live Chat Support Specialist ($25-$35/hr) – No…

LIVE CHAT AGENT/CUSTOMER SUPPORT AGENT

Live Chat Agent - REMOTE (Part-Time & Full-Time)

Litigation Specialist - CGL

Privacy Lawyer

Order Management Specialist, Customer Service and OM Process Improvement

Medical CSR

PMO Specialist - LATAM Market

Cloud Software Consultant

Senior Fire Investigator

Business Development Manager - Residential Home Energy Storage

Sr. Aurora AWS MySQL DBA L3 Lead

Analyst, Statutory Financial Reporting – Analysis

Interfaith Chaplain, 16 Hour, Days, Weekends