Senior Software Engineer – LLM Evaluation (Remote, Contractor)

Remote Full-time
I’m looking for a Senior Software Engineer to collaborate with a global tech company on advanced AI development, specifically evaluating and improving large language models for coding. If you enjoy deep technical work, reviewing complex code, and shaping the future of AI-assisted engineering, this project is a great fit.

What you’ll do
- Curate high-quality code examples and write precise solutions across multiple languages: Python, JavaScript/React, C/C++, Java, Rust, Go.
- Review and refine AI-generated code to ensure efficiency, scalability, and reliability.
- Build tools and agents that detect code quality issues and common error patterns.
- Evaluate model capabilities across the full engineering cycle: prototyping, architecture, API design, implementation, launch, experimentation, and maintenance.
- Design mechanisms to automatically verify solutions to software engineering tasks.
- Work closely with research and engineering teams to elevate AI-powered development tools.

Requirements
- Several years of professional software engineering experience.
- At least 2 years full-time at a top-tier product company (e.g., Meta, Stripe, Shopify, Datadog, Dropbox, PayPal, IBM Research).
- Strong experience building and deploying scalable, production-grade systems.
- Deep knowledge of software design, architecture, debugging, and code review best practices.
- Excellent written and verbal communication skills, with the ability to produce clear, structured evaluations.

Offer Details
- Flexible contractor engagement: 10 to 40 hrs/week (partial PST overlap required).
- Duration: 1 month, with potential extensions based on performance.
- Competitive compensation.
- Fully remote.
Apply Now →