Associate Site Reliability Engineer

Remote Full-time
Why We Need You! Site Reliability Engineering (SRE) is a growing team that partners closely with Product Engineering, Security, and Support. We are responsible for the reliability, deployment, and continuous operation of the Ivanti Cloud services. We need your help to take our existing platform to the next level with observability, release automation, chaos engineering, and more. The Associate SRE role is a blend of infrastructure, networking, operating systems, automation, development, and application administration.It is a hands-on technical position in a fast-paced atmosphere. The ideal candidate has prior experience managing cloud-based SaaS applications and strives to solve traditional operations problems through automation and software. More so, the candidate must possess a high standard of excellence, have a strong customer focus, and is capable of technical deep dives into code, app servers, databases, load balancers, operating systems, and networks. What You Will Be Doing· Work Monday through Friday, approximately 2 PM to 11 PM U.S.Pacific Time, including lunch)· US citizenship and must be located domestically in the U.S. · Deploying, managing, and securing Ivanti’s production Software-as-a-Service (SaaS) environments in AWS and Azure· Working with geographically dispersed, cross-departmental teams to solve difficult problems· Automating common and repetitive tasks· Write documentation and training material· Train other colleagues. · Participate in on-call rotations for 24x7 coverage (follow-the-sun model) for incident response, issue triage, and problem resolutionTo Be Successful inThe Role, You Will Have· A BSc in Computer Science, a related field, or equivalent practical experience· 3+ years of relevant industry experience (2+ with an achieved BSc in Computer Science or Equivalent Degree)· Proficiency with Python and experience with one of the following languages:o Javao Golango C#· Proficiency working with Bash or PowerShell programmatically· Familiarity with public cloud platforms (AWS or Azure preferred)· Experience troubleshooting Java and.NET applications· Experience troubleshooting network and storage infrastructure issues· Experience working with core Linux distributions (Debian, RHEL, SUSE, Slackware) Experience working with Windows· Experience working with one or more: SQL Server, PostgreSQL, Redis, Kafka, MongoDB, Elasticsearch, or similar· Ability to configure and fine tune at least one: HA Proxy, Apache, Nginx, IIS, or similar· Ability to configure: New Relic, DataDog, Splunk, or similar monitoring tools· Familiarity with container orchestration technologies (AWS EKS or Azure AKS preferred)· Experience with deployment pipeline tools such as Ansible, Jenkins, and/or GitHub Actions· Proficiency working and developing Infrastructure as Code (IaC)· A desire to adopt and implement emergent technologies and best practices· Strong verbal and written communication skills in English for the purposes of global collaboration‘Nice-to-haves’ include:· Prior experience as a Site Reliability Engineer or DevOps Engineer· Certificates in one or more of the following categories, or demonstrated certificate-equivalent knowledge:o Cloud Development and architectureo Kubernetes Administrationo Linux Administrationo Software engineering disciplines· Experience with compliance frameworks such as SOC 2 Type 2, ISO-27001, FedRAMP, or IRAP and privacy regulations such as GDPR and PIPEDARoadmap for Success90 Days:· Onboarding and role-training is complete· You're building foundational knowledge of the SRE-run product portfolio· You hold general knowledge of how SRE manages our SaaS environments· You've gotten to know the team and are building relationships with SRE peer teams6 Months:· Self-sufficiency in core job functions and existing processes· Participating in SRE on-call rotations· Contributing to handling SRE tickets to fulfillment and responsible for individual SRE tasks· Active participation in SRE stability discussions with direct interaction with SRE peers1 Year:· Contribute independently to improve reliability and compliance in our SaaS environments· Demonstrate ownership of SRE ticket management including triage and resolution· Lead one or more well-defined projects under guidance fromSenior SRE members· Identify areas where performance, scalability, security, and reliability can be improved in production systems and environments#LI-Remote Apply tot his job
Apply Now →

Similar Jobs

Lean Six Sigma Black Belt Project Manager - Remote (Preferred in Raleigh Durham, NC)

Remote Full-time

Site Reliability Engineering (SRE) Architect - REMOTE

Remote Full-time

Site Reliability Engineer

Remote Full-time

Remote SRE Jobs – Senior Site Reliability Engineer (Remote) – $130k‑$170k USD – Full‑Time – Escondido, California – Cloud/DevOps, Kubernetes, Terraform, Prometheus

Remote Full-time

Shopify Developer Needed TODAY — Fix Product Upsell Logic ($50)

Remote Full-time

Shopify Theme Developer Needed to Complete Shopify Store Setup (Multi-Store Linking + Theme Fixes)

Remote Full-time

Site Reliability Engineer 2 DevOps REMOTE (ship required)

Remote Full-time

Senior Solution Architect, ServiceNow Platform

Remote Full-time

Principal Customer Success Executive Telco and Media

Remote Full-time

ServiceNow Vulnerability Response (VR) Developer - Remote

Remote Full-time

In-House Counsel, Legal

Remote Full-time

Remote Marketing Contractor

Remote Full-time

Remote eLearning Developer

Remote Full-time

Tax Consultant - International

Remote Full-time

Registered Nurse Orthopedic Boneline Triage PRN, UT

Remote Full-time

Educational Consultant + Math Teacher

Remote Full-time

Human Resources Benefits & Compliance Analyst

Remote Full-time

Devops Architect / Lead Engineer with Confluent Kafka - Contract to hire - Remote

Remote Full-time

Longevity Nurse Practitioner - Telehealth (Multi-state Licensed, CA Required)

Remote Full-time

Senior Manager, Email Marketing

Remote Full-time
← Back to Home