Staff Production Engineer (Cloud Platform & Reliability – Machine Identity Security) – hybrid

Back to all jobs
  • CyberArk
  • Santa Clara, CA
  • Full-Time
  • 1 week ago
  • $181,000–$226,000
Published
April 29, 2026
Location
Santa Clara, CA
Category
Job Type

Staff Production Engineer (Cloud Platform & Reliability – Machine Identity Security) – hybrid: our view in 3 lines...

  • The Role: A senior production/platform engineer role for professionals experienced in cloud infrastructure and machine identity security products. The person will design, operate, and improve highly available cloud platforms, drive IaC and CI/CD practices, lead incident response, and mentor engineers to improve reliability and observability.
  • Requirements: Requires 8+ years in DevOps, Platform Engineering, or SRE with experience on AWS, Azure, or GCP and managing Kubernetes (EKS/AKS/GKE). The role explicitly calls for Terraform, Ansible, or Pulumi, CI/CD tools such as Jenkins, GitLab CI, ArgoCD, or GitHub Actions, and programming/scripting with Python or Go.

Job Description

Company Description

About CyberArk

CyberArk, a Palo Alto Networks company, is the global leader in identity security, trusted by organizations around the world to secure human and machine identities in the modern enterprise. CyberArk’s AI-powered Identity Security Platform applies intelligent privilege controls to every identity with continuous threat prevention, detection and response across the identity lifecycle. With Identity Security, organizations can reduce operational and security risks by enabling zero trust and least privilege with complete visibility, empowering all users and identities, including workforce, IT, developers and machines, to securely access any resource, located anywhere, from everywhere. Learn more at cyberark.com.

Copyright © 2026 CyberArk Software. All Rights Reserved. All other brand names, product names, or trademarks belong to their respective holders.

Job Description

The Production Engineering team is responsible for building, scaling, and operating the cloud platform for CyberArk’s machine identity security products. Our solutions are trusted by the world’s largest organizations to protect and manage TLS machine identities, SSH machine identities, and code signing identities.

As a Staff Production Engineer at CyberArk, you will play a key role in designing and evolving the reliability, scalability, and operational excellence of our cloud platform. You will work across infrastructure, services, and engineering teams to ensure systems are resilient, observable, and able to operate at scale.

This role is ideal for engineers who combine strong infrastructure expertise with a systems mindset, and who are comfortable driving improvements across production environments, tooling, and engineering practices.

What You’ll Do

  • Design, build, and evolve highly available cloud infrastructure platforms with a focus on scalability, resilience, and reliability
  • Lead improvements across production systems, including performance, availability, and incident response
  • Drive and standardize Infrastructure as Code (IaC) practices to improve consistency and reduce operational overhead
  • Design and optimize CI/CD pipelines to support fast, secure, and reliable software delivery at scale
  • Partner with development teams to improve system reliability, observability, and cloud-native design patterns
  • Define and implement monitoring, alerting, and observability strategies across distributed systems
  • Lead incident response efforts, including root cause analysis and long-term remediation strategies
  • Identify and eliminate operational toil through automation and system improvements
  • Mentor engineers and contribute to raising the bar for production engineering practices

#LI-JH1

#LI-Hybrid

    Qualifications

    What You Bring

    • 8+ years of experience in DevOps, Platform Engineering, or Site Reliability Engineering (SRE)
    • Strong experience designing and operating cloud infrastructure on AWS, Azure, or GCP
    • Deep expertise managing and scaling Kubernetes environments (EKS, AKS, or GKE)
    • Strong experience with Infrastructure as Code tools (Terraform, Ansible, or Pulumi)
    • Proven experience designing and maintaining complex CI/CD systems (Jenkins, GitLab CI, ArgoCD, GitHub Actions)
    • Strong programming/scripting skills (Python, Go, or similar) for automation and tooling
    • Experience operating in high-scale, 24/7 production environments with ownership of incident response and reliability
    • Solid understanding of Linux systems and networking fundamentals (DNS, TCP/IP, load balancing, VPC, mTLS)
    • Strong problem-solving skills and ability to work across teams

    Nice to Have

    • Experience implementing DevSecOps practices in cloud environments
    • Experience building or improving observability platforms and tooling
    • Professional certifications (CKA/CKAD, AWS Solutions Architect, Azure Administrator)
    • Experience using AI-assisted development tools to improve operational workflows and automation

    Additional Information

    CyberArk is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, creed, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.

    The salary range for this position is $181,000–$226,000 per year, plus discretionary bonus and equity, based on individual performance. Base pay may vary depending on job-related knowledge, skills, and experience. The total compensation package includes a comprehensive range of medical, dental, vision, financial, and other benefits.

    Key Skills
    ? Key Skills in dark blue have been inferred based on similar industry roles
    AWS Go Linux Networking (DNS TCP/IP) Azure GCP Ansible Jenkins Gitlab Kubernetes Terraform Python CI/CD

    Subscribe to Career Resources

    Get the latest career advice, industry insights, and job opportunities delivered to your inbox.