Site Reliability Engineer

BeyondTrust

BeyondTrust is a place where you can bring your purpose to life through the work that you do, creating a safer world through our cyber security SaaS portfolio.

Our culture of flexibility, trust, and continual learning means you will be recognized for your growth, and for the impact you make on our success. You will be surrounded by people who challenge, support, and inspire you to be the best version of yourself.

The Role

As a Site Reliability Engineer, you will be a part of the Endpoint Privilege Management (EPM) team that will design, develop, deliver, and maintain a cloud-based software solution utilizing modern technologies. This is a unique opportunity to lead the engineering organization in areas of standardized, automated infrastructure and service provisioning and orchestration.

What You’ll Do

  • Design long-term technical solutions and cross-team mechanisms to achieve reliability goals
  • Align and help drive execution of the EPM team’s roadmap
  • Collaborate with SREs and senior engineers across engineering organizations on best practices
  • Build and enhance monitoring and alerting for EPM deployments
  • Create and enhance automation tools and frameworks for infrastructure management
  • Design and implement automated processes where necessary, to support monitoring of EPM
  • Be on an on-call rotation to respond to incidents that impact EPM availability

What You’ll Bring

  • Experience and success designing and building enterprise-ready cloud-native platforms
  • Passion with researching, implementing, and managing solutions to build a cloud-native platform
  • Hold lofty standards, continuously raising the bar, and driving our teams to deliver high-quality products, services, and processes
  • Ability to simplify and encapsulate complexity to empower our development teams
  • Ability to balance speed and risks, making decisions based on available information
  • Real-world experience with Azure or AWS
  • Real-world experience with Infrastructure as Code (Terraform, etc.)
  • Expertise in software engineering and systems architecture
  • Experience with incident response and uptime metrics
  • Experience with incident management, root cause analysis, and proactive monitoring solutions
  • Experience developing and implementing system monitoring and alerting tools
  • Expertise in maintaining high-availability systems

Better Together

Diversity. Inclusion. They’re more than just words for us. They are the guiding values of how we build our teams, cultivate leaders, and create a culture where people feel connected.

We take care of our employees so they can take care of our customers. Customers who come from all walks of life just like us. We hire incredible people from diverse backgrounds because when we are different together, we are stronger together.

About Us

BeyondTrust is the worldwide leader in intelligent identity and access security, enabling organizations to protect identities, stop threats, and deliver dynamic access. We are leading the charge in innovating identity-first security and are trusted by 20,000 customers, including 75 of the Fortune 100, plus a global ecosystem of partners.

Learn more at www.beyondtrust.com

#LI-BS1

Source
remotive.com

Comments are closed.