Job Description

Summary

Are you an experienced engineering leader with a passion for reliability and scalability? We’re looking for an exceptional Site Reliability Engineering (SRE) Manager to join the Compute infrastructure team at Roblox. In this role, you will partner with other Eng leaders to establish a group to evolve Compute’s best practices and systems to ensure they meet the highest standards of performance, reliability, and efficiency. You’ll collaborate with teams across Compute and work closely with the central Roblox Reliability team to build robust infrastructure that supports our growth. If you have a track record of leading high-impact software engineering teams and a knack for solving complex technical challenges, we want to hear from you. Join us in shaping the future of our platform and delivering unparalleled value to our users.

At Roblox, our vision is to achieve 1 billion daily active users. We believe this leader will be instrumental in driving us towards that ambitious goal.

You will:

  1. Drive Reliability: Collaborate with cross-functional product partners (both within Compute and in the wider company) to enhance and elevate reliability of our systems.
  2. Champion Production Health: Engage directly with production environments, analyzing key trends and deriving actionable insights. Take ownership of maintaining and improving production health.
  3. Build and Implement: Build systems that improve reliability, automation, and reduce toil towards Compute’s North Star. Compute is responsible for everything from the machine lifecycle to providing high level orchestration services and service discovery to the wider Roblox development team.

You have:

  1. Team Leadership: Demonstrated ability to build, lead, and develop high-performing engineering teams.
  2. System Expertise: You’ve worked with large live systems at scale, ideally in an infrastructure setting.
  3. Software Engineering Background: Great foundation in software engineering principles and practices.
  4. Scalable Design Experience: Extensive experience in guiding teams on designing and implementing scalable systems. Expertise in creating robust architectures that efficiently manage growth and maintain performance under varying loads, ensuring systems are resilient and adaptable to evolving demands.
  5. Engineering Management: Over 3 years of experience in engineering management roles, with a consistent track record of leading successful projects and teams.
  6. Educational Background: Bachelor’s degree in Computer Science or a related field, or equivalent experience.

You are:

  1. A Visionary Leader: Capable of leading teams and fostering a problem-solving culture within large-scale engineering projects.
  2. Hands-On and Technical: Equipped with strong technical skills and a hands-on approach to achieving team objectives.
  3. Strategic Planner: Skilled in project management, strategic planning, and developing detailed roadmaps for successful delivery.

This role is Hybrid, requiring three days per week in our San Mateo headquarters.

For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits.

Annual Salary Range

$283,780—$331,640 USD

Skills
  • Development
  • Leadership
  • Software Engineering
© 2024 cryptojobs.com. All right reserved.