Senior Site Reliability Engineer
About Pagoda
Pagoda is a technology services firm committed to developing essential components for the NEAR Ecosystem. The focus is on revolutionizing the creation and distribution of software to expand economic access for individuals not fully integrated into the global economy. Empowering people to discover opportunities, create new experiences, and collaborate is at the core of our products as we strive to create an Open Web world where individuals control their assets, data, and governance.
About The Role
Pagoda is looking for an enthusiastic and seasoned Senior Site Reliability Engineer (SRE) to join our team in constructing a resilient and scalable infrastructure for the NEAR blockchain ecosystem. You have the opportunity to contribute to shaping the future of Web3 by developing robust, self-repairing services that support the forthcoming generation of decentralized applications.
Responsibilities
- Work closely with engineering to ensure seamless 24/7 service uptime, employing your expertise to design self-healing, automated systems that proactively tackle potential issues. We offer a fair on-call rotation and compensate with time off.
- Define and monitor Service Level Objectives (SLOs) and critical metrics to ensure our systems adhere to the highest reliability standards.
- Create incident response playbooks and establish strong monitoring and alerting capabilities for prompt and efficient issue resolution.
- Collaborate with core blockchain, middleware, and applications teams to ensure the security and high availability of our services.
- Engage with our geographically dispersed team, engage in open-source projects, and connect with the dynamic NEAR community.
Requirements
- Ability to effectively explain technical concepts to both technical and non-technical audiences.
- Proficiency in Python, in-depth knowledge of UNIX internals, and experience in cloud provisioning, monitoring, and CI/CD tools.
- Strong problem-solving skills with a proactive, solution-oriented approach.
- Minimum 7 years of experience in Site Reliability, DevOps, or Platform Engineering overseeing large-scale distributed systems.
- Proficiency in automation and tooling.
- Bachelor's Degree in Computer Science or related fields.
Additional Skills
- Experience with Rust and/or Go, familiarity with multiple cloud providers, and knowledge of Kubernetes, Helm, and GitOps.
- Fundamental understanding of blockchain technology to quickly grasp the unique challenges and opportunities in the Web3 space.
Interview Process
Our interviews are conducted via Zoom and generally include the following stages:
- Recruiter Call
- Hiring Manager Call
- 1st Round: Coding Interview in Python or Go, DevOps Troubleshooting Interview
- Final Round: Large System Design Interview, Pagoda Values Interview
Benefits & Perks
- Offered 20 days of flexible PTO per year, plus local holidays
- Wellness weeks with 2 weeks of paid company-wide closures
- Medical, dental, vision, AD&D, and life insurance coverage
- Access to licensed therapists and mental health resources
- Generous parental leave options and fertility assistance through Carrot
- 401(k) retirement plan (for US employees)
- Annual company retreats and team offsites
- Continued Education and Home Office Reimbursement, Co-working Space Reimbursement
Our Values at Pagoda
Our values reflect our company culture and can be found on our careers page.
Pagoda is an Equal Employment Opportunity (EEO) employer and welcomes all qualified applicants. We provide fair and impartial consideration without regard to various factors.
Global Data Privacy Notice for Job Candidates and Applicants
Information collected and processed as part of your Pagoda Careers profile, and job applications you submit, is subject to our Privacy Policy. By submitting your application, you agree to our use and processing of your data as required.
