Senior DevOps Engineer (Network Specialist)
BitMEX is an innovative cryptocurrency derivatives trading platform at the forefront of the industry, known for its commitment to driving change and setting high standards in innovation, liquidity, and security.
As a pioneer in cryptocurrency trading, BitMEX operates a highly advanced peer-to-peer trading platform and API, empowering hundreds of thousands of traders worldwide with knowledge, precision, and confidence to transact significant volumes daily.
About Us
BitMEX stands out as a leading exchange for crypto derivatives, meeting the needs of institutional and professional traders. Noteworthy for creating the Perpetual Swap, one of the most popular crypto trading products globally, BitMEX continues to introduce cutting-edge cryptocurrency derivative products such as the ETH Stake Swap.
The platform offers users the ability to engage in cryptocurrency derivatives trading on a professional-grade platform with features like low latency, deep liquidity, and steadfast availability. BitMEX boasts a strong record of security, having safeguarded all cryptocurrencies since inception in 2014, providing users with a secure trading environment. Additionally, it supports spot trading and offers seamless purchase and conversion of cryptocurrencies.
BitMEX thrives on welcoming individuals who embody characteristics like determination, responsibility, and collaboration. The team values attention to detail, efficiency, and clarity in work. Flexible and adaptable professionals are sought after to work across different markets and time zones, supporting the platform's operations 24/7.
Role Overview
Joining the Platform Engineering team will have you taking charge of managing and supporting the infrastructure that underpins BitMEX's platform. The reliability and scalability of the technology are instrumental to the platform's success, and this role involves collaborating with development and security teams to create resilient and fault-tolerant systems.
Focused on optimizing network performance to uphold low-latency and high-throughput operation of the trading exchange, this role plays a crucial part in enhancing our systems.
Key Responsibilities
- Enhancing the resiliency, throughput, and latency profiles of the trading systems in close collaboration with trading technology teams
- Managing and maintaining AWS cloud infrastructure, EC2 instances, and physical servers
- Developing and managing Infrastructure as Code (IaC) to ensure infrastructure consistency
- Conducting security hardening of OS builds and configurations
- Configuring and maintaining config management tools to uphold consistency
- Integrating the stack with Kubernetes
- Implementing SRE best practices for stack design and operation
- Designing, executing, and testing disaster recovery capabilities for seamless business continuity during technology failures
- Participating in an on-call rotation for escalations
Qualifications
- A solid foundation in theoretical and practical networking knowledge with experience in multiple areas like routing protocols, Linux TCP stack implementation, AWS VPC, Kubernetes VPC CNI, etc.
- Professional experience in kernel troubleshooting, userland monitoring, logging, alerting, troubleshooting, and profiling/tracing
- Strong AWS knowledge with a minimum of 5 years of SRE/DevOps experience managing Linux-based systems; a degree in computer science or engineering is preferred
- Familiarity with Kubernetes, Ansible, Chef, and programming languages like Python, Golang, C, NodeJS
Join us in our journey of developing a vibrant cryptocurrency ecosystem and shaping the future of digital financial services through strategic investments in emerging cryptocurrency technology.