Site Reliability Engineer
RT² offers the most flexible, cutting-edge Retail Management Solutions that encompass sales,
inventory management, frontline employee management and engagement, payments, business
intelligence, and digital automation tools for the wireless industry. We support Fortune 500
companies, unify their customer experience, and remove pain points across multiple retail touch
points. RT² prides itself on fostering a team-oriented culture and a dynamic work environment,
where team members are set up to make meaningful contributions across the organization.
Site Reliability Engineer - RT² is looking for a Site Reliability Engineer (SRE) to join our team. This SRE will be a key player in maintaining and improving the reliability of our systems. We're seeking SREs with experience in infrastructure tools like Terraform, Bicep, and Ansible, spanning both on-premises and cloud environments, including Azure. Your role will involve enhancing system stability, optimizing performance, and automating deployment processes. If you're a proactive problem solver with a passion for infrastructure and continuous improvement, this SRE position offers an exciting opportunity to make a meaningful impact.
Responsibilities:
- Help maintain and enhance production monitoring and notifications.
- Improve reliability and quality of production systems.
- Measure and help optimize system performance.
- Work with delivery and other teams to identify potential points of failure, then help improve systems to mitigate them.
- Participate in capacity planning.
- Create automation to improve deployment speed, testing, and response to operational issues.
- Work to meet service level objectives.
- Help build runbooks, tools, and other supporting resources to improve incident response.
- Monitor production systems and help manage incident response.
- Participate in postmortems and document outages, recovery steps, and future mitigation strategies.
- Work on both on-premises (data center) and cloud-based infrastructure (Azure).
Qualifications:
- Experience working with server operating systems such as Windows, Unix, and Linux.
- Experience with monitoring tools such as the ELK stack, Grafana, and Azure Application Insights.
- Experience with Git or other distributed source control systems.
- Bachelor’s degree (or equivalent) in computer science or related discipline
- Experience with infrastructure tools such as Terraform, Bicep, and Ansible.
- Experience with both on-premises environments and cloud providers, preferably Azure.
- Experience with Hyper-V and VMware.
- Experience with CI/CD pipelines such as Azure Pipelines, GitHub Actions, and Octopus Deploy.
- Experience with scripting languages such as PowerShell, Python, and Bash.
- Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.
- Experience with observability and alerting tools such as Grafana, UptimeRobot, ELK, and PagerDuty.
- Experience working with Agile methodologies.
Our pay structure takes into account various geographical markets within the United States. The base salary for this role reflects the typical expected earnings. However, the final compensation package is determined by several factors, such as your location, job-specific expertise, skills, experience, and other relevant job-related considerations.
What We Offer:
- A unique opportunity to shape the journey of RT²
- Working within a rapidly growing, game-changing business
- Remote, flexible working options
- Competitive compensation
- Generous short-term incentive (STI) and long-term incentive (LTI) provisions
- Health, Dental and Vision Insurance
- Paid Annual Leave
- Paid Sick Leave
- 401(k), and more