You may not know it, but Regal Rexnord impacts your life every day. Our products enable the fans in HVAC systems that keep us comfortable; the power source that keeps smart buildings running; the agricultural and food service equipment that keeps us fed; and the conveyer systems that keep e-commerce flowing.
Our business purpose is to create a better tomorrow by energy-efficiently converting power into motion. For us, this means creating innovative solutions while focused on both customer needs and our commitment to sustainability. Join our team to create a better tomorrow together!
Position Title: Site Reliability Engineer
Reports To: IT COE Leader
Location: Eastern Standard Time – OR – Central Standard Time
Are you passionate about delivering highly responsive, scalable and top quality solutions that help solve key performance challenges? Regal Rexnord is seeking a highly motivated, innovative Site Reliability Engineer to help deliver performant software that assists our internal teams and customers. This role will heavily support our APM, while owning the design, development, testing and implementation of performance monitoring tools, and platform improvements. This role will collaborate across various functional teams/organizations to drive innovation in infrastructure capacity planning, reliability, delivery, and management.
We would like you to be a thought leader in the organization about how to use modern tools to resolve production issues with reliable, repeatable, automated processes to eliminate the errors that led to the problem. If you believe in automation as the key to reliability – if you like to think about infrastructure as code, auto-scaling, separating signal from noise in monitoring, and DevOps practices – if you feel comfortable owning the big picture of ecosystem performance, collaborating with specialists to solve problems and not just relying on them, then we would love to talk.
Successful traits for this role include the curiosity to understand complex environments from first principles, a driving need for root cause analysis, and a thought-leadership mentality about expanding automation for reliability.
You should have the background to be able to sort through monitoring data, tune what data is collected to pinpoint a problem and bring the right experts to the table. You should be able to identify the patterns in the granular monitoring data to tell whether a bad average performance metric is being driven by spikes of long delays, or a repeatable consistent problem.
Responsible for Application Performance Monitoring tool administration and management by monitoring availability and taking a holistic view of system health
Understand business / technical requirements and the overall business objectives of applications
Responsible for tracking and publishing of performance (availability, end-to-end response times, throughput, transactions per second, capacity, error rates, etc.) of applications and that it meet or exceed customer expectations, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
Reduce organizational ‘toil’ via automation, scripting, and implementation and management of toolsets
Responsible for identifying non-compliance areas and partnering with development and/or infrastructure to correct performance issues
Partner with operations and developers to develop real-time tools i.e. automated performance dashboards provide periodic updates on application performance to all the stakeholders
Develop tools and dashboards to automate performance monitoring, testing, analyzing, and reporting issues
Debug, troubleshoot, and work with the development and/or infrastructure teams to resolve and correct performance or scalability issues
Facilitate Root Cause Analysis (RCA), and continue to improve upon existing RCA methods and conduct post-mortems
Support different types of performance tests, for example, load, stress, volume, scalability, and endurance
Strategize and execute end to end performance engineering efforts to achieve highly flexible, scalable, distributed cloud, hybrid, and on-perm systems
Understanding Database performance and collaborating with DBA’s (SQL, Solr, etc.)
Analyze performance test results, and work with cross functional teams to identify performance bottlenecks and their root cause
Passion for providing and driving change to improve Performance, Scalability and Reliability of all systems
Efficiently work with various profiling tools such as Dynatrace, Lighthouse, GTMetrix, and others as needed. Point to identify performance, scalability, and concurrency bottlenecks
Required Education / Experience / Skills
Bachelor’s degree in Computer Science or related field
2+ years of experience in software application development or test automation
5+ years of Performance Engineer or related experience with high-traffic, large-scale distributed systems, client-server architectures both on-prem and cloud (Primarily Azure)
GitOps / Kubernetes experience is a plus
Experience with Deployment automation tools a plus
Excellent interpersonal and influencing skills to establish trust, credibility, and rapport within Regal Rexnord and with external customers/vendors
Demonstrated track record of partnering effectively with other, ability to work effectively as part of a team
The ability to produce high-quality work in a high-volume, fast-paced environment that requires meeting time-sensitive deadline
Proven ability to be hands on, demonstrate resourcefulness, initiative, results-orientation. Has a continuous improvement mindset and can embrace Regal’s 80/20 principles
While this position can be fully remote, 20% domestic travel to headquarter locations is required
English as a primary mode of communication
Regal Rexnord is an Equal Opportunity and Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex/gender, sexual orientation, gender identity, age, ancestry, national origin, marital status, citizenship status (unless required by the applicable law or government contract), disability or protected veteran status or any other status or characteristic protected by law. Regal Rexnord is committed to a diverse and inclusive workforce. We are committed to building a team that represents diverse and inclusive backgrounds, perspectives, and skills.