Job Description
Job DescriptionDescription:Description
Do you excel at juggling multiple priorities across IT, Information Security, Cloud Engineering, Site Reliability, and Infrastructure? Do you have a knack for explaining complex technical ideas to senior leadership without sending them into acronym overload? Are you highly organized, hands-on, and ready to steer a diverse team to ensure our software is reliable, scalable, and secure? If so, read on…
I’m the CTO at RTA, and I’m looking for an Dev Ops Manager, that will be a player coach to guide our company’s backbone—our technology infrastructure. This role is about leading a diverse team (DEVOPS, Cloud Engineering, SRE, and Infrastructure) while also being hands on, and collaborating closely with Product, Engineering, and Support to keep our systems humming. You’ll need solid experience in Cloud Operations (AWS), Site Reliability Engineering (monitoring & observability), DEVOPS, or combination of. If this sounds like you, you could be the peanut butter to our jelly! Read on and apply!
What We’re Looking For
In general, someone who:
- Is passionate about serving others.
- Thinks of themselves less, while not thinking less of themselves. (You’re confident, yet humble.)
- Is comfortable being part of a team that thrives on healthy conflict. People with thin skin need not apply. No, seriously.
- Passionately cares about our clients by helping them be more successful. Our clients are fleet managers, parts clerks, and automotive technicians who maintain everything from squad cars to school buses—so everyone comes home safely at the end of the day.
- Is willing to lift boxes, clean floors, and hold doors if that’s what it takes to get something done, because no job is beneath them.
- Takes ownership and initiative in their job, identifying how to make processes and teams better without waiting for permission.
- Loves to read, learn, grow, and stretch themselves. Bonus points for each book they’ve read by Patrick Lencioni!
Specifically for This Job, Someone Who:
- Has 7+ years of hands-on experience in Cloud Operations (AWS), DEVOPS, and/or Site Reliability Engineering.
- Understands leadership concepts and can lead a diverse technical team spanning Cloud Operations, SRE, DEVOPS, and Infrastructure. Previous management experience is a big plus—but not strictly required—so long as you can demonstrate leadership, organization, and communication skills.
- Is highly organized, capable of tracking multiple tasks, projects, and priorities without missing a beat.
- Excels at cross-team collaboration, working closely with Product, Engineering, and Support to deliver reliability, scalability, and stability to the RTA world.
- Communicates effectively with senior leadership, translating technical speak into clear business terms.
- Gets hands dirty when needed—jumping in to troubleshoot a production incident, refine monitoring alerts, or optimize a CI/CD pipeline.
- Can articulate common software design methodologies (e.g., microservices, DEVOPS best practices) to ensure our solutions are robust and future-proof.
- Performs well under pressure and maintains composure during incidents, calmly guiding the team to resolution.
- Holds relevant certifications in Cloud (AWS, Azure, GCP), Site Reliability, DEVOPS, or other engineering tooling.
Key Responsibilities
- Team Leadership: Manage, mentor, and grow a diverse team (Cloud Engineers, SRE, Infrastructure, and DEVOPS) to ensure smooth operations.
- Reliability & Scalability: Champion best practices around monitoring, alerting, and capacity planning to keep our software running flawlessly.
- Cross-Functional Collaboration: Work closely with Product, Engineering, and Support to ensure new features and services are designed for reliability and security from the get-go.
- Project & Incident Management: Oversee critical projects, respond to incidents, and drive post-incident retrospectives to eliminate root causes.
- Strategic Planning: Maintain a high-level view of our technical infrastructure, anticipating future needs and proposing enhancements or new initiatives.
- Hands-On Troubleshooting: Be ready to dive into the details when necessary, whether it’s AWS configuration, Docker deployment, or investigating an InfoSec alert.
- Communication & Reporting: Present updates, proposals, and findings to senior leadership in a clear and concise manner—minus the mind-numbing jargon.
Key Results Areas (aka the Job Outcomes)
- Highly Reliable Systems: Our software and services are known in the industry for uptime, stability, and performance.
- Scalable Infrastructure: We’re prepared to handle growth and new initiatives without missing a beat.
- Strong Team Dynamics: Your diverse team collaborates effectively, feels supported, and consistently meets objectives.
- Transparent Communication: Stakeholders (internal and external) know exactly where we stand on technical initiatives and system health.
- Security & Compliance: We maintain robust, up-to-date security measures and practices, keeping client and company data safe.
Qualifications
OK, the “boring” HR part that’s necessary:
- 7+ years of hands-on Cloud Operations (AWS), Site Reliability Engineering, or related field.
- Leadership Skills: Proven ability (formal or informal) to lead, organize, and mentor teams.
- Proficient with monitoring & observability tools (e.g., Grafana, Prometheus, Datadog, NewRelic, Splunk), CI/CD pipelines, and DevOps practices.
- Experience with AWS, Docker, and modern infrastructure management (e.g., Terraform, CloudFormation) is a strong plus.
- Certifications like AWS Solutions Architect or SRE-specific certs are a big bonus.
- Bachelor’s Degree not required but preferred, especially in a related field (e.g., computer science, MIS).
The Bottom Line
You’ve made it this far—congratulations! We are really looking for ideal team players with an almost frightening intensity around customer service and a passion for serving others. Total compensation for the role is between $150k and $170k. This is a full-time, hybrid role in Glendale, AZ, working side by side with the engineering, product, senior leadership, and more. If all of this is checking off the items on your list, we’d love to hear from you!
About Us
RTA has been established since 1979 and has the reputation of providing the best customer service in the market. Our purpose is to help fleets succeed. We pride ourselves on creating a caring, family-oriented atmosphere for both staff and clients, and love that our work makes a positive impact on the lives we touch. Our clients carry kids in school buses, first responders in emergency vehicles, patients in ambulances, food and medical supplies in trucks, and people just taking the bus or train to work. We do meaningful work, and we want our clients to have the best tools available to them.
Our office spaces are open, spacious, and colorful, with plenty of natural light. We come together often as a company to enjoy freshly baked desserts or awesome lunches and genuinely enjoy each other’s company. We offer some pretty unique perks and benefits, as well as all the standard ones. We’re happy to talk through all the options!
Coming from Scottsdale? You’ll enjoy waving at the traffic going the other way while never having to stare at the blinding sun. It only takes about 25 minutes from downtown Scottsdale in the mornings. We are located close to Arrowhead Mall, with quick access to the 101 from multiple directions.
If this sounds like your kind of company—and your kind of role—then click apply! We may have asked you four times already, but you’re still here, so you must really be thorough (bonus points!). We can’t wait to see if you’re the Pepper to our Potts when it comes to orchestrating our tech ecosystem.
#LI-AE1
Requirements: