Job Description
Job Description
AssetWatch serves global manufacturers by powering manufacturing uptime through the delivery of an unparalleled condition monitoring experience, with a passion to care about the assets our customers care for every day. We are a devoted and capable team that includes world-renowned engineers and distinguished business leaders united by a common goal – To build the future of predictive maintenance. As we enter the next phase of rapid growth, we are seeking people to help lead the journey.
AssetWatch serves global manufacturers by powering manufacturing uptime through the delivery of an unparalleled condition monitoring experience, with a passion to care about the assets our customers care for every day. We are a devoted and capable team that includes world-renowned engineers and distinguished business leaders united by a common goal – To build the future of predictive maintenance. As we enter the next phase of rapid growth, we are seeking people to help lead the journey.
We are seeking a DevOps Engineer who brings innovation and forward-thinking to designing and managing cloud-native infrastructure. With deep knowledge of modern compute, you bring ideas for observing multi-faceted platforms to ensure high uptime for our customers. We believe modern infrastructure is a holistic endeavor that balances reliability, security, and flexibility for our engineering teams. You're excited to leverage multiple AWS services—and to share that excitement with others.
This is a remote position. Core work hours are in the Eastern Time Zone. US-only.
What's The Opportunity
- Empower engineering teams and allow them to focus on building features by ensuring the reliability and uptime of our platform, built in AWS.
- Actively seek to bring new ideas forward and look for opportunities for improvement.
- Build and own the observability platform for end-to-end user journeys across AWS (API Gateway, Lambda, RDS, SQS, AppSync), driving metrics/events/logs/traces, SLOs, and actionable alerts.
- Partner with squads to turn delivery needs into templates: opinionated GitHub Actions workflows, trunk/branch protections, required checks, Terraform plans, SBOMs, least-privilege deploy roles, and canary/blue-green releases.
- Build event-driven services on AWS (API Gateway + Lambda + EventBridge) with robust retry policies, cold-start tuning, autoscaling, and cost/perf guardrails.
- Create reusable deployment patterns for serverless APIs, workers, and schedulers with security (least-privilege IAM/KMS), observability, and release strategies baked in. Using Terraform and other tooling for deployment
- Optimize API performance, database interactions, and infrastructure bottlenecks impacting user experience.
- Build tools and scripts to streamline operational workflows and development efficiency.
- Continuously optimize cost/performance (profiling, right-sizing, autoscaling policies).
- Drive incident response; participate in a lightweight on-call rotation; champion postmortems and prevention.
- Write and maintain clear documentation for infrastructure, deployment, and operations.
Tech We Use
- Cloud & Infrastructure: AWS EC2, ECS, Lambda, RDS Aurora, S3, API Gateway, AppSync (GraphQL), SQS, SNS, CloudWatch, Terraform
- CI/CD & Automation: GitHub Actions, Python, Docker, Bash/Zsh
- Security: AWS Inspector, WAFv2, Security Hub, Macie, GuardDuty
- Databases: Amazon Aurora (MySQL-compatible), DynamoDB
- Monitoring & Observability: Grafana, SigNoz, Datadog, CloudWatch
- Development: Python, TypeScript
- Tools: Jira, Slack, Notion, VS Code (GitHub Copilot, Cursor, Windsurf)
Qualifications
- A DevOps engineer with strong experience in AWS services and cloud infrastructure management.
- 4+ years of experience designing, building, and maintaining AWS infrastructure using Terraform to support backend systems.
- Strong knowledge of AWS cloud services: Lambda, SQS, SNS, CloudWatch, API Gateway, AppSync, RDS Aurora, and S3.
- Proven Terraform experience—module design/versioning, CI policy checks (fmt/validate/plan), and production-grade IaC workflows (reusable modules, environments, promotions).
- Experienced in observability systems, performance monitoring, and alerting infrastructure.
- Deep experience with CI/CD pipelines, GitHub Actions, and automation workflows.
- Excellent communicator with a proactive, execution-focused mindset.
- Ability to troubleshoot and support backend services built with TypeScript, Node.js, or Python, with a solid understanding of runtime behavior.
- Independent problem-solver with a collaborative spirit and strong sense of ownership.
- Familiar with GraphQL, REST APIs, and mobile-focused backend systems.
Nice to Have
- AWS Certified Developer and/or Solutions Architect certification.
- Experience with Linux-based operating systems.
- Understanding of security best practices and API protection.
- Skilled in MySQL and NoSQL database administration and optimization.
- Familiarity with feature flag systems and A/B testing.
#LI-REMOTE
What We Offer:
AssetWatch is a remote-first company that puts people at the center of everything we do. We want our team members to thrive - that's why we offer a range of benefits and perks designed to support your well-being, growth, and work-life balance.
- Competitive compensation package including stock options
- Flexible work schedule
- Comprehensive benefits including retirement plan match
- Opportunity to make a real impact every day
- Work with a dynamic and growing team
- Unlimited PTO
We have a distributed team that works remotely across locations in the United States and Ontario, Canada. Collaboration within core working hours is required.