Job Description
Job Description
ComResource is looking for a DevOps Engineer.
Responsibilities:
- Design, implement, and scale technologies, approaches, and platforms to support high-availability systems.
- Deploy and maintain deliverables in all phases of the Software Development Life Cycle (SDLC) by collaborating with business, development, and systems engineering teams.
- Define scope for infrastructure buildouts, application environment maintenance, application deployment, and performance testing.
- Establish milestones for infrastructure buildouts, engineering, development, testing, and implementation of application environments.
- Manage multi-channel software development lifecycle, including enhancements, modifications, system configuration, migrations, upgrades, and production support.
- Provide support and troubleshooting for all related systems and technologies.
- Participate in troubleshooting efforts to identify root causes and provide recommendations to prevent recurrence.
- Implement application optimization techniques such as CDN, cloud, and caching.
- Build and operate a high-performance, stable, and resilient cloud platform.
- Focus on Site Reliability Engineering practices.
- Build and refine CI/CD pipelines using GitHub Actions and Argo Workflows to support engineering teams.
- Work closely with cross-functional IT and technology partners to ensure system interactions meet standards.
- Align technology with business strategy, including scope definition, cost estimations, resource allocation, process design, technical specifications, data management, compliance, and testing.
Essentials:
- Bachelor's degree in Information Technology, Computer Science, Engineering, Business Administration, or related field.
- 3–5 years of experience, including 3+ years of hands-on experience implementing, operating, and maintaining infrastructure for high-volume enterprise applications.
- 3+ years of experience in distributed system development, including design and support of systems with scalability and disaster recovery robustness.
- 3+ years of operations experience with production systems in public cloud environments; GCP preferred.
- Proficient with containerization and cluster management technologies, including Kubernetes and Docker.
- Deep understanding and hands-on experience with cloud-native deployment and monitoring tools, including Kubernetes, Helm charts, container-based deployment, Service Mesh, Prometheus, and Grafana.
- Experience in operating systems (Windows, RedHat, CentOS, Amazon Linux), networking (Akamai, Nginx, Apache, AWS/GCP VPC), and software packages such as Terraform and Bash.
- Experience integrating monitoring, alerting, and reporting tools such as NewRelic, Akamai, Grafana, Elasticsearch, and Prometheus.
- 3+ years of cloud engineering experience with Terraform and GCP or AWS.
- 3+ years of experience implementing and supporting microservices architecture using containers with tools such as Docker, AWS ECS, or GCP Compute/GKE.
- Experience working with database systems such as Cloud SQL, BigQuery, AWS RDS, Oracle SQL, MongoDB, and Elasticsearch.
- Experience building CI/CD pipelines using ArgoCD, Argo Workflow, and GitHub Actions.
- Experience supporting open-source web and application services, including Java, Ruby, PHP, Python, and Perl.
- Experience with bash, Perl, or other shell scripting.
- Experience with Git fundamentals.
- Experience with Level 1 & 2 support, system monitoring, and participation in 24/7 on-call support activities.
Desired:
- Experience in SOX/PCI compliance.
- Retail industry experience.
Req ID: CG424958527