Infrastructure Operations Engineer
Job Description
Job DescriptionJob Title: Infrastructure Operations Engineer (Contract)
Contract Duration: 12 Months
Work Schedule: 35 hours per week (Monday–Friday); on-call and after-hours support for incidents and maintenance windows as needed
Work Location: New York, NY, with occasional travel to CUNY locations across the five boroughs
Pay Rate: $34-35.50/hr. This is a W2 position only.Position Overview
A major public education client is seeking a hands-on Infrastructure Operations Engineer to support and enhance enterprise infrastructure operations in a complex hybrid IT environment. This role will focus on stabilizing and improving infrastructure monitoring, alerting coverage, and operational readiness across servers, virtualization, storage, networking, and cloud platforms.
The engineer will play a key role in strengthening incident response processes, supporting infrastructure lifecycle maintenance and security remediation efforts, and improving backup and disaster recovery (DR) readiness. This role requires a service-oriented professional comfortable collaborating across teams, vendors, and operational units.
Project Description
The Infrastructure Operations Engineer will support an initiative to stabilize and enhance enterprise infrastructure operations by:
-
Strengthening infrastructure monitoring and alerting coverage
-
Improving incident detection and response execution
-
Supporting infrastructure lifecycle management and security remediation
-
Enhancing backup, replication, and disaster recovery operational readiness
-
Improving operational processes through automation, documentation, and monitoring improvements
This position is classified as Engineer I with an intermediate to senior operations focus.
Key ResponsibilitiesInfrastructure Monitoring & Incident Management
-
Monitor enterprise infrastructure environments including servers, networking, storage, virtualization, and cloud platforms to ensure reliability and availability.
-
Configure, maintain, and optimize monitoring and alerting dashboards, alerts, and reports to detect anomalies and potential service disruptions.
-
Respond to alerts and operational incidents by investigating issues and coordinating resolution with infrastructure, application, and cybersecurity teams.
-
Support enterprise observability platforms and improve alert accuracy and noise reduction.
Infrastructure Operations & Support
-
Assist in the design, deployment, and operational support of infrastructure solutions across on-premises, hybrid, and cloud environments.
-
Support deployment, configuration, and lifecycle management of:
-
Servers (Windows and Linux)
-
Virtualization platforms
-
Storage systems
-
Networking infrastructure
-
Cloud services
-
-
Participate in system upgrades, patching cycles, maintenance windows, and performance optimization initiatives.
Cloud & Hybrid Environment Support
-
Provide operational support for public cloud and hybrid infrastructure environments including IaaS, PaaS, and SaaS platforms.
-
Assist with cloud administration, monitoring, and operational best practices across environments such as Azure, AWS, or Google Cloud.
Security, Compliance & Resilience
-
Support implementation of security controls and vulnerability remediation in accordance with cybersecurity standards.
-
Assist with data protection operations including backups, replication, and disaster recovery configurations.
-
Participate in disaster recovery testing and resilience planning activities.
Process Improvement & Automation
-
Improve operational efficiency through automation, scripting, configuration management, and documentation.
-
Develop and maintain operational runbooks, monitoring documentation, and technical procedures.
-
Participate in ITSM processes including incident, problem, and change management.
Collaboration & Operational Support
-
Work closely with internal infrastructure teams, application teams, cybersecurity teams, and external vendors.
-
Participate in on-call rotations and after-hours maintenance activities as needed.
-
Support special projects and infrastructure initiatives across the organization.
Required Qualifications
-
5+ years of experience supporting enterprise IT infrastructure environments including servers, storage, networking, virtualization, and cloud platforms.
-
3+ years of experience supporting enterprise monitoring, alerting, and operational management tools in complex environments.
-
Strong operational knowledge of:
-
Windows and Linux server administration
-
Virtualization platforms
-
Networking protocols and infrastructure
-
Storage systems
-
Cloud services
-
-
Experience supporting hybrid or cloud environments (Azure, AWS, or Google Cloud).
-
Familiarity with enterprise observability and monitoring platforms such as SolarWinds, Splunk, Datadog, Dynatrace, Nagios, or similar.
-
Experience with automation and scripting tools such as PowerShell, Python, Ansible, or Terraform.
-
Strong troubleshooting, communication, and technical documentation skills.
-
Ability to work independently while collaborating effectively across cross-functional teams.
-
Willingness to participate in after-hours incident response and maintenance activities.
Tools & TechnologiesTypical Software Used
-
Monitoring and observability platforms (SolarWinds, Splunk, Datadog, Dynatrace, Nagios or similar)
-
ITSM and ticketing systems
-
Virtualization management tools (e.g., vCenter)
-
Server, storage, and network administration tools
-
Cloud management portals (Azure, AWS, Google Cloud)
-
Automation and scripting tools (PowerShell, Python)
-
Configuration management tools (Ansible, Terraform)
Security Requirements
Candidates must comply with security policies and procedures, including:
-
Multi-factor authentication (MFA) requirements
-
Least-privilege access for privileged accounts
-
Background check and confidentiality requirements as applicable
Knowledge Transfer & Training
The selected candidate will be expected to:
-
Conduct working sessions with CIS staff covering:
-
Monitoring standards and observability practices
-
Alert tuning and noise reduction strategies
-
Incident response playbooks
-
Operational runbooks
-
-
Maintain and deliver updated operational documentation including dashboards, alert configurations, and procedures.
-
Provide structured knowledge transfer to CIS operations staff throughout the engagement.
Travel Requirements
Occasional travel may be required to locations across New York City’s five boroughs for on-site support, meetings, or operational activities. Participation in on-call rotations may also be required.
#ZR
APPLY NOW!
Integrated Staffing values a diverse, inclusive workforce and we provide equal employment opportunity for all applicants and employees. All qualified applicants for employment will be considered without regard to an individual’s race, color, sex, gender identity, gender expression, religion, age, national origin or ancestry, citizenship, physical or mental disability, medical condition, family care status, marital status, domestic partner status, sexual orientation, genetic information, military or veteran status, or any other basis protected by federal, state or local laws. Integrated Staffing will reasonably accommodate qualified individuals with disabilities to the extent required by applicable law.
Staffing solutions that exceed expectations and build relationships.