Job Description
Job Title: AI / ML Engineer – Observability
Location: Dallas, TX or Tampa, FL (Hybrid w/ 3 days onsite)
Work Type: Contract-to-Hire (CTH)
Position Summary
We are seeking a highly skilled AI/ML Engineer with strong Observability expertise to join our team as a strategic contributor. This role goes beyond hands-on engineering and requires a subject matter expert who can act as a thought partner, helping to design, recommend, and implement AI-driven observability solutions across applications, platforms, and business data.
The ideal candidate has experience building end-to-end observability solutions, a strong foundation in automation, and a deep understanding of modern observability principles and tooling.
Key Responsibilities
- Design, build, and enhance AI/ML-driven observability solutions across applications, infrastructure, and business data
- Act as a subject matter expert in observability, providing guidance and recommendations to engineering and leadership teams
- Implement and support observability frameworks using OpenTelemetry
- Develop dashboards, alerts, and analytics using Grafana or similar observability platforms
- Leverage AI/ML techniques to improve monitoring, anomaly detection, forecasting, and operational insights
- Build automated, scalable solutions to support end-to-end observability across multiple technologies
- Collaborate with cross-functional teams to align observability strategy with business and technical goals
- Evaluate and recommend new tools, platforms, and AI-driven approaches to enhance observability capabilities
- Ensure observability best practices are consistently applied across systems and environments
Required Qualifications
- Strong experience in AI/ML development and engineering
- Solid understanding of core observability concepts (metrics, logs, traces)
- Hands-on experience with OpenTelemetry
- Experience with observability platforms such as:
- Grafana (preferred), or
- Splunk, Dynatrace, or similar tools
- Strong background in automation and building innovative, scalable engineering solutions
- Ability to design and implement end-to-end observability architectures
- Excellent problem-solving and communication skills
Preferred Qualifications
- Experience recommending or implementing AI-driven observability or monitoring solutions
- Background working in complex, distributed systems environments
- Ability to quickly learn and adapt to new observability tools and platforms
- Experience influencing technical direction and strategy beyond individual contributor tasks