Job Description
Title: Site Reliability Engineer (SRE)
Location: Austin, TX
Job Type: Full Time
Job Description:
Technical Skills:
- 6+ years of professional engineering experience developing, managing, or supporting distributed systems
- 4+ years of SRE experience managing multi-cloud platforms
- Strong troubleshooting skills for debugging multi-architecture systems; experience with microservices architecture patterns is a must.
- Strong experience in issue resolution, incident management, root cause analysis (RCA) creation, and follow-up.
- Enterprise cloud infrastructure experience, e.g., GCP, AWS.
- Strong working knowledge of modern development practices and tools, e.g., Agile, CI/CD, Git, Jira, and Confluence.
- Experience developing and managing operations that leverage key event-streaming, messaging, and database services, e.g., MQ/JMS/Kafka, Cloud SQL.
- Strong experience using industry-standard monitoring tools, e.g., AppDynamics, Dynatrace, Splunk, Grafana, Nagios, Datadog, New Relic, Tempo, Loki.
- Experience working with containers and container platforms, e.g., Docker, Kubernetes, Cloud Foundry.
- Deep knowledge of Internet protocols and web services technologies, e.g., HTTP, DNS, TCP/UDP, SOAP, JSON, and REST.