Hi, We are hiring for a Site Reliability Engineer (SRE).
Rate: $73 Type: C2C Location: Hybrid Dallas Texas Visa type: Any Visa Role Overview We are looking for a Site Reliability Engineer who combines reliability engineering with a strong testing mindset. This role focuses on building resilient systems, improving service availability, and validating system behavior through automated unit and integration testing. You will work closely with development and platform teams to ensure changes are safe, observable, and production-ready. Key Responsibilities • Design, build, and operate reliable, scalable, and observable production systems. • Define and enforce SLOs, SLIs, and error budgets to guide engineering and release decisions. • Develop and maintain automated unit, integration, and end-to-end test suites to validate system behavior across environments. • Use Selenium to implement and maintain automated UI and workflow tests as part of CI/CD pipelines. • Integrate test automation into deployment pipelines to enable early detection of reliability, performance, and functional regressions. • Perform failure analysis, incident response, and post-incident reviews with a focus on long-term reliability improvements. • Partner with application teams to improve testability, resiliency, and deployment safety of services. • Build tooling and automation to reduce toil and improve operational efficiency. • Continuously improve monitoring, alerting, and logging to support proactive issue detection. Required Skills & Experience • Strong experience as an SRE, Production Engineer, or Reliability-focused DevOps Engineer. • Hands-on experience with unit and integration testing frameworks (e.g., JUnit, TestNG, PyTest, or similar). • Solid experience with Selenium for automated UI and workflow testing. • Experience integrating automated tests into CI/CD pipelines. • Strong scripting or programming skills (Java, Python, or similar). • Practical experience with monitoring, alerting, and observability tools. • Experience troubleshooting distributed systems in production environments. Nice to Have • Experience with performance and load testing tools. • Knowledge of chaos testing or fault-injection practices. • Familiarity with cloud-native environments and containerized workloads. • Exposure to service-level driven engineering practices. *Thanks,* *Lyra Dass* *Human Resources ManagerDigital Resource Partners LLC* *+1(945)248-3020* *https://drpscorp.com/ <https://drpscorp.com/>* -- You received this message because you are subscribed to "rtc-linux". Membership options at http://groups.google.com/group/rtc-linux . Please read http://groups.google.com/group/rtc-linux/web/checklist before submitting a driver. --- You received this message because you are subscribed to the Google Groups "rtc-linux" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion visit https://groups.google.com/d/msgid/rtc-linux/CAEL7yFSqZj3UjvGz32b4dZitEm-Ng_WEe1O5NAS_7EQ_BncuTg%40mail.gmail.com.
