Job Opening
Lead Site Reliability Engineer - SRE
Permanent
Dublin
09-08-2023
RealTime are looking for a Lead Site Reliability Engineer to design, implement & lead a site reliability function and the team responsible for delivering on growth and industry-changing strategic objectives. A key responsibility will be monitoring & remediating systems, security, and network issues.
You will play a critical role in the design & development of tooling, monitoring, self-service reporting, and analysis approaches, and in establishing the policies & procedures that govern incident, change & problem management.
What you get
* 100% remote work - 100% of the time
* Excellent salary
* Unrivalled benefits
* Opportunity to develop and try new things
* WFH allowance
Skills & Responsibilities
* 7 years in a senior technical role: DevOps, Software Engineering, Systems Engineering.
* Experience designing, installing & configuring monitoring solutions for 24x7 environments.
* Strong cloud experience, ideally Azure (or AWS, GCP)
* Monitoring fundamentals associated with SNMP, WMI, Synthetic Transaction Engines
* Open-source monitoring: Splunk, Nagios, Zabbix, OneSite, Gomez, CA, Client OpenView...
* Scripting: PowerShell/Python.
* Monitoring tools: OpenTracing, OpenTelemetry
* APM: Elastic, Datadog, New Relic
* Understanding of Networking
* iSCSI/FC SAN/NAS/DAS storage, Hypervisor/Virtualization: VMware, Hyper-V
* AD/DNS/DHCP
* Architect & develop monitoring solutions for alert response & troubleshooting
* Act as technical lead, with hands-on scripting, tooling & automation for continuous operations.
* Triage incidents & document steps to resolve
* Work closely with technical support, security, engineers, & customers