OCI Network Availability seeks a Senior Network Reliability Engineer to enhance the availability of Oracle Cloud Network Infrastructure services.
Job Description
A Network Reliability Engineer applies an engineering approach to measure and automate network reliability, aligning with service-level objectives, agreements, and goals. The role entails promptly responding to network disruptions, identifying root causes, and collaborating with stakeholders for full restoration. The team oversees automation of recurring tasks to streamline processes and increase productivity in OCI's cloud-based network with a global footprint.
This position involves:
* Supporting the design, deployment, and operations of Oracle Cloud Infrastructure (OCI)
* Using and contributing to procedures and tools for network change development and execution
* Developing solutions for support team members to act on network failure conditions
* Participating in network solution and architecture design processes
* Providing break-fix support for network events, serving as an escalation point for event remediation, and leading post-event root cause analysis
* Developing scripts to automate routine tasks for teams and business units
* Coordinating with network automation service teams for support tooling development and integration
* Coordinating with network monitoring teams to gather telemetry data and create network event alert rules
* Building dashboards to represent and analyze network performance data
* Collaborating with network vendor technical account teams and quality assurance teams to drive bug resolution and assist with new firmware and/or operating system qualification
* Participating in an on-call rotation
Requirements
The ideal candidate possesses:
* Bachelor's degree in Computer Science or a related engineering field with 5+ years of Network Engineering experience, or a Master's degree with 2+ years of Network Engineering experience
* Experience working in a network operations role
* Strong knowledge of protocols and services such as MPLS, BGP/OSPF/IS-IS, TCP, IPv4, IPv6, DNS, DHCP, VXLAN, and EVPN
* Extensive experience with scripting or automation and with data center design; Python is preferred, but candidates must demonstrate expertise in a scripting or compiled language
* Experience with networking protocols and technologies such as TCP/IP, VPN, DNS, DHCP, and SSL
* Experience with network monitoring and telemetry solutions
* Experience with network modelling and programming – YANG, OpenConfig, NETCONF
* Ability to use professional concepts and company objectives to resolve complex issues creatively and effectively
* Ability to work under limited supervision
* Excellent organizational, verbal, and written communication skills
* Excellent judgement in influencing product roadmap direction, features, and priorities
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Information Technology
Industries
IT Services and IT Consulting