OCI Network Availability is looking for a Senior Network Reliability Engineer to build and operate services that enhance the availability of the Oracle Cloud Network Infrastructure.
A Network Reliability Engineer (NRE) applies an engineering approach to measuring and automating a network's reliability so that it aligns with the organization's service-level objectives, agreements, and goals. The NRE team responds promptly to network disruptions, pinpoints the underlying cause, and collaborates with internal and external stakeholders to fully restore functionality. NRE team members also automate recurring tasks in daily operations to streamline processes, improve workflow efficiency, and increase overall productivity. Because OCI is a cloud-based network with a global footprint, this support covers hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure, Clos networks, and the Internet.
In this role you will:
* Support the design, deployment, and operations of a large-scale global Oracle cloud computing environment (Oracle Cloud Infrastructure - OCI)
* Use and contribute to procedures and tools to develop and safely execute network changes
* Develop solutions for support team members to act on network failure conditions
* Participate in network solution and architecture design processes
* Provide break-fix support for network events, serve as the escalation point for event remediation, and lead post-event root cause analysis
* Develop scripts to automate routine tasks for teams and business units
* Coordinate with network automation service teams for the development and integration of support tooling
* Coordinate with network monitoring teams to gather telemetry data and create network event alert rules
* Build dashboards to represent and analyze network performance data
* Collaborate with network vendor technical account teams and quality assurance teams to drive bug resolution and assist with qualification of new firmware and/or operating systems
* Participate in an on-call rotation
Preferred Qualifications:
* Bachelor’s degree in CS or a related engineering field with 5+ years of network engineering experience, or Master’s degree with 2+ years of network engineering experience
* Experience working in a network operations role
* Strong knowledge of protocols and services such as MPLS, BGP/OSPF/IS-IS, TCP, IPv4, IPv6, DNS, DHCP, VXLAN, and EVPN
* Extensive experience with scripting or automation and data center design – Python preferred, but must demonstrate expertise in a scripting or compiled language
* Experience with networking protocols such as TCP/IP, VPN, DNS, DHCP, and SSL
* Experience with network monitoring and telemetry solutions
* Experience with network modelling and programming – YANG, OpenConfig, NETCONF
* Ability to use professional concepts and company objectives to resolve complex issues in creative and effective ways
* Capable of working under limited supervision
* Excellent organizational, verbal, and written communication skills
* Excellent judgement in influencing product roadmap direction, features, and priorities
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Information Technology
Industries
IT Services and IT Consulting