Description
At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for Enterprises as a diverse team of fellow creators and inventors. We act with the speed and attitude of a start-up, with the scale and customer-focus of the leading enterprise software company in the world.
Values are OCI’s foundation and how we deliver excellence. We strive for equity, inclusion, and respect for all. We are committed to the greater good in our products and our actions. We are constantly learning and taking opportunities to grow our careers and ourselves. We challenge each other to stretch beyond our past to build our future.
You are the builder here. You will be part of a team of smart, motivated, and diverse people and given the autonomy and support to do your best work. It is a dynamic and flexible workplace where you’ll belong and be encouraged.
OCI Network Availability is looking for a Senior Network Reliability Engineer to build and operate services that enhance the availability of the Oracle Cloud Network Infrastructure. A Network Reliability Engineer (NRE) applies an engineering approach to measuring and automating a network's reliability so that it aligns with the organization's service-level objectives, agreements, and goals. The NRE team responds promptly to network disruptions, pinpoints the underlying cause, and collaborates with internal and external stakeholders to fully restore functionality. NRE team members also automate recurring tasks in daily operations to streamline processes, enhance workflow efficiency, and increase overall productivity. Because OCI is a cloud-based network with a global footprint, this support covers hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure, Clos networks, and the Internet.
Career Level - IC3
Responsibilities
In this role you will:
Support the design, deployment, and operations of a large-scale global Oracle cloud computing environment (Oracle Cloud Infrastructure - OCI)
Use and contribute to procedures and tools to develop and safely execute network changes
Develop solutions for support team members to act on network failure conditions
Mentor junior engineers
Participate in network solution and architecture design processes
Provide break-fix support for network events, serve as the escalation point for event remediation, and lead post-event root cause analysis
Develop scripts to automate routine tasks for teams and business units
Coordinate with network automation service teams for the development and integration of support tooling
Coordinate with network monitoring teams to gather telemetry data and create network event alert rules
Build dashboards to represent and analyze network performance data
Collaborate with network vendor technical account team and quality assurance team to drive bug resolution and assist with qualification of new firmware and/or operating systems
Participate in an on-call rotation
Preferred Qualifications:
Bachelor’s degree in CS or a related engineering field with 5+ years of network engineering experience, or a Master’s degree with 2+ years of network engineering experience
Experience working in a large ISP or cloud provider environment
Experience working in a network operations role
Strong knowledge of protocols and services such as MPLS, BGP/OSPF/IS-IS, TCP, IPv4, IPv6, DNS, DHCP, VXLAN, and EVPN
Extensive experience with scripting or automation and data center design – Python preferred, but must demonstrate expertise in a scripting or compiled language
Experience with networking protocols such as TCP/IP, VPN, DNS, DHCP, and SSL
Experience with network monitoring and telemetry solutions
Experience with network modelling and programming – YANG, OpenConfig, NETCONF
Ability to use professional concepts and company objectives to resolve complex issues in creative and effective ways
Capable of working under limited supervision
Excellent organizational, verbal, and written communication skills
Excellent judgement in influencing product roadmap direction, features, and priorities