DescriptionThe Oracle Cloud Infrastructure (OCI) delivers mission-critical applications for top tier enterprises around the world. Our cloud offers unmatched hyper-scale, multi-tenant services deployed in more than 40 regions worldwide.
The mission of our Network Reliability Engineering team is to provide services that allow our customers to drive operational excellence in OCI networks at scale. Our customers want auto-remediation of incidents, touchless and automated operations such as upgrading 1000s of devices and pushing configuration changes safely, adding network capacity seamlessly. We help Oracle build best-in-class network automation services to meet our customers' needs.
As a Principal Network Reliability Engineer, you will be responsible for helping design, build, test, deploy and operate highly scalable solutions to support Oracle’s next-generation Cloud Infrastructure. You should be comfortable with building within complex distributed systems involving the management of 1000s of devices. You should be a rock-solid coder with a strong understanding of networking and distributed systems. You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn. You should also be able to manage multiple projects and priorities in a fast-paced environment, be results-driven, and thrive even when faced with ambiguity and uncertainty. Most importantly, you should be a team player, someone who is eager to learn, and willing to do whatever it takes to help the team succeed.Career Level - IC4ResponsibilitiesIn this role you will work with Network Reliability Engineering (NRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.
Required Qualifications:• 5+ years of experience in network and software development• Proficiency with Java/Python/C++ and Object Oriented programming• Proficiency with network technologies and protocols (TCP/IP, BGP, OSPF, MPLS, etc.)• Excellent knowledge of basic data structures, search/sort algorithms• Excellent organizational, verbal, and written communication skills• Bachelor’s degree in Computer Science or a related field
Preferred Qualifications:• Knowledge of cloud computing & networking technologies• Experience developing service-oriented systems, REST, python• Experience developing network automation or device management solutions• Experience working in a network support role• Experience working in a large ISP or cloud provider environment• Experience with network modeling and programming – YANG, OpenConfig, NETCONF• Experience with Jira, Confluence, BitBucket• Knowledge of Scrum & Agile Methodologies• Masters / Ph.D. in Computer Science