Campus Pride Jobs

Mobile Campus Pride Logo

Job Information

Amazon Software Dev Engineer II, Enterprise Engineering in Austin, Texas

Description

Amazon strives to be the world’s most customer centric company. To succeed, our products and services must be available at all times to our customers. The Enterprise Engineering Availability (EEA) team is responsible for improving the availability of internal systems (software, hardware, network) used by millions of Amazonians around the world.

A software engineer on the EEA team will develop greenfield observability and detection systems used to predict, detect and triage customer-impacting outages of internal Amazon IT systems. These will streamline the incident management process by diagnosing the root cause of an outage, identifying the right team(s) to engage and providing recommendations on how to mitigate the problem.

This role is a perfect fit for an experienced engineer who is passionate about availability (alerting, metrics, monitoring, observability), incident management and machine learning. You thrive in a fast-paced, startup-like environment, build full-stack applications, communicate effectively to all types of stakeholders (tech, non-tech), enjoy learning new technology and ship complex software at scale in fast iterations.

A day in the life

  • Deliver high-impact, high-visibility projects that improve the productivity of millions of Amazonians around the world

  • Invent processes, tools, and technology to force multiply the effect of your contributions across many organizations.

  • Be responsible for owning, scoping, leading and delivering projects and experiments end-to-end, leveraging statistical evaluation, pattern recognition, and machine learning.

We are open to hiring candidates to work out of one of the following locations:

Austin, TX, USA

Basic Qualifications

  • 3+ years of non-internship professional software development experience

  • 2+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience

  • Experience programming with at least one software programming language

Preferred Qualifications

  • 3+ years of full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, and operations experience

  • Bachelor's degree in computer science or equivalent

  • Experience building and maintaining large-scale, high-availability distributed systems

  • Excellent oral and written communication skills with both technical and non-technical stakeholders

  • Experience using data to make priority decisions and taking those initiatives from scoping through production launch into daily operation.

  • Understanding of CI/CD, test automation and robust system health monitoring (metrics, monitors, alarms)

  • Experience with incident management

  • Experience with telemetry and observability systems

  • Experience with anomaly detection, time-series data and storage, data streaming

  • Experience with Site Reliability Engineering (SRE) concepts, practices

  • Experience with statistical analysis and machine learning

  • Experience influencing engineering team members on best practices (full SDLC inclusive of coding standards, code reviews, source control management, build processes, testing, and operations)

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.

DirectEmployers