Job Information
Amazon Software Dev Engineer III - AGI Nova, AGI Foundations in Gdansk, Poland
Description
The Artificial General Intelligence (AGI) team is looking for a highly-skilled Senior Software Engineer III to lead the development and implementation of state-of-the-art technologies to push the boundaries of efficient inference for Generative Artificial Intelligence (GenAI) models.
This role offers the opportunity to work on Amazon Nova and other generative AI solutions to deliver optimal inference solutions for next-generation AI applications. The successful candidate will join a team focused on pushing the boundaries of inference optimization, working to make large language models more efficient and accessible while maintaining high performance standards.
Key job responsibilities
Inference Solutions Development: Design, develop, test, and deploy inference solutions for state-of-the-art Large Language Models (LLMs).
AI Workflow Optimization: Create solutions that facilitate the usage and building of artificial intelligence workflows, optimizing them for cost and latency.
Innovative Technology Design: Design and build innovative technologies in a large distributed computing environment, contributing to fundamental changes in the industry.
Scalable Solutions on AWS: Build solutions on top of AWS that are user-friendly, scalable, fault-tolerant, and easy to manage.
Collaboration and Problem-Solving: Collaborate with other engineers to identify and solve well-defined problems in GenAI.
Mentorship: Mentor and guide junior engineers, contributing to the overall growth and development of the team.
Operational Excellence: Perform on-call duties to ensure an exceptionally frictionless experience for customers.
Agile Development: Work in an agile environment to deliver high-quality software.
A day in the life
Software Dev engineers in our team are autonomous contributors to their team software. They deliver high quality features spanning the full software lifecycle including working backwards from customers, design, implementation, testing, operations and documentation. They solve well-defined business problems, sometimes through creating software solutions, and sometimes just through finding and reusing something that exists. They are coaching others on following best programming practices, as well as lead operational efforts. They partner with seniors to focus on right things and make good decisions.
About the team
Our team is at the forefront of building efficient and scalable inference solutions for Generative AI models. Working closely with the Science team, we transform state-of-the-art research into production-ready systems, ensuring our large language models (LLMs) deliver optimal performance in real-world applications. The recent launch of Amazon Nova exemplifies our commitment to innovation and excellence, as we continuously push the boundaries of what's possible in LLM inference optimization, focusing on enhanced latency, cost-effectiveness, and reliability.
By joining our team, you'll become part of a highly motivated group that's passionate about solving challenging problems and enabling Amazon to deliver best-in-class AI solutions.
Basic Qualifications
8+ years of professional software development experience.
8+ years of experience programming with at least one modern language such as Java, Python, C++, or C# including object-oriented design.
3+ years of experience contributing to the architecture and design (architecture, design patterns, reliability, scaling) of new and current systems.
Strong fundamentals in Computer Science, including object-oriented design, data structures, algorithm design, problem-solving, and complexity analysis.
Fundamental knowledge of Machine Learning algorithms and techniques.
Preferred Qualifications
Master’s degree in Computer Science or a related field, or equivalent experience.
Experience developing workflows that run and train machine learning models.
Experience with inference optimization in production environments.
Knowledge of model-serving architectures and frameworks.
Understanding of hardware acceleration techniques (e.g., GPU, CPU optimization).
Experience with distributed inference systems.
Hands-on experience with major cloud platforms (AWS, GCP, Azure, or equivalent).
Strong problem-solving skills, with the ability to think creatively and critically about complex problems.
Strong communication and collaboration skills, with the ability to work effectively within a team.
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.