DevJobs

Manager, Software Development (Hands-On Technical), ML Network Stack - Annapurna Labs

Overview
Skills
  • C C ꞏ 3y
  • C++ C++ ꞏ 3y
  • Rust Rust ꞏ 3y
  • Testing
  • Build processes
  • SW co-design
  • Source control management
  • Livesite operations
  • HW
  • Certification
  • ML applications
  • NCCL
  • NCCL GIN
  • NIXL
  • NVIDIA stack
  • NVSHMEM
  • Perplexity kernels
  • HPC networking
  • Frameworks
Description

We are hiring a hands-on Software Development Manager for the team that owns the network stack for EC2 distributed AI/ML systems. The team develops support for a variety of frameworks and communication libraries including NCCL, NVSHMEM, NIXL, NCCL GIN, Perplexity kernels and others.

We are seeking an experienced engineering manager for a mid-sized team, with multiple years of hands-on experience in systems programming, HW/SW co-design, and familiarity with networking (HPC networking preferred).

Experience with the NVIDIA stack, ML applications, and frameworks will be highly regarded. You'll be leading senior, mid-level, and junior SDEs and directing work to ensure the team delivers functions and features required for the latest and largest ML workloads.

About The Team

The organization you would be joining is Annapurna Labs, an integral part of AWS that develops hardware and software components that are critical building blocks for EC2 infrastructure. Every instance in EC2 is running some type of hardware designed by Annapurna Labs. We specialize in designing software, systems, and chips that optimize the AWS customer experience.

Diverse Experiences

AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.

Work/Life Balance

We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.

Mentorship & Career Growth

We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Basic Qualifications

  • 5+ years of engineering team management experience
  • Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
  • Experience partnering with product or program management teams
  • 3+ years of C or C++ or Rust development experience
  • 5+ years of hands-on engineering experience, maintaining active programming proficiency

Preferred Qualifications

  • Experience in communicating with users, other technical teams, and senior leadership to collect requirements, describe software product features, technical designs, and product strategy
  • Experience in recruiting, hiring, mentoring/coaching and managing teams of Software Engineers to improve their skills, and make them more effective, product software engineers

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.


Company - Annapurna Labs Ltd.

Job ID: A10435478
Amazon Web Services (AWS)