DevJobs

AI Software Director

Overview
Skills
  • PyTorch PyTorch
  • kernels
  • XLA
  • Triton
  • toolchains
  • SDK delivery
  • runtime
  • performance engineering
  • ONNX
  • memory ops
  • accelerator bring-up
  • IREE
  • GEMM
  • developer ecosystems
  • CUDA
  • compiler optimizations
  • attention
  • AI compiler stacks
  • heterogeneous compute
  • dataflow architectures
NextSilicon is reimagining High-Performance Computing (HPC) and Artificial Intelligence (AI). Our accelerated compute solutions leverage intelligent adaptive algorithms to vastly accelerate supercomputers, driving them forward into a new generation. We have developed a novel software-defined hardware architecture that is achieving significant advancements in both the HPC and AI domains.

At NextSilicon, everything we do is guided by three core values:

  • Professionalism: We strive for exceptional results through professionalism and unwavering dedication to quality and performance.
  • Unity: Collaboration is key to success. That's why we foster a work environment where every employee can feel valued and heard.
  • Impact: We're passionate about developing technologies that make a meaningful impact on industries, communities, and individuals worldwide.

We are seeking an exceptional Director of AI Software to lead the full AI software stack that powers our next-generation accelerator platform. This executive-level role owns the strategy, architecture, and execution of all AI software components- including frameworks, compilers, runtime, kernels, developer tools, and end-to-end AI workloads.

You will lead multiple teams responsible for running state-of-the-art models (LLaMA, DeepSeek, diffusion, MoE and future generations of AI), building high-performance kernels, integrating industry frameworks, and delivering a developer-friendly stack to customers, which showcases and materializes the full potential of our hardware.

Requirements:

  • Bachelor’s or Master’s degree and/or equivalent experience in computer science or a related field.
  • 5+ years in AI/ML systems, GPU/accelerator software, or ML frameworks.
  • 10+ years managing multiple teams or leading large AI software efforts.
  • Deep expertise in PyTorch internals, Triton, IREE, XLA, ONNX, or similar AI compiler stacks.
  • Strong background in kernels (GEMM, attention), performance engineering, and accelerator bring-up.
  • Proven ability to deliver production-grade AI software stacks on new hardware.
  • Knowledge of dataflow architectures, or heterogeneous compute is an advantage.
  • Experience with customer deployments, SDK delivery, and developer ecosystems.
  • Track record of founding or scaling high-performance engineering organizations.

Responsibilities:

  • Own the vision, roadmap, and execution of the full AI software stack (frameworks, compilers, runtime, kernels, tools).
  • Lead teams responsible for running and optimizing AI workloads
  • Deliver first-class support for PyTorch, Triton, IREE, CUDA-compatible flows, and emerging AI frameworks.
  • Oversee development of high-performance kernels (GEMM, attention, memory ops) and compiler optimizations.
  • Drive HW/SW co-design with silicon, architecture, and runtime teams to maximize performance and efficiency.
  • Ensure end-to-end product delivery, including SDKs, toolchains, developer experience, and documentation.
  • Own benchmarking, performance targets, and competitive analysis
  • Build, mentor, and scale a multi-disciplinary AI software organization.
  • Support bring-up, validation, and tuning of new hardware accelerator generations (pre-silicon and post-silicon).
NextSilicon