NextSilicon is reimagining High-Performance Computing (HPC) and Artificial Intelligence (AI). Our accelerated compute solutions leverage intelligent adaptive algorithms to vastly accelerate supercomputers, driving them forward into a new generation. We have developed a novel software-defined hardware architecture that is achieving significant advancements in both the HPC and AI domains.
At NextSilicon, everything we do is guided by three core values:
- Professionalism: We strive for exceptional results through professionalism and unwavering dedication to quality and performance.
- Unity: Collaboration is key to success. That's why we foster a work environment where every employee can feel valued and heard.
- Impact: We're passionate about developing technologies that make a meaningful impact on industries, communities, and individuals worldwide.
We are seeking an exceptional
Director of AI Software to lead the full AI software stack that powers our next-generation accelerator platform. This executive-level role owns the strategy, architecture, and execution of all AI software components- including frameworks, compilers, runtime, kernels, developer tools, and end-to-end AI workloads.
You will lead multiple teams responsible for running state-of-the-art models (LLaMA, DeepSeek, diffusion, MoE and future generations of AI), building high-performance kernels, integrating industry frameworks, and delivering a developer-friendly stack to customers, which showcases and materializes the full potential of our hardware.
Requirements:
- Bachelor’s or Master’s degree and/or equivalent experience in computer science or a related field.
- 5+ years in AI/ML systems, GPU/accelerator software, or ML frameworks.
- 10+ years managing multiple teams or leading large AI software efforts.
- Deep expertise in PyTorch internals, Triton, IREE, XLA, ONNX, or similar AI compiler stacks.
- Strong background in kernels (GEMM, attention), performance engineering, and accelerator bring-up.
- Proven ability to deliver production-grade AI software stacks on new hardware.
- Knowledge of dataflow architectures, or heterogeneous compute is an advantage.
- Experience with customer deployments, SDK delivery, and developer ecosystems.
- Track record of founding or scaling high-performance engineering organizations.
Responsibilities:
- Own the vision, roadmap, and execution of the full AI software stack (frameworks, compilers, runtime, kernels, tools).
- Lead teams responsible for running and optimizing AI workloads
- Deliver first-class support for PyTorch, Triton, IREE, CUDA-compatible flows, and emerging AI frameworks.
- Oversee development of high-performance kernels (GEMM, attention, memory ops) and compiler optimizations.
- Drive HW/SW co-design with silicon, architecture, and runtime teams to maximize performance and efficiency.
- Ensure end-to-end product delivery, including SDKs, toolchains, developer experience, and documentation.
- Own benchmarking, performance targets, and competitive analysis
- Build, mentor, and scale a multi-disciplinary AI software organization.
- Support bring-up, validation, and tuning of new hardware accelerator generations (pre-silicon and post-silicon).