WEKA is redefining what's possible in data infrastructure — delivering the world's fastest AI-ready storage platform, purpose-built for the most demanding workloads at scale. Our software-defined architecture powers leading enterprises, research institutions, and cloud providers worldwide.
The Role
We are looking for a seasoned engineering leader to take ownership of WEKA's deployment and operational excellence functions. As Director of Deployment Engineering, you will lead three high-impact teams — Cloud & Operator Deployment, Supportability, and Observability — responsible for making WEKA's platform effortless to deploy, operate, and monitor at scale.
This is a highly cross-functional role sitting at the intersection of engineering, product, and customer success. You will drive the strategy and execution that shape how WEKA lands in customer and cloud environments globally.
What You'll Do
Leadership & Team Building
- Lead, mentor, and grow three teams of engineers across cloud/operator deployment, supportability, and observability disciplines.
- Foster a culture of ownership, technical excellence, and customer empathy.
- Define team structure, hiring plans, and career development frameworks.
Cloud & Operator Deployment
- Own the strategy and roadmap for deploying WEKA in cloud environments (AWS, GCP, Azure) and via Kubernetes operators.
- Drive automation and repeatability in deployment pipelines to reduce time-to-value for customers.
- Collaborate closely with product and architecture teams to ensure deployability is a first-class design consideration.
Supportability
- Build and evolve tooling and processes that make WEKA systems easier to diagnose, debug, and support, for both internal teams and customer teams.
- Partner with the Customer Success and Support organizations to close the feedback loop between field issues and engineering improvements.
- Define supportability standards and integrate them into the development lifecycle.
Observability
- Drive the vision for how WEKA systems expose metrics, logs, and traces — enabling customers and internal teams to understand system health at a glance.
- Oversee the development of dashboards, alerting frameworks, and monitoring integrations (e.g., Prometheus, Grafana, OpenTelemetry).
- Ensure observability capabilities meet enterprise and cloud-native customer expectations.
Execution & Delivery
- Own quarterly and annual roadmap planning for your teams, balancing new capabilities with technical debt and operational resilience.
- Establish clear OKRs and KPIs — including deployment success rates, MTTD/MTTR, and observability coverage — and drive accountability against them.
- Identify and remove blockers; drive decisions at pace without sacrificing quality.
What You Bring
- 10+ years of software engineering experience, with 4+ years in engineering management leading multiple teams.
- Deep hands-on background in one or more of the following: distributed systems, cloud infrastructure, Kubernetes/operators, and storage platforms.
- Proven track record of building and scaling deployment, DevOps, SRE, or platform engineering functions.
- Experience owning observability strategy — metrics, logging, tracing — in a complex, distributed product.
- Strong customer orientation: comfortable in pre-sales, escalation, and executive customer conversations.
- Excellent communication skills — able to translate complex technical concepts for both engineering teams and business stakeholders.
- Experience working in a fast-paced, high-growth B2B software environment.
Nice to Have
- Background in storage, HPC, or AI/ML infrastructure.
- Experience with cloud-native ecosystems (EKS, GKE, AKS, OpenShift).
- Familiarity with enterprise support processes and supportability tooling.