DevJobs

AI LLM Architect

Overview
Skills
  • Python Python
  • GCP GCP
  • Cloud infrastructure
  • Embedding models
  • LLM APIs
  • RAG frameworks
Fetcherr experts in deep learning, e-commerce, and digitization, Fetcherr disrupts traditional systems with its cutting-edge AI technology. At its core is the Large Market Model (LMM), an adaptable AI engine that forecasts demand and market trends with precision, empowering real-time decision-making. Specializing initially in the airline industry, Fetcherr aims to revolutionize industries with dynamic AI-driven solutions.

We are seeking an experienced LLM (AI) Architect to lead the design and implementation of a production-grade, LLM-powered question-answering and graph plotting system that allows users to interact with complex internal data using natural language.

You will not train foundation models—but instead orchestrate LLM-powered architectures (e.g. using OpenAI, Claude, Gemini, etc.), focused on retrieval-augmented generation (RAG), prompt engineering, and context-aware querying across structured and unstructured internal data sources.

Responsibilities

LLM-Powered System Design

  • Design and build systems that let users query internal data using natural language, simulating an AI analyst.
  • Create robust pipelines that use LLMs + internal structured/unstructured data to provide accurate and explainable responses.
  • Architect and optimize RAG systems

Prompt Engineering & Tooling

  • Develop advanced prompt strategies for dynamic querying, chaining, and task delegation.
  • Implement fallback strategies, guardrails, and context control for reliability and consistency.
  • Tune prompts and system behavior to balance accuracy, latency, and cost.

Infrastructure & Deployment

  • Work with data engineers and MLOps to deploy and scale LLM-based services in production.
  • Integrate vector databases, embedding pipelines, and caching layers to optimize performance.
  • Ensure systems are monitored, observable, and cost-aware.

Collaboration & Productization

  • Partner with product managers and analysts to define use cases and measure business impact.
  • Translate user needs and business logic into scalable LLM-powered applications.
  • Educate internal teams on the capabilities and limitations of LLMs in the company context.

Requirements:

You'll be a great fit if you have…

  • 5+ years of experience in machine learning, AI engineering, or backend systems.
  • 2+ years working specifically with LLM architectures or generative AI applications.
  • Hands-on experience with:
  • RAG frameworks (LangChain, LlamaIndex, etc.)
  • Embedding models and pipelines
  • LLM APIs (OpenAI, Claude, Gemini, etc.)
  • Strong Python skills and familiarity with cloud infrastructure (GCP preferred).
  • Proven track record building reliable, production-grade AI systems.
  • Fluent in English (spoken and written) for documentation and cross-team collaboration.

Mindset & Approach

  • Deeply product-oriented with a strong user empathy.
  • Balances experimentation with engineering discipline.
  • Collaborative, hands-on, and outcome-driven.

Nice to Have:

  • Experience in analytics, BI, or data exploration interfaces.
  • Familiarity with semantic search, question decomposition, and tool-augmented LLMs.
  • Background in pricing, forecasting, or airline data domains.
Fetcherr