Description

Drawing on expertise in deep learning, e-commerce, and digitization, Fetcherr disrupts traditional systems with its cutting-edge AI technology. At its core is the Large Market Model (LMM), an adaptable AI engine that forecasts demand and market trends with precision, empowering real-time decision-making. Specializing initially in the airline industry, Fetcherr aims to revolutionize industries with dynamic AI-driven solutions.

We are seeking an experienced LLM (AI) Architect to lead the design and implementation of a production-grade, LLM-powered question-answering and graph-plotting system that allows users to interact with complex internal data using natural language.

You will not train foundation models. Instead, you will orchestrate LLM-powered architectures (using OpenAI, Claude, Gemini, etc.) focused on retrieval-augmented generation (RAG), prompt engineering, and context-aware querying across structured and unstructured internal data sources.

Responsibilities

LLM-Powered System Design
<ul><li>Design and build systems that let users query internal data using natural language, simulating an AI analyst.</li><li>Create robust pipelines that combine LLMs with internal structured and unstructured data to provide accurate and explainable responses.</li><li>Architect and optimize RAG systems.</li></ul>
Prompt Engineering & Tooling
<ul><li>Develop advanced prompt strategies for dynamic querying, chaining, and task delegation.</li><li>Implement fallback strategies, guardrails, and context control for reliability and consistency.</li><li>Tune prompts and system behavior to balance accuracy, latency, and cost.</li></ul>
Infrastructure & Deployment
<ul><li>Work with data engineers and MLOps to deploy and scale LLM-based services in production.</li><li>Integrate vector databases, embedding pipelines, and caching layers to optimize performance.</li><li>Ensure systems are monitored, observable, and cost-aware.</li></ul>
Collaboration & Productization
<ul><li>Partner with product managers 
and analysts to define use cases and measure business impact.</li><li>Translate user needs and business logic into scalable LLM-powered applications.</li><li>Educate internal teams on the capabilities and limitations of LLMs in the company context.</li></ul>

Requirements

You'll be a great fit if you have…
<ul><li>5+ years of experience in machine learning, AI engineering, or backend systems.</li><li>2+ years working specifically with LLM architectures or generative AI applications.</li><li>Hands-on experience with:<ul><li>RAG frameworks (LangChain, LlamaIndex, etc.)</li><li>Embedding models and pipelines</li><li>LLM APIs (OpenAI, Claude, Gemini, etc.)</li></ul></li><li>Strong Python skills and familiarity with cloud infrastructure (GCP preferred).</li><li>A proven track record of building reliable, production-grade AI systems.</li><li>Fluency in English (spoken and written) for documentation and cross-team collaboration.</li></ul>
Mindset & Approach
<ul><li>Deeply product-oriented, with strong user empathy.</li><li>Balances experimentation with engineering discipline.</li><li>Collaborative, hands-on, and outcome-driven.</li></ul>
Nice to Have:
<ul><li>Experience in analytics, BI, or data exploration interfaces.</li><li>Familiarity with semantic search, question decomposition, and tool-augmented LLMs.</li><li>Background in pricing, forecasting, or airline data domains.</li></ul>

AI LLM Architect
