What you'll do
About the role
We are looking for a Lead AI Engineer to design and build the generative intelligence core of a new product.
This is not a research-focused role. We’re looking for a hands-on builder who can turn LLM capabilities into reliable, scalable, and cost-effective product features. You will be responsible for defining the architecture, selecting the right models, and ensuring high-quality AI outputs.
You will work closely with the Technical Leader to integrate AI capabilities into the broader system, while also guiding best practices in AI engineering across the team.
Responsibilities
Design and implement RAG (Retrieval-Augmented Generation) architectures.
Define and select the appropriate LLM stack (proprietary and/or open-source).
Build and orchestrate LLM workflows, chains, and agents using frameworks like LangChain, LlamaIndex, or similar.
Optimize prompts using advanced prompt engineering techniques to improve output quality and reduce hallucinations.
Design and manage vector databases and embedding strategies.
Develop robust APIs and backend services to integrate AI capabilities into the platform.
Ensure AI outputs are grounded, safe, and performant.
Implement evaluation and observability frameworks to measure accuracy, latency, and cost.
Collaborate with engineering teams to integrate AI services into production systems.
Mentor team members and promote AI engineering best practices.
Requirements
Must have
8+ years of experience in Software Engineering.
2+ years of hands-on experience building and deploying GenAI-powered applications.
Deep expertise in LLM orchestration frameworks (LangChain, LlamaIndex, Haystack, or similar).
Proven experience implementing RAG architectures (chunking, embeddings, vector databases such as Pinecone or pgvector).
Strong experience in advanced prompt engineering (few-shot, Chain-of-Thought, optimization techniques).
Deep understanding of model selection and trade-offs (OpenAI, Anthropic, Gemini vs open-source like Llama, Mistral).
Expert-level Python skills, including async programming and performance optimization.
Experience with evaluation and observability tools (RAGAS, TruLens, LangSmith, or similar).
Experience designing APIs and backend services (FastAPI, Flask) for AI-driven applications.
Advanced English (C1), with the ability to communicate complex AI concepts to non-technical stakeholders.
Nice to have
Experience with fine-tuning techniques (LoRA, QLoRA, PEFT).
Experience in LLMOps and model deployment (BentoML, Modal, AWS SageMaker).
Knowledge of AI security risks (prompt injection, data leakage) and mitigation strategies.
Experience with multi-modal AI (vision, audio, or hybrid models).
Strong product mindset, especially around AI-driven user experiences and workflows.
Benefits
People First culture.
Free access to streaming platforms.
Free access to Spotify premium.
GYM discount.
Legal and accountant advise.
Travel discount.
E-Learning discount.
About the company
We are a young and fast-growing recruiting company with five years of experience working across Latin America and the United States. We partner closely with teams and founders to help them build strong, high-impact teams through recruitment, outsourcing, and team-building services.
Our culture is built on effective communication, trust, and transparency. We believe great work happens when people feel heard, supported, and empowered to grow. Today, our team is made up of more than 50 professionals working across different projects throughout the region, collaborating remotely and learning from each other every day.