AI Agent Architect

Back to all jobs
  • Emergent
  • Bengaluru, KA
  • Full-Time
  • 4 days ago
Published
May 17, 2026
Location
Bengaluru, India
Category
Job Type

AI Agent Architect: our view in 3 lines...

  • The Role: Design and improve autonomous AI coding agents that plan, build, test, and ship production software by turning LLM and system capabilities into measurable improvements.
  • The Person: Own experimentation and shipping decisions to define and run experiments, build evaluation frameworks, measure agent quality at scale, and ship staged rollouts with rollback criteria.
  • Requirements: The role requires 5+ years of software engineering experience and comfort with metrics, experimentation, SQL, and/or Python and being up-to-date with emerging AI research.

Job Description

 

Emergent builds autonomous coding agents that replace traditional software development by generating, testing, and deploying production applications directly from plain-language intent. Our systems run in production at global scale and are used to build millions of real applications.

Since public launch, Emergent has reached $100M ARR in 8 months. 6M+ users across 190+ countries have built 6.5M+ applications on Emergent. We've raised $100M+ , backed by Khosla Ventures, SoftBank, Google, Lightspeed, Prosus, Together, and Y Combinator.

We're solving the hard part of AI-driven software creation: correctness, reliability, security, and scale in real production systems. The team is built by repeat founders, Olympiad medalists, IIT & IIM alumni, and leaders from Google, Amazon, and Dropbox.

We're hiring builders who want ownership, speed, and impact at global scale.

The Role: We're building AI coding agents that can plan, build, test, and ship real software. Your role is to turn LLM model and system capabilities into measurable, shippable improvements in agent performance. You will own experimentation and shipping decisions, what we try, how we measure better, what goes live, and what gets rolled back.

What You'll Do:

  • Develop deep intuition for LLM and agent behavior, identifying failure modes and regressions
  • Define and run high-leverage experiments to improve agent quality, reliability, and code outcomes
  • Ship improvements with clear metrics, evaluation gates, staged rollouts, and rollback criteria
  • Define evaluation frameworks and work with engineering teams to measure agent quality at scale
  • Drive initiatives around context engineering, memory systems, and frontier agent capabilities
  • Think like the agent, continuously making it smarter, more reliable, and more useful

Who You Are:

  • 5+ years of software engineering experience with end-to-end ownership
  • Strong technical depth paired with sharp product intuition
  • Comfortable working with metrics, experimentation, SQL, and/or Python
  • Able to thrive in ambiguity and move fast with rigor
  • Hands-on and up-to-date with emerging AI research and industry trends

Benefits and Perks:

  1. Daily Meals: Lunch and Dinner provided
  2. Family Insurance: 3 Lakhs worth of coverage for you and your family
  3. Unlimited Paid Time Off: Take the time you need to recharge and come back refreshed
  4. Flexible Working Hours: Work arrangements that fit your life and commitments

Let's build the future of software together.

Emergent enables anyone to build production software by describing what they want in natural language. Our coding agents handle the entire development stack—from frontend design, database setup, app deployment, and scaling—translating domain expertise directly into working applications without requiring any coding knowledge.

Founded by Mukund Jha, and Madhav Jha, we've grown to 700,000+ users and $10M ARR in just 2 months from launch, making us one of the fastest-growing AI companies in history.

Our customers range from entrepreneurs launching SaaS products, to development agencies delivering prototypes to win deals, to Fortune 500 companies automating enterprise workflows.

Key Skills
? Key Skills in dark blue have been inferred based on similar industry roles
LLM Evaluation A/B And Experimentation Design Model Evaluation Metrics Staged Rollouts And Rollback Procedures Context Engineering Memory Systems Python SQL LLM

Subscribe to Career Resources

Get the latest career advice, industry insights, and job opportunities delivered to your inbox.