
About this role
Full Time Senior Tech Lead, Software Engineer - AI Agent Memory Infrastructure in AI at ByteDance in San Jose, California, United States. Apply directly through the link below.
At a glance
- Work mode
- Office
- Employment
- Full Time
- Location
- San Jose, California, United States
- Salary
- 245k - 588k USD
- Experience
- Senior
Core stack
- Infrastructure
- Optimization
- Performance
- Innovation
- Design
- LLMs
- AI
Quick answers
What is the salary range?
The salary range is 245k - 588k USD annually.
What skills are required?
Infrastructure, Optimization, Performance, Innovation, Design, LLMs, AI.
ByteDance is hiring for this role. Visit career page
San Jose, United States
About the Team
Join ByteDance’s AI Agent Memory Infrastructure team, where we build the core memory systems that power next-generation intelligent agents. Our focus is on creating a unified platform for long-term, conversational, and task-oriented memory, enabling more personalized and context-aware AI experiences.
We design and operate large-scale, low-latency, and highly reliable memory infrastructure, covering the full lifecycle from storage and retrieval to updating and optimization. Working at the intersection of LLMs, data systems, and context engineering, we tackle challenges in memory representation, retrieval, and multimodal fusion.
Partnering closely with model and product teams, we turn advanced research into scalable production systems that support a wide range of AI-driven applications.
Responsibilities
- Design, build, and evolve the next-generation memory infrastructure for AI agents, developing a unified platform that supports long-term memory, conversational memory, and task-oriented memory.
- Architect and optimize memory system pipelines for large-scale, low-latency, and high-availability environments, including data ingestion, storage, indexing, retrieval, updating, compression, and forgetting mechanisms to support real-time inference and personalized interactions.
- Explore key challenges at the intersection of large language models, context engineering, and data management, including memory representation, retrieval and ranking, conflict resolution, summarization and fusion, and memory lifecycle management.
- Design unified memory models and processing workflows for multimodal data (text, image, audio, behavioral signals), enhancing agents’ long-term consistency, personalization, and task completion in complex scenarios.
- Collaborate closely with model, application, and platform teams to productionize memory capabilities, and continuously optimize system performance across quality, latency, cost, reliability, and safety.
- Stay up-to-date with cutting-edge advancements and contribute to the long-term technical roadmap of AI agent memory systems, driving innovation and capability evolution.
The base salary range for this position in the selected city is $244800 - $588000 annually.
Join ByteDance’s AI Agent Memory Infrastructure team, where we build the core memory systems that power next-generation intelligent agents. Our focus is on creating a unified platform for long-term, conversational, and task-oriented memory, enabling more personalized and context-aware AI experiences.
We design and operate large-scale, low-latency, and highly reliable memory infrastructure, covering the full lifecycle from storage and retrieval to updating and optimization. Working at the intersection of LLMs, data systems, and context engineering, we tackle challenges in memory representation, retrieval, and multimodal fusion.
Partnering closely with model and product teams, we turn advanced research into scalable production systems that support a wide range of AI-driven applications.
Responsibilities
- Design, build, and evolve the next-generation memory infrastructure for AI agents, developing a unified platform that supports long-term memory, conversational memory, and task-oriented memory.
- Architect and optimize memory system pipelines for large-scale, low-latency, and high-availability environments, including data ingestion, storage, indexing, retrieval, updating, compression, and forgetting mechanisms to support real-time inference and personalized interactions.
- Explore key challenges at the intersection of large language models, context engineering, and data management, including memory representation, retrieval and ranking, conflict resolution, summarization and fusion, and memory lifecycle management.
- Design unified memory models and processing workflows for multimodal data (text, image, audio, behavioral signals), enhancing agents’ long-term consistency, personalization, and task completion in complex scenarios.
- Collaborate closely with model, application, and platform teams to productionize memory capabilities, and continuously optimize system performance across quality, latency, cost, reliability, and safety.
- Stay up-to-date with cutting-edge advancements and contribute to the long-term technical roadmap of AI agent memory systems, driving innovation and capability evolution.
The base salary range for this position in the selected city is $244800 - $588000 annually.
Job details
Workplace
Office
Location
San Jose, California, United States
Job type
Full Time
Experience
Senior
Salary
245k - 588k USD
per year
Company
Website
Visit siteJobr Assistant extension
Get the extension →