company logo

Site Reliability Engineer, Lark - USDS

TikTok.com

Office

San Jose, California, United States

Full Time

Team Intro:
Lark is the next-generation collaborative tool that boosts organizations' efficiency, creativity, and engagement by integrating Messenger, Docs, Calendar, Video calls, Emails, and more into one easy-to-use app.

The USDS Lark team is seeking an experienced Site Reliability Engineer to help us continue improving the Lark system. If you are passionate about ensuring software reliability, love problem-solving, and are prepared for exciting challenges, we would like you on our team.

Responsibilities:
- Responsible for overall reliability of Lark product
- Perform lifecycle management of production systems including change management, service deployment, operations and emergency response.
- Monitor the system and respond to incidents to maintain system service level agreement (SLA), review and follow up all production incidents.
- Perform capacity management of compute, storage and network bandwidth resources to ensure system stability and save infrastructure costs.
- Provide strong support during big events to ensure the system is capable of consuming a large volume of Internet traffic.
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global infrastructure.

Site Reliability Engineer, Lark - USDS

Office

San Jose, California, United States

Full Time

January 16, 2026