Site Reliability Engineer - Video Platform - USDS (LA)
- Los Angeles, CA
- Permanent
- Full-time
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
To us, every challenge, no matter how difficult, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.
Join us.Team Intro
TikTok video system is a world-leading video platform that provides multimedia storage, delivery, transcoding services. As part of the USDS, the Video Platform team is responsible for building the next generation video processing platform which provides excellent experiences for billions of users around the world.
The USDS Video Platform team is seeking an experienced Site Reliability Engineer to help us continue improving TikTok's video system. If you are passionate about ensuring software reliability, love problem-solving, and are prepared for exciting challenges, we would like you on our team.
In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager/department. We regularly review our hybrid work model, and the specific requirements may change at any time.
Responsibilities
- Responsible for overall reliability of TikTok's video system, including video publishing and distribution.
- Perform lifecycle management of production systems including change management, service deployment, operations and emergency response.
- Monitor the system and respond to incidents to maintain system service level agreement (SLA), review and follow up all production incidents.
- Perform capacity management of compute, storage and network bandwidth resources to ensure system stability and save infrastructure costs.
- Provide strong support during big events to ensure the system is capable of consuming a large volume of Internet traffic.
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global infrastructure.Qualifications:Minimum Qualifications
- Bachelor's degree in Computer Science or a related technical background involving software/system engineering, or equivalent working experience.
- 2+ years of SRE or DevOps experience in large scale online services
- Programming experience with at least one of the following languages: C, C++, Java, Python, C# or Go.
Preferred Qualifications
- Extensive knowledge of networking, operation system, database system and container technology.
- Good understanding of every aspect of microservice architecture, and hands on experience in troubleshooting in large scale distributed systems.
- Hands on experience in common opensource systems such as Linux, MySQL, MongoDB, Redis and ELK.
- Experience in building solutions with AWS, Google, Azures and other cloud services is a plus.
- Passionate, self-motivated and good teamwork skills.D&I Statement
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.Accommodation Statement
TikTok is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at https://shorturl.at/ktJP6Data Security Statement
This role requires the ability to work with and support systems designed to protect sensitive data and information. As such, this role will be subject to strict national security-related screening.