Senior Site Reliability Engineer

Senior Site Reliability Engineer
Company:

Accelbyte


Details of the offer

At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality games. The company was founded in 2016 by industry veterans who have engineered online systems for some of the largest game and distribution platforms in the world including Fortnite, Epic Store, Xbox Live, PlayStation Network, and EA Origin. We are backed by top investors including Softbank, Sony Interactive Entertainment, Galaxy Interactive, NetEase, and Krafton. Our latest Series B funding has firmly solidified our place as a top player in the gaming industry. AccelByte's talent has decades of experience building and shipping some of the largest game and distribution platforms in the world.We believe that the best companies empower employees to make decisions, obsess about the best user experience, and are not afraid to make and learn from their mistakes. Our culture is based on humility, openness to feedback, drive, and collaboration, which we feel results in the best performing teams. As a company that values diversity, inclusion, and employee growth, our employees have opportunities to work with and learn from teams all over the world. We offer competitive salaries, a full range of health benefits, social activities, career growth opportunities, and an amazing team. Come join us!**Position Summary**As a Senior Site Reliability Engineer (Observability & Cost), you design, implement, and maintain infrastructure and operational systems that accomplish a given goal. You discover requirements and guide other engineers collaborating in an area and do exemplary work on complicated problems.**Essential Functions/Responsibilities**The Senior Site Reliability Engineer (Observability & Cost) is accountable for the following functions and responsibilities:- Review, provide feedback, and mentor coworkers on changes to maintain reliability.- Design, implement, and enhance observability strategies and tools to monitor the performance, availability, and reliability of our distributed systems.- Contributing in automating solutions to optimize tasks, improve efficiency, and reduce manual effort.- Collaborate with development teams to implement and promote best practices in observability, including logging, tracing, and metrics collection.- Conduct performance analysis and capacity planning to optimize system performance and resource utilization.- Identify and address bottlenecks, inefficiencies, and potential failure points in the system.- Establish and maintain cost control measures, monitoring resource utilization and identifying opportunities for optimization.- Collaborate with finance and operations teams to develop and enforce cost management policies and guidelines- Collaborate with development teams to implement cost-aware design patterns and practices.- The ability to train and mentor less experienced engineers and set the direction for other engineers.- Model standards for engineering excellence- Discover requirements by working with PMs and stakeholders- Perform other duties as assigned**Qualifications/Experience Required**- Minimum of 5+ years of professional experience as an SRE or similar role, with a focus on observability and cost control in a distributed system.- Solid understanding of cost optimization techniques in cloud environments.- Extensive experience in analyzing cloud resource usage patterns and identifying opportunities for cost optimization.- Experienced in designing and implementing effective tagging strategies for cloud resources to ensure accurate cost attribution and allocation- Familiarity with cloud cost modeling and forecasting techniques to provide accurate cost projections and budgeting.- Eagerness to learn new languages, technologies, and containerization principles (e.g., Docker, Kubernetes).- Practical knowledge of networking, storage, and container technologies.- Robust knowledge and experience in cloud computing (preferred AWS/GCP).- Proven experience with automation, CI/CD, and GitOps tools.- Familiarity with infrastructure-as-code tools (e.g., Terraform, Ansible) for provisioning and configuration management.- Experience with distributed tracing systems (e.g., Jaeger, Zipkin), log aggregation tools (e.g., Prom Tail, Loki), monitoring tools (e.g., Prometheus, Fluentbit, Grafana), alerting tools (e.g. PagerDuty, OpsGenie)- Software development and scripting experience with Bash, Python, and/or Golang.- Proficiency in written and verbal English language for remote work.- Flexibility to adjust work routines/schedules to meet company and customer needs.- Previous professional infrastructure or operational experience preferred.- Experience at a AAA game studio or software product company preferred.- Experience working with cloud platforms or web products preferred.- Experience in a multinational technology startup is a big plus.


Source: Whatjobs_Ppc

Job Function:

Requirements

Senior Site Reliability Engineer
Company:

Accelbyte


Qa Engineer (Yogyakarta Based)

What you will do Currently, we are looking for someone who can fill the position of QA Test Engineer (Yogyakarta Based) . In general, you need to work togeth...


From Cakap - Special Region of Yogyakarta

Published a month ago

Youtube Specialist

Job Description He or she will be responsible for implementing content, managing and monitoring company's Social Media in order to increase brand awareness, ...


From Foreximf - Special Region of Yogyakarta

Published a month ago

Lifestyles Supervisor - Leading Industry Pay

We are on the lookout for a confident Lifestyles Supervisor to join our talented team at PULLMAN in Bandung. Growing your career as a Full Time Lifestyles Su...


From Pullman - Special Region of Yogyakarta

Published a month ago

Senior Backend Software Engineer (Java)

At AccelByte, our mission is to empower game creators by providing them with the backend platform and tools required to make scalable, reliable AAA-quality g...


From Accelbyte - Special Region of Yogyakarta

Published a month ago

Built at: 2024-05-19T21:37:40.462Z