Launch your Reliability Engineering career!
Ever wondered how massive apps stay online 24/7? Site Reliability Engineering (SRE) is the “special ops” of the software world. We sit right between coding and operations, making sure our systems are fast, reliable, and smart.
Don’t worry if you haven’t managed a cloud environment before - that’s what this co-op is for! You’ll work alongside our senior engineers to learn how to monitor and manage huge systems, handle real-world incidents, and build AI-powered bots that make our team faster. If you love solving puzzles and want to learn how things work, we want to meet you.
What you’ll learn (and do!):
- The “Vital Signs” of Code: You’ll help us manage and instrument observability solutions - tracking metrics, logs, and traces so we can see exactly how our apps are performing in the cloud.
- AI-Powered SRE: We use the latest AI tools like Claude, Cursor, and Copilot. You’ll help us build “agent skills” for internal SRE bots that help us troubleshoot complex issues in seconds.
- Collaborative Problem Solving: You’ll shadow our team during on-call shifts and learn how we diagnose production issues and build permanent fixes to prevent them from happening again.
- Testing at Scale: You’ll help us run load tests to see how many users our system can handle before it breaks (and then help us make it stronger!).
The Tech Stack:
- Cloud: Google Cloud Platform (GCP) & Kubernetes.
- Monitoring: Prometheus & Grafana (the industry standards).
- Languages: We mostly use Go, plus Terraform for configuration management.
- AI Tools: Claude, Cursor, Codex, Copilot.
Who you are:
- Naturally Curious: You’re the type of person who likes to take things apart to see how they work. You might have already poked around with cloud platforms like GCP or AWS in your own time or during a previous co-op.
- A “Lazy” Coder (in a good way!): If you have to do something twice, you’d rather write a script. You love the idea of automating manual tasks so you can focus on the fun stuff.
- Comfortable with Code: You have hands-on experience with languages like Go or Python. Whether it’s from school projects, a previous co-op, or a personal side-hustle, you know your way around a codebase.
- Excited about AI: You’re already using (or want to use) LLMs to help you write better code, faster. You’re eager to learn how to build AI-driven automations.
- A Great Communicator: You enjoy explaining what you’ve found - whether it’s a bug in code or a cool new automation - and working with a team to fix it.
- A Systems Thinker: You’re interested in the big picture of how data flows through a system, from the first line of code to the final metric on a Grafana dashboard.
Security-related Responsibilities:
- Compliance with Information Security Policies
- Compliance with League’s secure coding practices
- Responsibility and accountability for executing League’s policies and procedures
- Notification of HR, Legal, Compliance & Security of any incidents, breaches or policy violations
As part of your application, please record a 1-minute video answering:
👉 “Why do you want to work at League?”
Upload your video to Google Drive or YouTube, and include a clickable link to it in a PDF submitted with your application.
Please note: This is an in-office position at our Toronto HQ from Monday to Thursday; Fridays are work-from-home. We really value fostering in-person connection and collaboration, which is especially advantageous as a learning opportunity for a co-op student.