Senior Software Engineer - from Developer to SRE



Software Engineering
Prague, Czechia
Posted on Tuesday, June 18, 2024

Are you a software developer looking to switch gears and explore the exciting world of Site Reliability Engineering (SRE)? Join our team and blend the power of code with the magic of infrastructure.

The Site Reliability Engineer is a pivotal role in our SaaS strategy. You will work closely with our engineering team to ensure unrivaled observability, availability, and performance of Tricentis SaaS Products.

Site Reliability Engineers (SREs) are responsible for keeping all user-facing services and other production systems running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply engineering principles, operational discipline, and mature automation to our operating environments and our codebase.

Your Impact as an SRE:

  • Design, build, and maintain the product cloud infrastructure that enables seamless scaling to support hundreds of thousands of concurrent users.
  • Build monitoring that alerts on symptoms rather than on outages.
  • Run our infrastructure with Terraform, GitHub Actions, and Kubernetes.
  • Improve operational processes (such as deployments and upgrades) to make them as boring as possible.
  • Collaborate with product engineers daily influence product architecture designs and help scale our products.
  • Be part of an on-call (PagerDuty) rotation to respond swiftly to incidents affecting availability, offering support to product engineers during customer incidents.

As a valuable member of our SRE team, you'll have the opportunity to:

  • Coding infrastructure automation with Terraform and/or Pulumi, and GitHub Actions.
  • Improving our monitoring or building new metrics.
  • Helping to deploy and fix new versions of our products.
  • Develop a relationship with products define their SLAs, share data on those SLAs and improve their reliability.
  • Be the first responder during emergencies and on-call duties, promptly addressing symptoms and conducting root cause analysis to implement corrective actions and prevent recurring issues.

Our Tech Stack:

Terraform, Pulumi, GitHub Actions, Kubernetes, DataDog, Prometheus, Grafana, AWS, AZURE

About You:

  • Experience in programming in Java, .NET, PHP, Python, or another OOP language
  • Working knowledge of SaaS architecture concepts and designs.
  • Proven professional experience building and debugging software in large-scale, high-volume environments.
  • Experience with Docker and Kubernetes in production use cases
  • Understanding of infrastructure & application deployment models (Kubernetes, Docker, AWS/GCP/Azure).
  • Experience with observability tools, including metrics (DataDog is a plus), structured logging, and distributed tracing.
  • A passion for learning new languages & technologies.
  • If you're ready to make a lasting impact as a Site Reliability Engineer and be at the forefront of revolutionizing Tricentis SaaS Products, don't miss this.

Our Culture

We don't just preach our values; we embody them in everything we do. We are committed to creating an environment that empowers, supports, and includes individuals, where trust, transparency, creativity, curiosity, and continuous improvement thrive daily.

Tricentis Core Values:

Knowing what we need to achieve and how to achieve it is important. Tricentis core values define our ways of working and the behaviours we model that creates an enjoyable and successful Tricentis life.

  • Demonstrate Self-Awareness: Own your strengths and limitations.
  • Finish What We Start: Do what we say we are going to do.
  • Move Fast: Create momentum and efficiency.
  • Run Towards Change: Challenge the status quo.
  • Serve Our Customers & Communities: Create a positive experience with each interaction.
  • Solve Problems Together: We win or lose as one team.
  • Think Big & Believe: Set extraordinary goals and believe you can achieve them.

You can look forward to:

  • Full-time permanent job
  • Informal and open start-up culture
  • A team of passionate professionals who are experts in their fields
  • Hybrid work environment (no core hours)
  • 25 days of paid time off
  • 4 days of paid Volunteering Leave per year to get involved in your local community or in a cause that matters to you
  • Meal allowance
  • Numerous events for employees (GIG - Tricentis annual Hackathon, Bug Festival, Magic the Gathering Afternoon, Diversity & Inclusion, Global Learning days, Training, Team Events, Company Events, …)
  • Employee Engagement/Wellbeing – recognition of excellence across all functions and levels; regular opportunities to celebrate and socialize (parties, sports events, BBQs) and much more.

About Tricentis:

Tricentis is a software company officially founded in 2007, with a primary focus on software quality assurance. Whether exploratory or automated, functional or performance, API or UI, as well as mainframes or custom applications or packaged applications, or cloud-native applications - our comprehensive suite of specialized Continuous Testing tools makes DevOps real by giving our clients the confidence to release on demand.

Tricentis has more than 1500 employees working in across over 20 global offices in US, EMEA, APAC serving over 2100 customers, and currently expanding our R&D centers to two new locations in the Czech Republic – Prague and Brno - Hybrid office environment.

Tricentis is proud to be an equal opportunity workplace. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran.