Join the b2venture community at one of our portfolio companies. See all open positions below.

Interested in working directly for b2venture? Learn more.

Senior Site Reliability Engineer (m/f/d) Berlin, Munich, or Hamburg

1KOMMA5°

1KOMMA5°

Software Engineering
Remote
Posted on Jun 10, 2024

1KOMMA5°

We are looking for you as an addition to our tech-team in Berlin, Munich or Hamburg. 1KOMMA5° is building Germany's largest one-stop-shop for sale, installation and services related to solar, heat pumps, electricity and charging infrastructure. And they are all connected! Be a part of our mission!

Your mission

In tech we are building software systems in two main areas:

  • Heartbeat: Building a virtual power plant leveraging our energy manager “Heartbeat” and evolving the customer experience app around this
  • Operating System for installation companies: Digitisation of operations and processes from planning to installation and configuration

As a Senior Site Reliability Engineer (SRE) you will join our Platform Engineering Team, focusing on consulting with software delivery teams on SRE principles and tech platform solutions while enhancing the internal developer platform. You are placing a strong emphasis on helping teams meet and exceeding Availability and Performance Service Level Objectives (SLOs) for the systems mentioned above.

Key responsibilities include but are not limited to:

  • Implement and improve monitoring, alerting, and incident response systems and processes to ensure high reliability for our customers and meet defined SLOs
  • Design, build, and maintain resilient, scalable infrastructure utilizing SRE principles and best practices
  • Conduct post-incident reviews and contribute to continuous improvement efforts
  • Execute performance testing, analyze system bottlenecks, and formulate strategies for capacity planning to ensure our systems meet current and future demands effectively

Technologies we work with include:

  • Google Cloud Platform (CloudRun, CloudSQL, CloudMonitoring, etc.)
  • Terraform / Terramate
  • Datadog
  • GitHub Actions
  • Python / GoLang / TypeScript

Your profile

  • 5+ years of experience in a Platform Engineering (DevOps), Site Reliability, or comparable Cloud Engineering position
  • Strong understanding and practical application of Site Reliability Engineering (SRE) principles, methodologies, and best practices
  • Proficiency in programming/scripting languages such as Python, GoLang, or TypeScript
  • Prior experience in incident management, post-incident reviews, and implementing improvements to prevent future incidents
  • Ability to troubleshoot complex technical issues systematically and effectively
  • Good experience working with a public cloud provider, ideally Google Cloud Platform (GCP), and a solid understanding of its observability services
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks
  • Excellent communication skills to convey technical concepts and collaborate effectively with diverse teams
  • Very good knowledge of spoken and written english, german is a plus
  • Residency in Germany
  • Interest in climate tech industry

Bonus points for:

  • Prior experience with IoT applications
  • Having worked in a scale up environment at a company of similar size

We encourage candidates to apply even if they do not meet all the requirements, as we are seeking individuals at various seniority levels who can bring diverse perspectives and skills to our team. We value the opportunity to consider a wide range of experiences and qualifications in our selection process.

Benefits

  • The possibility to work remotely
  • Individualized opportunities for professional development
  • Being part of a global change movement
  • Working in a diverse team of motivated team players
  • Seeing the impact of your work, daily