Cubbit

Site Reliability Engineer

Listing valid until 30/09/2026

Skills and experience

Required skills

  • Python
  • AWS
  • Kubernetes
  • Docker
  • Pulumi
  • Golang
  • Grafana
  • Prometheus

A quick overview

Work location:
Full remote or hybrid (option to work from our offices in Bologna or Milan)
Experience:
Mid level, 2-4 years of experience
Languages requested:
English (full professional proficiency); Italian is nice to have
Compensation:
€35,000–€50,000 gross/year, depending on your experience and the impact you'll bring + Employee Stock Ownership Plan (ESOP)

We don't just apply cutting-edge technology. We create it.

At Cubbit, we're building the next generation of cloud storage: geo-distributed by design and independent from hyperscalers. Reliability is at the heart of everything we do.

We're looking for a Mid-Level Site Reliability Engineer to join our Tech Operations team and help keep our platform resilient, scalable, and always available. You'll work at the intersection of software and infrastructure, collaborating with engineering teams to automate operations, improve observability, and solve complex production challenges before our users even notice them.

If you enjoy building reliable systems, automating everything that shouldn't be done twice, and turning operational pain into engineering improvements, we'd love to meet you.

What you’ll do

  • Keep Cubbit's geo-distributed platform healthy, reliable, and performant across production environments.
  • Build automation that eliminates manual work and makes operations safer and faster.
  • Improve observability through meaningful metrics, dashboards, logging, and alerting.
  • Investigate production incidents and complex customer escalations, perform root cause analysis, and turn learnings into long-term reliability improvements.
  • Partner with Software Engineers to design systems that are reliable by default.
  • Continuously improve CI/CD pipelines and deployment workflows.
  • Participate in the on-call rotation while helping make it quieter every week.
  • Optimize performance, scalability, and operational efficiency across our infrastructure.
  • Contribute to operational best practices, documentation, and a healthy engineering culture.

What you'll need

  • 3+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or similar roles.
  • Strong Linux administration skills and a solid understanding of production environments.
  • Hands-on experience with Kubernetes and containerized workloads.
  • Experience with Infrastructure as Code tools (Pulumi preferred).
  • Good understanding of networking fundamentals (DNS, TCP/IP, HTTP, TLS, load balancing).
  • Experience with monitoring and observability tools such as Prometheus, Grafana, Loki, ELK/OpenSearch, or similar.
  • Scripting skills in Bash, Python, or similar languages.
  • Familiarity with CI/CD pipelines and modern deployment practices.
  • Confidence troubleshooting production systems under pressure.
  • Professional proficiency in English.

Bonus points

You'll stand out if you have experience with:

  • Technical Support (L2/L3), Technical Operations, or customer-facing infrastructure support.
  • Managing complex production incidents and technical escalations.
  • Distributed systems, cloud infrastructure, or S3-compatible object storage.
  • GitOps practices and tools such as ArgoCD or Flux.
  • Cloud platforms such as AWS, Azure, or GCP.
  • Security best practices, infrastructure hardening, or open-source contribution.

Profile and mindset

We're looking for someone who:

  • Thinks in systems, not just servers.
  • Automates first and hates doing the same thing twice.
  • Takes ownership and follows problems through to the root cause, not just the quick fix.
  • Thrives in collaborative environments where engineering and operations work as one team.
  • Enjoys working in a fast-paced scale-up environment, where priorities evolve, ownership is expected, and everyone contributes beyond their role.
  • Stays calm when production gets noisy and enjoys solving problems that matter.
  • Is curious, eager to learn, and continuously looking for ways to improve.
  • Balances pragmatism with engineering excellence: you know when "good enough" is actually the best solution.
  • Believes that reliability is a feature, not an afterthought.

Why you’ll love working with us

  • Join an ambitious and supportive team of builders and innovators
  • Remote-friendly environment with flexible schedules
  • Opportunity to contribute to one of Europe’s most advanced cloud technologies

Location

Full remote work is welcome at Cubbit, and you are free to use the lounge area in any of our co-working partner offices worldwide. Our headquarters, in the centre of Bologna, Italy, is also always available to you. Let's find the right workspace for you.

Cubbit


Site Reliability Engineer

  • Full time
  • Mid
  • Remote

Salary EUR 35000 - 40000
