Site Reliability Engineer, Cloud Operations
OverIT - Field Service Management
📍 Italy, IT0🕐 5 giorni fa
Candidati ora →
Crea un account gratis in 30 secondi: ottieni anche il match score AI con il tuo CV.
Descrizione
Job Description
OverIT at a glance
OverIT is a global Software-as-a-Service (SaaS) company with a strong presence in North America and Europe.
We empower organizations in the power, utility, telco, and transportation industries to manage their mission-critical infrastructures efficiently and safely through cutting-edge Field Service Management software solutions.
At OverIT, we leverage advanced technologies like ML (Machine Learning), AR (Augmented Reality), IoT (Internet of Things), and GIS (Geographic Information System) to help ensure the infrastructures essential to our daily lives are always on.
If you want to be part of a top technology brand, join us!
What you’ll do
Act as a Senior Cloud Reliability Engineer designing and operating the reliability, scalability, security and observability of the OverIT cloud-native SaaS platform running on AWS.
Drive observability initiatives across the company, managing and evolving monitoring and alerting platforms with a strong focus on Dynatrace, dashboards, anomaly detection, distributed tracing, and operational visibility.
Provide advanced troubleshooting and systemic improvements across compute, networking, storage, Kubernetes workloads, databases, IAM, and managed cloud services.
Perform root cause analysis (RCA) on incidents and actively contribute to blameless post-mortems.
Apply SRE principles such as Error Budgets, Service Level Objectives (SLOs), Service Level Indicators (SLIs), and operational excellence practices to balance platform stability and delivery velocity.
Identify and eliminate operational toil through automation, tooling, scripting, Infrastructure as Code, and process optimization initiatives.
Design, improve, and maintain backup and disaster recovery procedures, ensuring compliance with SLA/SLO targets.
What you’ll need
4–6 years of experience in Cloud Engineering, Platform Engineering, Site Reliability Engineering, or Cloud Architecture roles.
Strong hands-on experience with AWS environments including EKS, EC2, RDS, S3, IAM, VPC, Route53, CloudWatch, networking, and Load Balancing services, as well as Infrastructure as Code (IaC) practices and tools, preferably Terraform.
Strong knowledge of Dynatrace and modern observability standards (including OpenTelemetry) and practices in distributed cloud environments.
Strong understanding of metrics, logs, traces, alerting strategies, anomaly detection, and SLO/SLI-driven monitoring approaches.
Strong focus, ownership, and attention to detail in backup and disaster recovery strategies, including backup validation, restore procedures, retention policies, and operational resilience practices.
Security-first mindset with practical experience applying AWS security best practices, including IAM roles and policies, KMS encryption, Secrets Manager, and least-privilege access principles.
Experience with scripting and automation, preferably using Python.
Strong problem-solving attitude, ownership mindset, and ability to operate effectively during production incidents.
What's nice to have
Knowledge of Kubernetes ecosystem components, including service meshes (e.g., Istio, Linkerd).
Experience with Kubernetes observability, capacity planning, and cost optimization platforms such as Kubecost.
Experience supporting enterprise-grade or mission-critical SaaS platforms.
Experience with CI/CD and DevOps tooling.
Cloud or Kubernetes certifications, such as AWS Certified Solutions Architect Associate or Professional, AWS
Certified DevOps Engineer Professional, AWS Certified SysOps Administrator, or Certified Kubernetes Administrator (CKA).
What we offer
OverIT is a unique transformation project in the SaaS space arena, full of ambition to scale and grow globally.
International culture and environment with the opportunity to partner with an outstanding group of people and professionals who joined the company to scale and succeed
A career-defining opportunity with full exposure to two leading private equity firms.
Location-flexible smart working across Italy
Competitive pay, meaningful growth opportunities, and a transparent approach to compensation. Salary ranges are defined for each role based on experience, competencies, responsibilities, and geographic location, with additional benefits and performance-based incentives depending on the position.
The expected annual gross salary range for this position is €40,000–€55,000.
At OverIT we value diversity and are committed to equal employment opportunities regardless of religion, age, disability, sexual orientation, gender perception or identity, ethnicity, or place of origin.
TalentyGo è un aggregatore di offerte da fonti pubbliche. Verifica sempre le informazioni direttamente con l'azienda. La candidatura avviene tramite il sito originale dell'azienda; TalentyGo non gestisce processi di selezione.