Site Reliability Engineer

Software · Coimbra, Coimbra
Department Software
Employment Type Full-Time
Minimum Experience Experienced

Porto, Coimbra, Lisbon, PT


Working with an Internet of Things company for the Industrial Gases, Rail, Energy, Oil & Gas and Water/wastewater markets, the Site Reliability Engineer will be our champion for availability, scalability, latency, and efficiency. You will be part of a team that will build the next generation of cloud-native high-scalable software micro-services for digital twins, analytics, web and mobile apps. We offer a relaxed working environment with no dress code. Be part of a small product development team, where everybody as a voice and your opinion is heard. We offer health insurance and fruit for healthy snacks. We provide high end portable workstation and 27" 1440p extra monitor as working tools.

1. Main Duties. The successful candidate will be:

  • Building a world-class IOT platform
  • Developing a micro-services architecture, ensuring security, high-availability and scalability
  • Ensure that cloud operations can be executed with no customer downtime
  • Collaborate with the product teams to design and develop systems that are resilient and highly performant at scale
  • Monitor infrastructure, measuring availability and system health
  • Perform blameless root cause analyses on outages an ensure action items are done
  • Collaborate with customer support in recovering form outages
  • Troubleshoot complex incidents in highly distributed systems
  • Shorten time to detecting by improving the accuracy of alarms
  • Be a key stakeholder in the design of services so that they are resilient from day 0


2. Core Skills & Experience

  • Minimum +2 years on a SRE role or similar
  • Experience in designing resilient and fault-tolerant systems
  • Experience with at least on one programming language (Java, C#, Python, etc.)
  • Experience in debugging complex, distributed systems 
  • Love for automation
  • Fluency in English, written and spoken (mandatory)
  • Computer Science or other related engineering degree

3.Desirable Skills/Experience

  • Familiar with IoT and Cloud projects
  • Experience working on a high growth startup like environment 
  • Experience with Azure services is a plus
  • Experience with Docker and Kubernetes is a plus
  • Experience with automation and IaC is plus (Terraform, chef, etc)
  • Understanding of monitoring tools such as Prometheus, ELK, Grafana
  • Experience in troubleshooting and debugging
  • Experienced with public clouds such as AWS and Azure
  • Understanding of data stream platforms, messages brokers and queues (eg. Kaffka, RabbitMq, Azure Service Bus, Azure Events Hub)
  • Experience of working with Agile methodologies (eg. Scrum).

Thank You

Your application was submitted successfully.

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

  • Location
    Coimbra, Coimbra
  • Department
  • Employment Type
  • Minimum Experience