

Monitoring & Observability Engineer
Job Description
There are NO limits to your career: come shape the future and be part of a truly unique global culture at OutSystems!
About This Role
An Monitoring & Observability Engineer is responsible for ensuring that complex systems are observable, resilient, and performant by designing, implementing, and maintaining monitoring, logging, and alerting solutions. This work enables teams to detect, diagnose, and resolve issues quickly, improving system reliability and availability.
At OutSystems, the Monitoring & Observability Engineers work closely with other Engineers, Product Managers, and Stakeholders to identify opportunities for improvement, implement best practices, and drive continuous optimization across our platforms.
What You Will Lead/Do or Key Responsibilities
As an Monitoring & Observability Engineer, these are your key responsibilities and duties:
- Observability Implementation: Develop and maintain telemetry systems, including logs, metrics, traces, and dashboards.
- Define and implement best practices for making systems and services measurable, and collaborate with stakeholders and teams to apply these practices
- Collaborate with engineering teams to implement modern instrumentation and telemetry signal collection for their services, ensuring meaningful insights are derived
- Create and maintain documentation related to the observability platform and SRE practices
- Work closely with development and operations teams to ensure optimal performance, availability, and security of services
- Contribute to our evolving "data-driven" and "cloud-first" culture through continuous learning
Monitoring & Observability Performance Indicators
The main KPIs that aid in understanding the impact and success of the Monitoring & Observability function at OutSystems are:
- Coverage & Adoption
- Telemetry Coverage (%) - Percentage of critical services instrumented with logs, metrics, and traces
- Log Completeness (%) – Percentage of logs successfully collected vs. expected logs.
- Tracing Coverage (%) – Percentage of distributed traces captured across service interactions.
- Metric Coverage (%) – Percentage of monitored services with key metrics (latency, errors, traffic, saturation).
- Dashboard & Alerting Adoption (%) – Percentage of teams actively using observability tools.
- Impact on Reliability
- Reduction in MTTR (%) – Improvement in Mean Time to Resolve due to observability insights.
- Root Cause Identified (%) – Percentage of incidents where observability tools helped determine a clear root cause.
- SLO Compliance (%) – Percentage of time services meet reliability objectives based on observability insights.
To illustrate the desired profile for a Monitoring & Observability Engineer. Nevertheless, the selection of candidates will always vary depending on specific knowledge of the field and prior experience.
Qualifications
- STEM degree (BSc, MSc, in Software Engineering/Computer Science or related fields);
- Strong experience in software development and/or operations;
- Proficiency in at least one high-level programming language (C++, Python, Java, C#, etc.).
- Strong troubleshooting and debugging skills.
- Fluency in English and excellent communication skills.
Soft Skills
- Communication - able to communicate effectively (in English) both orally and written showing empathy for the other person;
- Humbleness - accepts mistakes and acts accordingly, with a humble attitude, apologizing for them and mitigating them ASAP to avoid higher impact.
- Accountability - takes ownership of problems and makes sure to see them through. Even if he does not have all the necessary knowledge to move on alone, can involve the right people to reach closure.
- Negotiation Skills - has tough and politically complex conversations with colleagues and customers, defusing disagreements and leading towards a mutual agreement and understanding of all parties involved.
- Process Oriented - is organized and able to properly follow defined processes, whilst being able to properly challenge inefficient processes and suggest improvements.
- Problem-solving - Has a top-down approach to problems, breaking them into smaller pieces and solving them by starting with a wider scope and narrowing it down as the analysis progresses. Has critical thinking, so can analyze information objectively and make a reasoned judgment.
Technical Skills
- Strong experience with Observability/SRE tools, platforms, and standards, including but not limited to ELK Stack, Grafana, Prometheus, Loki, Nobl9
- Familiarity with modern logging frameworks and best practices: Opentelemetry, Logstash, Logguru, etc.
- Containerization technologies and orchestration platforms, mainly Kubernetes and EKS (CKA, CKAD, CKS certifications are valued);
- Experience with Python, Go, Bash/Shell scripting, or other automation tools/languages;
- Experience with automation and Infrastructure as Code (IaC) tools, such as AWS CloudFormation, Terraform, Puppet, Chef, Spacelift, etc;
- Strong understanding of designing resilient and fault-tolerant systems;
- Expertise in debugging complex distributed systems;
- Proficiency in monitoring and troubleshooting complex distributed systems.
The Longer Story:
OutSystems is a global leader transforming how companies innovate through software, empowering IT leaders with a better way to build the software that matters most.We are looking for talented and motivated people to join us in helping companies solve some of their most strategic business challenges, from modernizing their workplace processes to transforming their employee and customer experiences. As a member of the OutSystems global team, you will help build, deliver, manage, and evolve the software that is a low-code market leader and preferred by professional developers around the world.
OutSystems is a truly global company, with more than 800,000 developer community members, 1,700 employees, more than 500 partners, and thousands of active customers in over 75 countries and across 21 industries. Founded in 2001, OutSystems has offices in the United States, United Kingdom, the Netherlands, Portugal, Germany, the UAE, Japan, Hong Kong, Malaysia, Australia, India, and Singapore, and of course has a thriving, worldwide community of remote employees.
Working at OutSystems
Our goal is to ensure that OutSystems is a place for bright, happy, and motivated people who share a common purpose and take pride in excellent work towards our vision. Our culture is focused on building agility at scale, which allows us to operate with a high drive in a competitive market. In our federation of teams culture, if we have every team operating like a startup, we can all learn, grow, and innovate while having the space to be proactive and creative. We encourage our team members to collaborate, focus on results, act quickly, understand our business, and adopt a growth mindset.
What do we have to offer you?
- A company that continues to grow, change and innovate, and gives our teams the space to be proactive and creative.
- Real career opportunities. We care about growth and development. Vertical career progression is an obvious possibility, but we also offer the possibility for lateral moves, joining different teams, and mastering specific skills.
- Work colleagues that are as smart, hardworking and driven as you – and a team that is global.
- Disrupting the status quo is in our DNA. In fact, it’s why our company exists.
- We “Ask Why” a lot. It helps us connect our individual work to the bigger picture and sometimes even uncover a better way.
Are you ready for the next step in your career? Then we’d love to hear from you!
OutSystems nurtures an inclusive culture of diversity, where everyone feels empowered to be their authentic self and perform at their best. A company that embraces the creativity and innovation that comes through diverse perspectives. We are committed to creating a team that reflects society through inclusive programs and initiatives and are proud to be an equal opportunity employer. All qualified applicants receive equal consideration regardless of race, place of origin, color, age, marital status, religion, sex, sexual orientation, gender expression or identity, protected veteran status, disability status or any other status protected by law.
Join us in disrupting the status quo of the low-code market, we give you the power to "Ask Why", you give our customers the power to innovate through software!