Platform Monitoring Specialist

  • Permanent
  • Full time
  • Remote
  • Software - Platform

At Bloq.it, we are building the world's most advanced Smart Locker platform, enabling frictionless parcel delivery and return while driving sustainability and efficiency across last-mile logistics.

As the fastest-growing Smart Locker company in the world—and a top European scale-up—we’re looking for an experienced Platform Engineer focused on Monitoring and Observability to join our high-performing Platform team.

Your mission will be to design, scale, and maintain our monitoring stack, ensuring deep visibility into our hybrid cloud/edge systems and helping teams anticipate and resolve issues before they impact our customers.

What will be your responsibilities

  • Own and evolve our monitoring, alerting, and observability infrastructure, ensuring coverage across all environments (Cloud, Lockers, CI/CD pipelines);
  • Collaborate with engineering teams to define metrics, logs, and tracing strategies that reflect business-critical SLIs/SLOs;
  • Build and maintain dashboards and alerts using Datadog, driving insights for engineering, QA, and operations teams;
  • Act as first responder and escalation point during platform incidents, coordinating diagnostics and driving post-mortems;
  • Develop automated health checks, alert tuning processes, and data integrity checks for critical services;
  • Support continuous improvement of monitoring playbooks, runbooks, and documentation.

What are the requirements to join us in this position

  • Proven experience as a Platform, DevOps, or Site Reliability Engineer with a specialized focus on observability or monitoring;
  • Solid expertise with Datadog (or equivalent platforms like Prometheus, Grafana, New Relic);
  • Strong experience with AWS services, Linux system administration, and Infrastructure-as-Code (e.g., Terraform, CDK);
  • Proficiency with CI/CD pipelines and automation (GitHub Actions preferred);
  • Experience working with logging, tracing, and metric systems, and designing high-signal alerting rules;
  • Strong troubleshooting and problem-solving skills in production environments;
  • Fluent in English, both written and spoken.

Extra valued skills

  • 4+ years in a platform, SRE, or observability role in a production-grade environment;
  • Familiarity with MongoDB Atlas, MQTT brokers, and distributed edge devices;
  • Experience defining SLIs/SLOs/SLAs and implementing reliability guardrails;
  • Knowledge of incident response frameworks and root cause analysis methodologies.

What will you get if you join us in this position

  • The opportunity to join our Platform team and play a pivotal role in building and improving our infrastructure, contributing to innovative solutions that drive Bloq.it's revolution in the smart locker industry;
  • A dynamic and fast-paced work environment with a culture of innovation, collaboration, and continuous learning;
  • Competitive salary and benefits package, tailored to your experience and skills, including performance-based bonus and Portuguese health insurance;
  • Flexible work conditions, including a remote-friendly policy and a flexible schedule that allows you to balance your work and personal life;
  • Regular in-person meetings at our HQ office in Lisbon, PT, giving you the chance to connect with the team and immerse yourself in our company culture;
  • The chance to make a tangible impact by contributing to platform reliability, actively supporting our mission to provide affordable and sustainable solutions.

If you're a Monitoring-driven Platform Engineer who thrives in fast-paced, real-world environments and wants to build systems that just work, Bloq.it is your next home.

Join our team of #bloqstars and help us redefine the last-mile delivery experience!