Platform Engineer
Descrição da vaga
About Digibee
Digibee is an iPaaS that scales integration workflows while reducing cost and technical debt. Rather than require specialized integration experts, Digibee lets every developer quickly build, test, deploy, govern, and monitor integrations across on-premise and cloud environments using a simple but powerful low-code interface.
Founded in São Paulo, Brazil, in 2017 and headquartered in Weston, Florida, our team is widely distributed throughout the Americas. In May of 2023, Digibee closed a Series B funding round of $60 million that is intended to drive our expansion in the United States.
About the role
We are looking for a Platform Engineer to join the Streamline Engineering team within our Platform Vertical. This team owns two complementary tracks — Cloud Projects and Developer Experience (DevEx) — and is responsible for the foundational infrastructure and internal platforms that power Digibee's multi-cloud integration platform.
In this role you will design, build, and operate the platform capabilities that engineering teams rely on every day: Kubernetes infrastructure across multiple cloud providers, self-service provisioning tooling, CI/CD standards, and the observability framework that keeps our SaaS platform healthy. You will treat the platform as a product — reducing developer friction, eliminating manual touchpoints, and enabling teams to ship with velocity and safety.
Responsabilidades e atribuições
On a typical day, you will…
- Design, implement, and evolve Infrastructure as Code (Terraform, Flux, Helm) to provision and manage Kubernetes clusters across AWS, GCP, Azure, and upcoming cloud providers.
- Build and maintain self-service capabilities — automated provisioning pipelines, environment management, and deployment workflows — that allow engineering teams to operate their cloud resources autonomously without manual platform team involvement.
- Architect and operate GitOps-driven CI/CD pipelines, establishing Golden Paths and standardized templates that raise the quality baseline across all product teams.
- Lead the DevSecOps strategy for the platform, embedding compliance policies (SOC2, GDPR, customer audits) directly into automation pipelines and Kubernetes admission controls.
- Define and monitor SLIs/SLOs for platform-owned systems; drive incident response during major outages and lead blameless post-mortem analysis and follow-up actions.
- Manage and evolve developer tooling platforms including Backstage, GitLab, Artifactory, and SonarQube — continuously improving the developer experience based on team feedback.
- Define the observability framework and standards (metrics, tracing, log aggregation) across multiple teams, ensuring platform reliability through proactive system health monitoring.
- Develop Kubernetes operators and other cloud-native automation to reduce operational toil and increase platform resilience.
- Conduct disaster recovery planning, testing, and runbook documentation; create and maintain Production Readiness Reviews and operational procedures.
- Act as a Platform Engineering Champion for domain teams during refinements and technical spikes, providing dedicated sprint capacity for platform-related initiatives.
- Monitor and analyze infrastructure costs across cloud platforms; create tooling for proactive capacity planning.
Requisitos e qualificações
What you’ll need to bring…
Software Engineering & Architecture
- 3+ years of experience in Platform Engineering, DevOps, Cloud Engineering, or Site Reliability Engineering roles.
- Deep knowledge of Kubernetes and its ecosystem (cluster lifecycle, node pools, networking, RBAC, resource optimization, operators) across at least two major cloud providers — this is a mandatory requirement.
- Hands-on production experience with Infrastructure as Code tools, preferably Terraform, Flux, and Helm.
- Solid Linux systems administration and networking fundamentals (load balancers, DNS, VPNs, firewalls, TCP/IP).
- Strong information security fundamentals with hands-on experience applying DevSecOps practices to cloud infrastructure, including Policy-as-Code and compliance requirements (SOC2, GDPR).
- Experience designing and operating CI/CD pipelines and GitOps workflows (GitLab CI, FluxCD, ArgoCD, or similar).
Reliability & Operations
- Solid observability experience with modern tooling (OpenTelemetry, Prometheus, or similar).
- Strong grasp of SRE practices: SLI/SLO definition, error budgets, and blameless postmortems.
- Experience driving incident response processes, post-incident reviews, and operational follow-up actions.
- Demonstrated ability to create and maintain production readiness documentation, runbooks, and disaster recovery procedures.
- Familiarity with Chaos Engineering principles and proactive reliability practices.
Developer Experience (DevEx)
- Demonstrated ability to build self-service platform capabilities that reduce developer toil and eliminate ticket-driven workflows.
- Experience managing and evolving developer tooling platforms (Backstage, GitLab, Artifactory, SonarQube, or similar).
- Customer-focused mindset — ability to translate developer pain points into practical, adoptable platform solutions.
- Strong communication skills in both English and Portuguese (written and verbal), with proven ability to collaborate across cross-functional, remote-first teams.
Informações adicionais
It’s a plus if you have…
- CNCF Kubernetes certifications (CKA, CKS, or CKAD) or cloud platform certifications (AWS Solutions Architect, GCP Professional Cloud Engineer).
- Experience with managed Kubernetes services (EKS, GKE, AKS) in large-scale, multi-tenant environments.
- Familiarity with developer portal platforms such as Backstage, and experience driving organizational adoption of internal platforms.
- Experience promoting and driving adoption of AI-assisted engineering tools within engineering teams, including documentation, enablement, and integration into existing workflows.
- Experience with observability platforms beyond standard Prometheus/Otel stacks (Dash0, Cribl, Splunk, or similar).
- Familiarity with Chaos Engineering principles and "everything-as-Code" (EaC) patterns.
- Experience with multi-cloud networking and cross-cloud connectivity patterns.
- Background in establishing governance structures, RFCs, and engineering standards at the org level.
Etapas do processo
- Etapa 1: Cadastro
- Etapa 2: People Interview
- Etapa 3: Hiring Manager Interview
- Etapa 4: Cross-Functional Interview
- Etapa 5: Leadership Interview
- Etapa 6: Contratação
Together we can shape the future of integration!
We are a global integration software company with a Brazilian foundation. Our supportive work environment inspires our employees to give their best. Our people love what they do, we work hard and have fun doing it. We know our integration platform is special, and we’re excited to share it with our team, our customers, our industry, and the world.
Social Media