Principal Engineer, Site Reliability Engineer
We usually respond within two weeks
Driven by a passion to improve the quality of people’s lives, WSA continues to grow as a market leader in the hearing aid industry. With our commitment to increase penetration in an underserved hearing care market, we want to accelerate our business transformation in order to reach more people, more effectively.
We are at a pivotal moment in our journey - transforming from a world-leading hearing aid manufacturer into a digital-first hearing healthcare company. Our cloud platforms now power personalized hearing experiences for millions of users globally, connecting sophisticated hearing devices with real-time audiological services and telehealth capabilities. As we scale this ecosystem, we need a visionary SRE Principal Engineer to architect and champion the reliability practices that will ensure our digital transformation succeeds - because when our systems falter, people miss the sounds that matter most.
Join our innovative team and learn how our hearing aids and service solutions can transform customers' lives by making wonderful sound part of everyone's life.
We are looking for an SRE Principal Engineer with outstanding domain expertise in the following fields: reliability, failover, public clouds, and cloud-native technologies.
What you will do
Lead Reliability Strategy & Architecture
Lead architecture and reliability standards across global multi-hyperscaler cloud platforms by partnering with product, engineering, and platform teams to define best practices, deliver guidance on resilient distributed systems, microservices, and event-driven architectures for real-time hearing aid data streams, and promote portable cloud-native stacks that eliminate vendor lock-in
Design and implement high-availability and disaster recovery solutions, including multi-region deployments and active-active/active-passive failover models, to ensure 99.99% uptime for WSA’s mission-critical medical-grade SaaS platform
Ensure Operational Excellence
Oversee system reliability, uptime, and operational health across the global cloud platform supporting millions of connected hearing devices by defining and implementing reliability metrics, SLIs, SLOs, and operational dashboards, while monitoring golden signals and building advanced correlations across logs, metrics, and traces to detect leading failure indicators early and ensure rapid resolution
Drive operational readiness and continuous improvement through multi-region high-availability deployments, disaster recovery procedures, rigorous incident management, root cause analysis, and turning every incident into systemic reliability enhancements for mission-critical workloads
Build Resilient Infrastructure
Lead the architecture and implementation of resilient cloud infrastructure and distributed systems, promoting best practices across teams. This includes building platform-portable architectures (e.g., portable database APIs and message-driven systems) with cross-platform failover capabilities, such as integrating Azure Service Bus with RabbitMQ on Kubernetes, to ensure high availability and flexibility
Champion an automation-first approach using infrastructure as code to deliver reliable and maintainable platforms, while establishing a modern observability framework. This involves implementing OpenTelemetry with Prometheus and Grafana to enable deep system insights, proactive monitoring, and full end-to-end traceability across all services and environments
Mentor & Elevate Teams
Scale SRE practices across the organization by mentoring senior engineers, developing strong technical leaders, and embedding repeatable reliability frameworks. Foster a culture of continuous learning by promoting blameless postmortems, encouraging teams to learn from failures, and driving ongoing improvement in system reliability and team practices
Continuously assess and adopt new technologies and architectural approaches that enhance system reliability, scalability, and overall platform resilience, ensuring the organization evolves with modern engineering standards and remains adaptable to changing demands.
What you bring
Experience
A minimum of 8 years of relevant experience in SRE, software engineering, or related roles, with a proven track record of leading reliability initiatives and mentoring teams
Experience in designing and operating large-scale SaaS platforms in public cloud environments while applying cloud-agnostic architectural principles
Expertise in distributed systems, their architecture patterns and failure modes, including microservices, event-driven systems, and messaging platforms (Service Bus, Kafka, RabbitMQ)
Experience designing highly available, fault-tolerant systems, including multi-region deployments, cross-platform failover mechanisms, and disaster recovery architectures
Hands-on experience with container orchestration platforms such as Kubernetes and designing portable platform infrastructure
Software engineering proficiency in languages such as C#, Python, TypeScript, or Go, with a focus on building scalable and maintainable systems
A good track record of implementing infrastructure-as-code and automation frameworks
Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
Personal competencies
Strategic thinker with good analytical and problem-solving skills who can balance long-term architectural vision with pragmatic delivery
Good communication and collaboration skills - you can articulate complex technical concepts to both engineering teams and business stakeholders
Calm under pressure with proven ability to lead through high-stakes incidents
Natural mentor who enjoys developing others and building high-performing teams
Who we are
At WSA, we provide innovative hearing aids and hearing health services.
Together with our 12,000 colleagues in 130 countries, we invite you to help unlock human potential by bringing back hearing for millions of people around the world.
With us, you will become part of a truly global company where we care for one another, welcome diversity and celebrate our successes.
Sounds wonderful? We can't wait to hear from you.
WSA is an equal-opportunity employer and is committed to creating an inclusive employee experience for all. Regardless of race, color, religion, national origin, age, sex, gender, gender identity, gender expression, sexual orientation, marital status, medical condition, ancestry, disability, or military or veteran status, we firmly believe that our work is at its best when everyone feels free to be their most authentic self.
- Department: Research & Development
- Role: Software & Digital Solutions
- Locations: Erlangen, Germany; Lynge, Copenhagen Region, Denmark