Senior Site Reliability Engineer (m/w/d)

OBI Group Holding SE & Co. KGaA • Cologne

Schanzenstraße 39, 51063 Köln

As part of OBI, we empower people to design their homes creatively and individually. Whether it's a new look for the kitchen or the garden project with the whole family, we inspire, advise and help make great DIY projects happen. Work with us to make the DIY store digital and develop a comprehensive service ecosystem. This is how we create unique customer experiences!

Here at Obi, we develop and operate a customer engagement and experience platform - based on a state-of-the-art cloud services architecture. We offer comprehensive analytics for our business and IT departments, marketing automation and personalisation capabilities for our new customer loyalty programme heyOBI. As Senior Site Reliability Engineer (SRE), you will work in the Data Engineering department, collaborating with Data Engineers, Data Scientists, and Software Engineers to develop and maintain a new Data Platform built on AWS which utilizes tools such as Databricks, Airflow, Airbyte, SQL & NoSQL databases and custom containerized and serverless services.

Your duties

• Design, build, and maintain infrastructure and automation tools to support the Data Platform using IaaC practices and Terraform

• Monitor and troubleshoot platform performance and reliability issues

• Collaborate with Data Engineers, Data Scientists, and Software Engineers to optimize platform performance and scalability

• Implement and maintain disaster recovery and business continuity processes

• Conduct root cause analysis and implement solutions to prevent future issues

• Continuously improve processes and infrastructure to enhance platform reliability and performance

• Implement and maintain DevOps practices within the team, including continuous integration and deployment

• Implement and maintain a security by design approach within the team, ensuring the security and privacy of data and systems

• Mentor and lead junior Data Engineers, providing guidance on best practices and processes

Your profile

• Bachelor's degree in Computer Science or related field

• 5+ years of experience in site reliability engineering or a related field

• Strong experience with AWS and Python

• Experience with other tools such as SQL and NoSQL databases and orchestration services

• Experience with GitLab, JIRA, and Confluence

• Experience with IaaC and DevOps practices, including Terraform and CI/CD

• Experience with security by design principles

• Excellent problem-solving and communication skills

• Ability to work in a fast-paced and dynamic environment

• Experience leading and mentoring team members preferred

• Full-Remote: if you desire

 

Similar jobs
We evaluate all jobs for you in order to suggest similar jobs that match the tasks and required skills.
Your contact person

Alexander Schmidt-Blacha

 

YOU WANT THE JOB?

I want the Job!