Your tasks:
- monitoring and incident management: You provide our services with the important metrics to be able to monitor our system permanently for malfunctions. In the event of an incident, you will support debugging in the Google Cloud in collaboration with the development teams in order to rectify the error as quickly as possible. Managing incident response activities and being on-call will also be part of your role.
- automation and scalability: You will take our cloud infrastructure to the next level by identifying and automating recurring tasks. You will also recommend managed services from the Google Cloud to the development teams and promote their use within the company. You will also be responsible for the continuous development of our deployment process, helping the teams to achieve simpler and more secure deployments in Kiwigrid's production environment.
- security and compliance: You will help Kiwigrid achieve a secure cloud environment by integrating tooling to monitor vulnerabilities and detect external, potentially dangerous activities. You are involved in the creation of compliance guidelines and support audits.
- documentation and knowledge sharing: You help the development teams to prepare their documentation in such a way that the stack and request flows are comprehensible. In addition, you bring best practices from the SRE team to the development teams in order to achieve a high degree of standardisation.
