Where

Site Reliability Engineer

Hire Resolve
Johannesburg Full-day Full-time

Description:

A leading Fintech company is looking for a Site Reliability Engineer based in Johannesburg, GP.

As a Site Reliability Engineer, you will be the guardian of our technical stability and infrastructure performance. You will manage and optimise hosting environments across production and development instances, covering platforms like Odoo ERP, WhatsApp chatbot systems, APIs, internal tools, external facing websites and reporting databases. Your work ensures that the systems powering over 50 000 Sales Force members and thousands of end users remain resilient, scalable and secure.

Responsibilities:
  • Manage and monitor the infrastructure of our ERP systems, applications, APIs, and databases.

  • Ensure high availability and scalability of production environments and development pipelines.

  • Administer cloud environments including deployments, rollbacks, and updates.

  • Establish and maintain CI/CD workflows for rapid and safe deployments.

  • Set up monitoring, logging and alerting systems to track system health and performance.

  • Investigate and resolve production incidents in a timely and thorough manner.

  • Implement backup, recovery, and failover processes to ensure data integrity.

  • Improve observability and reporting across environments and services.

  • Harden infrastructure security and enforce access controls and best practices.

  • Support development teams with staging, test, and release environments.

  • Automate routine tasks to improve system efficiency and reduce human error.

  • Set up and manage reliable and scalable hosting environments.

  • Diagnose and resolve incidents efficiently with minimal downtime.

  • Collaborate with software teams to enable faster and safer deployments.

  • Document infrastructure processes and maintain infrastructure knowledge bases.

  • Implement DevOps and SRE practices tailored to a fast-moving startup context.

  • Build processes that are robust and scale as the company grows.

  • Balance performance, security, and simplicity in all infrastructure decisions.

Requirements:
  • A tertiary qualification in Computer Science, Information Technology or a related field.

  • Minimum of 3 years’ experience in system administration, DevOps or SRE role.

  • Strong problem solving, troubleshooting, and communication skills.

  • Proficiency in English.

  • Knowledge and experience with Odoo hosting and maintenance workflows.

  • Knowledge and experience with hosting ERP systems, databases and API driven platforms.

  • Knowledge and experience with securing web infrastructure and access credentials.

  • Knowledge and experience with optimising costs and performance in cloud environments.

  • Knowledge and experience with scripting and automation using Bash, Python or similar.

  • Knowledge and experience with logging and system observability tools.

  • Knowledge and experience with fast recovery planning and disaster mitigation.

Contact Hire Resolve for your next career-changing move.
Our client is offering a highly competitive salary for this role based on experience.
Apply for this role today, contact Gaby Turner at gaby.turner@hireresolve.us or on LinkedIn
You can also visit the Hire Resolve website: hireresolve.us or email us your CV: itcareers@hireresolve.za.com

Requirements:

  • Manage and monitor the infrastructure of our ERP systems, applications, APIs, and databases.

  • Ensure high availability and scalability of production environments and development pipelines.

  • Administer cloud environments including deployments, rollbacks, and updates.

  • Establish and maintain CI/CD workflows for rapid and safe deployments.

  • Set up monitoring, logging and alerting systems to track system health and performance.

  • Investigate and resolve production incidents in a timely and thorough manner.

  • Implement backup, recovery, and failover processes to ensure data integrity.

  • Improve observability and reporting across environments and services.

  • Harden infrastructure security and enforce access controls and best practices.

  • Support development teams with staging, test, and release environments.

  • Automate routine tasks to improve system efficiency and reduce human error.

  • Set up and manage reliable and scalable hosting environments.

  • Diagnose and resolve incidents efficiently with minimal downtime.

  • Collaborate with software teams to enable faster and safer deployments.

  • Document infrastructure processes and maintain infrastructure knowledge bases.

  • Implement DevOps and SRE practices tailored to a fast-moving startup context.

  • Build processes that are robust and scale as the company grows.

  • Balance performance, security, and simplicity in all infrastructure decisions.

  • A tertiary qualification in Computer Science, Information Technology or a related field.

  • Minimum of 3 years’ experience in system administration, DevOps or SRE role.

  • Strong problem solving, troubleshooting, and communication skills.

  • Proficiency in English.

  • Knowledge and experience with Odoo hosting and maintenance workflows.

  • Knowledge and experience with hosting ERP systems, databases and API driven platforms.

  • Knowledge and experience with securing web infrastructure and access credentials.

  • Knowledge and experience with optimising costs and performance in cloud environments.

  • Knowledge and experience with scripting and automation using Bash, Python or similar.

  • Knowledge and experience with logging and system observability tools.

  • Knowledge and experience with fast recovery planning and disaster mitigation.

30 Sep 2025;   from: careers24.com

Similar jobs

  • Hire Resolve
  • Johannesburg
... looking for a Site Reliability Engineer based in Johannesburg, GP. As a Site Reliability Engineer, you will be ... looking for a Site Reliability Engineer based in Johannesburg, GP. As a Site Reliability Engineer, you will be ...
15 hours ago
  • Hire Resolve
  • Johannesburg
... looking for a Site Reliability Engineer based in Johannesburg, GP. As a Site Reliability Engineer, you will be ...
15 hours ago
  • Hire Resolve
  • Johannesburg
... looking for a Site Reliability Engineer based in Johannesburg, GP. As a Site Reliability Engineer, you will be ...
15 hours ago
  • Hire Resolve
  • Johannesburg
... looking for a Site Reliability Engineer based in Johannesburg, GP. As a Site Reliability Engineer, you will be ...
15 hours ago