RAY AI

Cloud Engineer

Cape Town

We are seeking a highly skilled Cloud Engineer with infrastructure security expertise to design, build, and secure our hybrid infrastructure (cloud and on-prem). The ideal candidate will have deep experience with Kubernetes, Terraform, and Helm, and a strong background in infrastructure security, DevSecOps, and on-prem deployments. This role is critical for architecting scalable, secure, and observable infrastructure that supports mission-critical applications and large language model (LLM) workloads.

Your Responsibilities:

  • Infrastructure & Cloud Management

    • Deploy and manage Kubernetes clusters (cloud & on-prem) using Terraform and Helm.

    • Design secure network topology (VPCs, VPNs, firewalls).

  • Infrastructure Security & Zero Trust

    • Implement zero trust models, IAM, and least-privilege access.

    • Enforce security policies, micro-segmentation, and secrets management.

  • DevSecOps & CI/CD Security

    • Integrate security scanning, SBOM, and policy-as-code into pipelines.

    • Automate compliance and security checks during build and deploy.

  • LLM & Hybrid Deployments

    • Build and maintain infrastructure for LLM workloads (vLLM, KServe).

    • Support hybrid cloud and on-prem deployments ensuring consistency and security.

  • Monitoring & Observability

    • Implement monitoring, logging, and alerting using Grafana, Azure Monitor, and Prometheus.

    • Maintain dashboards, SLIs/SLOs, and performance metrics.

  • Linux & Automation

    • Harden Linux systems, automate routine tasks, and support incident response.

    • Develop scripts and tools to streamline operations.

  • Collaboration & Strategy

    • Partner with engineering, security, and operations teams.

    • Mentor teams on cloud best practices and emerging technologies.

What we look for:

  • Strong experience with Kubernetes, including cluster provisioning, scaling, and security.

  • Proficient in Terraform and Helm for infrastructure-as-code and deployment automation.

  • Expertise in infrastructure security, zero trust models, and IAM best practices.

  • Hands-on experience with DevSecOps: security scanning, SBOM generation, secrets management, and policy-as-code.

  • Solid understanding of cloud networking: VPC design, VPN, and firewall configuration.

  • Experience with hybrid or on-prem deployments alongside cloud environments.

  • Skilled in Linux administration, scripting, and automation for operational efficiency.

  • Familiarity with monitoring and observability tools (Azure Monitor, Grafana, Prometheus).

  • Experience building and managing infrastructure for LLM or AI workloads (vLLM, KServe).

  • Nice-to-have:

    • Cloud and security certifications (e.g., CKA/CKAD, Terraform Associate, CISSP).

    • Experience with GitOps workflows (Argo CD, Flux) and CI/CD security pipelines.

    • Knowledge of policy frameworks (OPA, Gatekeeper, Kyverno) and workload identity systems (SPIFFE/SPIRE).

    • Familiarity with GPU/accelerator-based infrastructure for ML/LLM workloads.

    • Background in SRE practices, including SLO/SLI design and incident response.

    • Contributions to open-source cloud, DevSecOps, or LLM infrastructure projects.

What we offer:

  • Competitive salary and performance-based bonuses.

  • Fully remote, flexible work environment.

  • Modern laptop and hardware provided by us.

  • Specialized training in AI, automation, and digital productivity tools.

  • Global exposure: collaborate with top-tier founders and fast-growing startups.

  • Continuous learning and career growth opportunities in an international environment.
