Search
Search
Location
RAY AI Logo

RAY AI

3.7

Data Scientist

Cape Town

We are seeking a highly skilled Data Scientist with deep expertise in modern AI/ML systems, including LLMs, multimodal models, fine-tuning techniques, and advanced retrieval architectures. In this role, you will design, prototype, and deploy AI-powered solutions that leverage state-of-the-art language, vision, and agentic frameworks. You will work closely with engineering, product, and research professionals across the US and Europe to bring cutting-edge AI capabilities into production environments.

Your Responsibilities:

  • Design, build, and optimize LLM-powered systems using OpenAI, Anthropic, and open-source/local model families.

  • Architect and implement RAG pipelines, including hybrid search, query rewriting, prompt optimization, and reranking strategies.

  • Develop and maintain vector database infrastructures (Pinecone, Weaviate, Qdrant) for large-scale embedding storage and fast retrieval.

  • Train, evaluate, and retrain embedding models for domain-specific semantic search and knowledge retrieval.

  • Build and integrate multimodal AI solutions using OCR, CLIP, and modern vision architectures for text-image understanding.

  • Apply fine-tuning techniques (LoRA/QLoRA) to adapt foundation models to organizational datasets and specialized tasks.

  • Develop production-ready AI applications using Python, PyTorch, and modern orchestration frameworks.

  • Implement LLM orchestration with LangChain or LlamaIndex, including evaluators, tool abstractions, memory, and RAG components.

  • Establish robust evaluation frameworks to measure model performance, reduce hallucination, and ensure reliability in production.

  • Build agentic workflows using AutoGen, CrewAI, or similar frameworks to power automation and multi-agent collaboration systems.

  • Stay current with research trends and apply theoretical and practical insights in Generative AI to drive innovation across the organization.

What we look for:

  • Experience in applied machine learning or data science, with at least 2 years focused specifically on LLMs or Generative AI.

  • Demonstrated experience building end-to-end RAG, fine-tuning, or multimodal AI systems.

  • Strong proficiency in Python, PyTorch, and AI tooling ecosystems.

  • Experience deploying models at scale in production environments.

  • Strong understanding of evaluation metrics, model reliability, and safety/reduction of hallucination.

  • Familiarity with vector embeddings, vector databases, and semantic search.

  • Experience with agent frameworks such as AutoGen, CrewAI, or LangGraph-like toolkits.

  • Experience with distributed training, model optimization, quantization, or GPU acceleration.

  • Knowledge of DevOps/MLOps tooling for deploying LLM-based systems.

  • Contributions to open-source LLM or RAG projects.

What we offer:

  • Competitive salary and performance-based bonuses.

  • Fully remote, flexible work environment.

  • Modern laptop and hardware provided by us.

  • Specialized training in AI, automation, and digital productivity tools.

  • Global exposure—collaborate with top-tier founders and fast-growing startups.

  • Continuous learning and career growth opportunities in an international environment.

Working here doesn’t have to be a secret

Sign in to browse authentic reviews, anonymous ratings and salary data before you apply.

3.7
  • 51 %
    Recommend to a friend
  • N/A
    Approve of CEO
  • CEO: 0 Ratings