Senior GenAI & HPC Engineer
Dell Technologies
Senior GenAI & HPC Engineer
Dell Technologies customers rely on our products and services to drive progress. So, we take the service we provide extremely seriously. Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives. As trusted advisors, we build in-depth knowledge of what each client wants to achieve. Then we make sure the services delivered by Dell Technologies deliver on all our promises. We also work closely with Sales and Global Services colleagues to develop strategic account growth plans, and to identify and pursue sales opportunities
Join us to do the best work of your career and make a profound social impact as a Senior GenAI & HPC Engineer on our Service Delivery Team in Malaysia.
What you’ll achieve
We’re seeking a Senior GenAI & HPC Engineer with deep experience in GPU‑accelerated systems, Linux performance tuning, and benchmarking. This role is highly hands‑on and customer‑facing, supporting onsite deployments across the South-East Asia/APJ for advanced HPC and GenAI solutions. You will work as a part of a team to help build, integrate, and test some of the world’s largest multi‑GPU systems, benchmark them using industry‑standard tools, make suggestions on how to optimize performance, and deliver the next generations of AI/HPC infrastructure.
You will:
Deploy, configure, and validate GPU‑accelerated compute clusters for AI, ML, and HPC with NVIDIA Base Command Manager (Warewulf and OpenHPC knowledge are a plus)
Perform benchmarking with HPL GPU, HPL MxP, STREAM, NCCL, RCCL, and related tools
Produce as-built documentation, performance reports, and share best practices amongst the team.
Configure and secure RHEL, Ubuntu, Rocky for GenAI or HPC workloads and learn constantly and get to work with the latest GenAI platforms and infrastructure.
Work directly with customers onsite (travel both in Malaysia, South East Asia and Potentially APJ)
Take the first step towards your dream career
Every Dell Technologies team member brings something unique to the table. Here’s what we are looking for with this role:
Essential Requirements
7+ years with HPC or GenAI clusters, GPU based systems, AI infrastructure, or related fields
Deep hands‑on experience with GPU deployment, configuration, and multi-node testing. Proficiency with benchmarking tools: HPL, STREAM, NCCL, RCCL, MxP
Red Hat certification (RHCSA/RHCE) or 7+ years of relevant RH distros experience
Experience with GenAI/HPC networking (InfiniBand and/or RoCE), experience working in Linux based parallel computing environments at scale and experience with containers/orchestration (Docker, Singularity/Apptainer, Kubernetes, Slurm)
Strong customer‑facing and communication skills
Desirable Requirements
NVIDIA certifications (NCA, NCE, DGX) and experience with NVIDIA UFM, Infiniband, and SpectrumX fabrics
Exposure to hybrid cloud or GPU cloud environments and experience with GPU observability/performance profiling tools
Who we are
We believe that each of us has the power to make an impact. That’s why we put our team members at the center of everything we do. If you’re looking for an opportunity to grow your career with some of the best minds and most advanced tech in the industry, we’re looking for you.
Dell Technologies is a unique family of businesses that helps individuals and organizations transform how they work, live and play. Join us to build a future that works for everyone because Progress Takes All of Us.
Dell Technologies is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. Read the full Equal Employment Opportunity Policy here.