Deep Learning Compiler Engineer
Huawei Technologies
Job description
Huawei Canada has an immediate permanent opening for a Deep Learning Compiler Engineer.
About the team:
Initially founded in 1991 as Huawei's ASIC Design Center, the IC Lab is a leading global fabless semiconductor lab. This lab delivers trusted, cutting-edge semiconductor products and services for smart devices, contributing to smart home and mobility solutions. The local team in Canada specializes in semiconductors and chipset solutions.
About the job:
Create customer value by developing new capabilities that improve inference times, optimize memory usage, and implement other state-of-the-art algorithms. You will use your expertise to drive optimization at both the model and system level.
Design, develop, and maintain optimized code in C++ and Python to implement and enhance deep learning inference systems. You will contribute to both low-level performance-critical components and appropriate high-level abstractions that define system behavior.
Drive performance improvements by integrating and optimizing deep learning systems for custom hardware accelerators. Your work will enable significant gains in speed, efficiency, and real-time performance for wireless applications.
Participate actively in code reviews, providing constructive feedback and ensuring that the codebase remains efficient, scalable, and conducive to high-velocity development. This is a collaborative environment where you’ll both share knowledge and grow from the feedback of your peers.
Work closely with a team of experienced engineers who will offer mentorship and guidance, allowing you to further refine your skills while contributing to challenging and innovative projects.
Job requirements
About the ideal candidate:
Undergraduate degree in Computer Science, Software Engineering, or a related field, and up to 5 years of professional experience in software engineering or a closely related domain.
Proficient in C++ and comfortable with Python.
Hands-on experience with large software systems and a strong understanding of algorithms, data structures, and performance profiling techniques.
Strong problem-solving skills, with the ability to think critically and independently to tackle complex challenges.
Ability to communicate complex ideas clearly and work effectively with cross-functional teams, enabling you to thrive in a collaborative and fast-paced environment.
Experience working with deep learning frameworks (such as TensorFlow, PyTorch, or others) and optimizing models for specialized hardware, including GPUs, TPUs, or AI accelerators, is an asset.