Posted 10 days ago
Description
Staff Software System Design Engineer – AI Compiler
Join to apply for the Staff Software System Design Engineer – AI Compiler role at AMD.
We are looking for a dynamic, energetic Lead Compiler Engineer to join our growing AI group. In this role you will be responsible for designing, developing, and optimizing the frontend compiler for the latest neural networks on AMD’s XDNA Neural Processing Units, powering cutting‑edge generative AI models such as Stable Diffusion, SDXL‑Turbo, Llama‑2, etc. Your work will directly impact the efficiency, scalability, and reliability of our machine‑learning applications. If you thrive in a fast‑paced environment and love working on cutting‑edge inference applications, this role is for you.
Key Responsibilities
- Design and implement NPU compiler framework for neural networks.
- Develop hardware‑aware graph optimizations for high‑level ML frameworks such as ONNX.
- Research new algorithms for operator scheduling for efficient inference of latest NN models.
- Interface with ONNX / PyTorch runtime and lower‑level hardware implementation.
- Contribute to high‑performance inference for GenAI workloads such as Llama‑2‑7B, Stable Diffusion, SDXL‑Turbo, etc.
- Work closely with kernel developers, performance architects, and AI researchers.
- Manage CPU and memory resources effectively during model execution.
- Handle resource allocation for ML deployments across different tenants.
- Research heterogeneous mapping of ML operators for maximum efficiency.
- Build tools to track resource utilization, bottlenecks, and anomalies.
- Enable detailed profiling and debugging tools for analyzing ML workload latency.
- Implement rigorous code‑review practices for superior code‑quality assurance.
- Adopt incremental development methodologies for tackling complex projects effectively.
- Foster cross‑functional collaboration to address intricate challenges and drive innovation.
Qualifications
- Strong programming skills in C++ and Python.
- Experience with proprietary/open‑source compiler stack such as TVM, MLIR.
- Experience with ML frameworks (e.g., ONNX, PyTorch) is required.
- Experience with ML models such as CNN, LSTM, LLMs, Diffusion is a must.
- Experience with ONNX, PyTorch runtime integration is a bonus.
- Excellent problem‑solving abilities and a passion for performance optimization.
Academic Credentials
Master’s or PhD degree in Computer Science, Electrical Engineering, or related fields.
Seniority Level
Mid‑Senior level
Employment Type
Full‑time
Job Function
Semiconductor Manufacturing
Equal Opportunity Statement
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee‑based recruitment services. AMD and its subsidiaries are equal‑opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.
Benefits offered are described: AMD benefits at a glance.
#J-18808-Ljbffr