GPU Software Development Engineer Job at Advanced Micro Devices, Oregon State

MEw5ZVd3VkZ0TXhJeUZCVU9lZHIzMXFL
  • Advanced Micro Devices
  • Oregon State

Job Description

GPU Software Development Engineer Location Hybrid work in Oregon :

GPU Software Development Engineer
  • Oregon, United States
  • Engineering
  • 65684
  • USD $200,000.00/Yr.
  • USD $300,000.00/Yr.
WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. AMD together we advance_ THE ROLE: We are seeking a talented Machine Learning Kernel Developer to design, develop, and optimize low-level machine learning kernels for AMD GPUs using the ROCm software stack. In this role, you will work on high-impact projects to accelerate AI frameworks and libraries, with a focus on emerging technologies like Large Language Models (LLMs) and other generative AI workloads. THE PERSON: The ideal candidate will have hands-on experience with GPU programming (ROCm or CUDA) and a passion for pushing the boundaries of AI performance. KEY RESPONSIBILITIES:
  • Design and implement highly optimized ML kernels (e.g., matrix operations, attention mechanisms) for AMD GPUs using ROCm.
  • Profile, debug, and tune kernel performance to maximize hardware utilization for AI workloads.
  • Collaborate with ML researchers and framework developers to integrate kernels into AI frameworks (e.g., PyTorch, TensorFlow) and inference engines (e.g., vLLM).
  • Contribute to the ROCm software stack by identifying and resolving bottlenecks in libraries like MIOpen, HIP, or Composable Kernel.
  • Stay updated on the latest AI/ML trends (LLMs, quantization, distributed inference) and apply them to kernel development.
  • Document and communicate technical designs, benchmarks, and best practices.
  • Troubleshoot and resolve issues related to GPU compatibility, performance, and scalability.
REQUIRED EXPERIENCE:
  • 2+ years of experience in GPU kernel development for machine learning (ROCm or CUDA).
  • Proficiency in C/C++ and Python, with experience in performance-critical programming.
  • Strong understanding of ML frameworks (PyTorch, TensorFlow) and GPU-accelerated libraries.
  • Basic knowledge of modern AI technologies (LLMs, transformers, inference optimization).
  • Familiarity with parallel computing, memory optimization, and hardware architectures.
  • Problem-solving skills and ability to work in a fast-paced environment.
PREFERRED EXPERIENCE:
  • Direct experience with AMD ROCm development (HIP, MIOpen, Composable Kernel).
  • Knowledge of LLM-specific optimizations (e.g., FlashAttention, PagedAttention in vLLM).
  • Experience with distributed training/inference or model compression techniques.
  • Contributions to open-source ML projects or GPU compute libraries.
ACADEMIC CREDENTIALS:
  • Bachelor's/Master's in Computer Science, Electrical Engineering, or related field.
LOCATION: Boxborough, Massachusetts #LI-DR1 #LI-HYBRID AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

Job Tags

Full time,

Similar Jobs

The UPS Store Franklin #0738

Retail Sales Associate- 021524 Job at The UPS Store Franklin #0738

Retail Sales Associate PT / FT with flexible availability (Some Saturday & Sundays required) If you have worked at The UPS Store before, definitely call us for this opportunity! The UPS Store of Franklin, Ma seeks associates with outstanding customer service skills... 

Community College of Aurora

Digital Content Developer and Web Manager Job at Community College of Aurora

Digital Content Developer and Web Manager Location Aurora, CO (Centretech area) : Digital Content Developer and Web Manager Community College...  ...manage timelines and deliverables. Strong understanding of SEO best practices, user experience principles, and web... 

Red Hat

Forward Deployed Engineer, AI Inference (vLLM and Kubernetes) Job at Red Hat

 ...complex interaction effects between specific model architectures (e.g., MoE, large context windows), hardware accelerators (NVIDIA GPUs, AMD GPUs, TPUs), and Kubernetes networking (Envoy/ISTIO). Feedback Loop : Act as the "Customer Zero" for our core engineering teams.... 

Simplot

HBF Forklift Operator - Moses Lake Production Campus Job at Simplot

 ...mining, fertilizer manufacturing, farming, ranching and cattle production, and other enterprises related to agriculture. Summary...  ...Operator safely maintains an orderly and efficient cold storage warehouse. There are two plant facilities that make up the Moses Lake... 

WelcometoSSM

Digital Sports Strategy & Innovation Director Job at WelcometoSSM

 ...passionate to be accountable for the development and execution of the sports product strategy and strategic initiatives, in partnership with...  ...leading qualitative and quantitative analysis of internal, research, and competitive data to assess strategic opportunities. Based...