Platform Power Management Architect – AMD Instinct™ GPUs

About the position

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

The Role

AMD is seeking a Platform Power Management Architect to define and drive end‑to‑end power architecture for AMD Instinct™ data center GPU platforms. This role is responsible for system‑level power strategy, spanning silicon capabilities, board‑level power delivery, firmware, Linux power management, and rack‑scale deployment considerations. The architect will work cross‑functionally with silicon, firmware, platform hardware, Linux kernel/ROCm software, and data center system teams to optimize performance per watt, ensure power integrity and reliability, and deliver accurate power projections for current and next‑generation Instinct platforms. Exposure to scale‑up or scale‑out networking fabrics is highly desirable.

Responsibilities

  • Define the end‑to‑end power management architecture for AMD Instinct data center GPUs, spanning silicon, package, board, system, and rack levels.
  • Own platform‑level power concepts including power states, power limits, throttling policies, telemetry, and power‑performance trade‑offs.
  • Act as the technical authority for power‑related architectural decisions across multiple Instinct programs.
  • Lead power rail architecture and optimization, including rail partitioning, sequencing, voltage/frequency domains, and efficiency trade‑offs.
  • Partner with hardware and silicon teams to optimize VR efficiency, transient response, and steady‑state power delivery under AI/HPC workloads.
  • Influence silicon and platform features to improve power scalability and robustness across SKUs and deployment configurations.
  • Define requirements and architecture for Linux‑based power management, including interactions with kernel frameworks, drivers, firmware, and ROCm components.
  • Collaborate with software teams on power telemetry, control interfaces, policy enforcement, and observability.
  • Ensure alignment between platform power capabilities and software‑visible controls for data center operators.
  • Develop and own power projection methodologies for GPUs, platforms, and multi‑GPU systems across representative workloads.
  • Provide power projections and sensitivity analyses to support product planning, system design, customer engagements, and thermal/rack planning.
  • Validate projections against lab data and silicon characterization results, closing gaps between model and reality.
  • Incorporate scale‑up (e.g., high‑bandwidth GPU interconnects) and scale‑out (e.g., networking fabrics) considerations into platform power strategy.
  • Understand and influence the power impact of interconnects, NICs, switches, and fabric topologies in large GPU clusters.
  • Partner with fabric and system architects to ensure coherent power budgeting at node and rack scale.
  • Drive alignment across silicon, firmware, hardware, Linux, ROCm, platform, and data center solution teams.
  • Produce clear architectural documentation, power models, and executive‑level summaries.
  • Represent platform power architecture in technical reviews with senior leadership and external partners.

Requirements

  • Expert-level background in platform, system, or silicon architecture with significant focus on power management.
  • Strong understanding of power delivery networks (PDN), voltage regulation, rail optimization, and power integrity fundamentals.
  • Hands‑on experience with Linux power management concepts, kernel/driver interactions, or system‑level power control.
  • Experience building or consuming power models and projections for complex systems.
  • Ability to work across hardware and software boundaries and influence architectural decisions.
  • Bachelor’s degree in Electrical Engineering, Computer Engineering, Computer Science, or related field (Master’s or PhD preferred).
  • Solid understanding of runtime power management (RTPM), ACPI, and suspend‑to‑idle / S0ix flows.

Nice-to-haves

  • Experience with data center GPUs, accelerators, or high‑performance SoCs.
  • Exposure to scale‑up GPU fabrics and/or scale‑out data center networking.
  • Familiarity with telemetry, power capping, workload‑aware power management, or fleet‑level power optimization.
  • Experience presenting architectural trade‑offs to senior technical leadership.
  • Background in HPC or AI training/inference systems.

Benefits

  • AMD benefits at a glance.