A merge of a bunch of different changes for pytorch. by AWoloszyn · Pull Request #14 · ROCm/hrx-system

AWoloszyn · 2026-05-28T18:02:11Z

This is a roll-up of a significant number of changes that are required for getting pytorch running on top of hrx.

stellaraccident

I don't think we need to carry the compiler support forward (likely you just got this from the merge but it was deleted in this branch).

As discussed, since you are flushing your perma-branches, I am ok with this going in as-is. But this needs unit and/or CTS tests, especially for things like the ELF/CCOB manipulation. Please file an issue so we don't lose track and can get this test coverage in. Not blocking this one so we can get out of branch debt.

stellaraccident · 2026-05-29T00:00:11Z

@@ -0,0 +1,69 @@
+// Copyright 2026 The HRX Authors


I think we dropped this entirely (and the cc file).

This is a roll-up of a significant number of changes that are required for getting pytorch running on top of hrx.

zjgarvey

I tried to read the bits which seemed important for me to understand better: e.g., kernel launching, and command buffer/stream interplay handling.

I have a few other questions, but could probably wait.

zjgarvey · 2026-06-01T16:16:09Z

+
+  while (remaining > 0) {
+    size_t this_chunk = remaining < chunk_size ? remaining : chunk_size;
+    iree_status_t iree_status = iree_hal_device_transfer_h2d(


Isn't this API fully synchronous? Why is this section labelled Async Stream Transfers? I guess the same below for d2h. I wanted to understand how async transfers worked between host and device and I'm kind of confused here. Are the async transfers just not implemented yet?

Maybe this is a terminology question more than anything: does "host" literally mean "host thread" so it must inherently block on the host thread? Or can some separate thread on the CPU do the transfer and signal to the host thread when it's done? Not sure how important this is unless we are paging data for a training job?

So I will answer both of the questions in ordedr

Are the async transfers just not implemented yet? << Yes this is the case. This was originally used by the streaming layer, but then we ended up writing it ourselves in the streaming layer. (There is more work to be done to move more of binding/common to use the libhrx functions instead of iree functions directly.

"host" here means "memory residing in system RAM not GPU ram"

zjgarvey · 2026-06-01T16:40:57Z

+      stream->submitted_value = signal_value;
+    }
+  } else {
+    status = iree_hal_command_buffer_dispatch(


This writes the kernel launch into the stream->command_buffer, correct?

I'm trying to understand what the lock stream->mutex is being used for (It's only used in stream_begin and stream_flush), and since this isn't under lock, I'm wondering if it possible that this could be interleaved with a stream_flush from a different thread to result in a UAF/nullptr deref or something?

So it is legal to send commands to a hip stream from multiple threads at once. So, this is SOME work in the API to handle this, but it is incomplete.
I have filed #26 to add tests and fix the implementation here.

stellaraccident approved these changes May 29, 2026

View reviewed changes

AWoloszyn mentioned this pull request Jun 1, 2026

Add tests for elf/ccob loading. #23

Open

A merge of a bunch of different changes for pytorch.

db08265

This is a roll-up of a significant number of changes that are required for getting pytorch running on top of hrx.

AWoloszyn force-pushed the users/awoloszyn/rollup-for-pytorch branch from 03ca5f8 to 6ff95b5 Compare June 1, 2026 13:11

AWoloszyn added 2 commits June 1, 2026 06:11

Remove additional mapping field that showed up in rebase.

6ff95b5

Removed hrx_compiler.

0b0f82a

AWoloszyn marked this pull request as ready for review June 1, 2026 14:53

zjgarvey reviewed Jun 1, 2026

View reviewed changes

Removed the src/streaming since it has moved into src/binding/common

ac343aa

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A merge of a bunch of different changes for pytorch.#14

A merge of a bunch of different changes for pytorch.#14
AWoloszyn wants to merge 4 commits into
mainfrom
users/awoloszyn/rollup-for-pytorch

AWoloszyn commented May 28, 2026 •

edited by stellaraccident

Loading

Uh oh!

stellaraccident left a comment

Uh oh!

stellaraccident May 29, 2026

Uh oh!

AWoloszyn Jun 1, 2026

Uh oh!

zjgarvey left a comment

Uh oh!

Uh oh!

Uh oh!

zjgarvey Jun 1, 2026

Uh oh!

zjgarvey Jun 1, 2026

Uh oh!

AWoloszyn Jun 1, 2026

Uh oh!

zjgarvey Jun 1, 2026

Uh oh!

AWoloszyn Jun 1, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AWoloszyn commented May 28, 2026 • edited by stellaraccident Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stellaraccident left a comment

Choose a reason for hiding this comment

Uh oh!

stellaraccident May 29, 2026

Choose a reason for hiding this comment

Uh oh!

AWoloszyn Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

zjgarvey left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

zjgarvey Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

zjgarvey Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

AWoloszyn Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

zjgarvey Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

AWoloszyn Jun 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AWoloszyn commented May 28, 2026 •

edited by stellaraccident

Loading

AWoloszyn Jun 1, 2026 •

edited

Loading