-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tentative PR #36
base: dev
Are you sure you want to change the base?
Tentative PR #36
Conversation
…VIDIA's GPUs work?
also added a const kernel_info_t& to the arg list of shader_core_ctx::find_available_hwtid. This allows trace calls within this function to be able to find the SM id.
…-arounds due to subcore scheduling Plus a utility function get_index_vector_from_range_with_wrap_around used to generate the vector of indices.
…ped range of thread ids, thanks to the glamorous WrappableUnsignedRange
…ake subcore round robin scheduling work, but is no longer relevant
This PR now partially models the effect of sub-core scheduling by issuing new warps starting from the last sub-core that was issued a warp. The effect of reg file segregation is not yet modelled. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks like a useful change that kind of got forgotten.
@FJShen - what do we need to do to get this integrated?
This PR is still pending code review. |
@FJShen a few things:
|
Supports round-robin dispatch of CTA warps to each sub-core. Its twin-PR to accel-sim depends on this PR.