Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rework batcher for a reservation system #1183

Merged
merged 6 commits into from
Mar 31, 2025

Conversation

rsuderman
Copy link
Contributor

With decode the majority of future steps are known a groupable. Created a basic reservation system so that steps can be reserved and the continuous batcher knows that a future groupable request is coming.

This should be adapted to include grouping multiple responses, and having a "grouping" exist. E.g. multiple parallel requests in flight and grouped together for the multiple hypothesis case.

With decode the majority of future steps are known a groupable. Created
a basic reservation system so that steps can be reserved and the
continuous batcher knows that a future groupable request is coming.

This should be adapted to include grouping multiple responses, and
having a "grouping" exist. E.g. multiple parallel requests in flight and
grouped together for the multiple hypothesis case.
@rsuderman rsuderman requested a review from stbaione March 28, 2025 03:37
@rsuderman rsuderman requested a review from stbaione March 31, 2025 17:46
@rsuderman rsuderman merged commit 9bb48d8 into nod-ai:main Mar 31, 2025
42 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants