[RLlib] Split AddStates... connectors into 2 pieces (AddTimeDimToBatchAndZeroPad and AddStatesFromEpisodesToBatch) #49835

Merged: 9 commits, Jan 20, 2025

Contributor

@sven1977 sven1977 commented Jan 14, 2025

Split AddStates... connectors into 2 pieces (AddTimeDimToBatchAndZeroPad and AddStatesFromEpisodesToBatch)

  • These might be considered separate operations and should therefore be processed and timed independently.
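The motivation above can be illustrated with a minimal, self-contained sketch. It does not use RLlib's connector classes or API. The function names, the dict-based batch layout, and `max_seq_len` are hypothetical stand-ins chosen only to show why two separate pipeline steps can be timed independently, whereas a single fused step cannot.

```python
import time
from typing import Any, Callable, Dict, List, Tuple

Batch = Dict[str, Any]


def add_time_dim_and_zero_pad(batch: Batch, max_seq_len: int = 4) -> Batch:
    """Hypothetical step 1: chunk a flat list into zero-padded sequences."""
    obs = batch["obs"]
    chunks = [obs[i:i + max_seq_len] for i in range(0, len(obs), max_seq_len)]
    # Remember the true (unpadded) length of each sequence.
    batch["seq_lens"] = [len(c) for c in chunks]
    # Right-pad every chunk with zeros up to max_seq_len.
    batch["obs"] = [c + [0.0] * (max_seq_len - len(c)) for c in chunks]
    return batch


def add_states(batch: Batch, initial_state: float = 0.0) -> Batch:
    """Hypothetical step 2: attach one initial state per sequence."""
    batch["state_in"] = [initial_state for _ in batch["obs"]]
    return batch


def run_pipeline(
    steps: List[Callable[[Batch], Batch]], batch: Batch
) -> Tuple[Batch, Dict[str, float]]:
    """Run each step in order and record its wall-clock time separately."""
    timings: Dict[str, float] = {}
    for step in steps:
        start = time.perf_counter()
        batch = step(batch)
        timings[step.__name__] = time.perf_counter() - start
    return batch, timings


batch, timings = run_pipeline(
    [add_time_dim_and_zero_pad, add_states],
    {"obs": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]},
)
print(batch["obs"])       # two sequences of length 4, the second zero-padded
print(sorted(timings))    # one timing entry per step
```

Because each step is its own callable, the per-step timings show whether padding or state insertion dominates the cost.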

Why are these changes needed?

Related issue number

Checks

  • I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: sven1977 <svenmika1977@gmail.com>
Signed-off-by: sven1977 <svenmika1977@gmail.com>
@sven1977 sven1977 added the rllib (RLlib related issues), rllib-connectorv2 (Connector related issues), and rllib-newstack labels Jan 14, 2025
Collaborator

@simonsays1980 simonsays1980 left a comment

LGTM. It's a good choice to separate concerns here.

@@ -2977,6 +2977,16 @@ def log_result(self, result: ResultDict) -> None:

    @override(Trainable)
    def cleanup(self) -> None:
        # Stop all Learners.
        if hasattr(self, "learner_group") and self.learner_group is not None:
            self.learner_group.shutdown()
Collaborator

This sometimes does not work. At least in my workspace, running on GPU, an algo.stop() did not tear down the BackendExecutor workers.

@sven1977 sven1977 enabled auto-merge (squash) January 16, 2025 13:17
@github-actions github-actions bot added the go label (add ONLY when ready to merge, run all tests) Jan 16, 2025
Signed-off-by: sven1977 <svenmika1977@gmail.com>
@github-actions github-actions bot disabled auto-merge January 17, 2025 11:37
@sven1977 sven1977 enabled auto-merge (squash) January 19, 2025 20:11
@github-actions github-actions bot disabled auto-merge January 19, 2025 20:11
Signed-off-by: sven1977 <svenmika1977@gmail.com>
@sven1977 sven1977 enabled auto-merge (squash) January 20, 2025 10:29
@sven1977 sven1977 merged commit 663e325 into ray-project:master Jan 20, 2025
6 checks passed