@paragao paragao commented Jun 5, 2025

No previous issue, new feature.

Added a new AWS CloudFormation template to the Architecture/Common folder that deploys a solution to scale the compute nodes of an instance group up or down.

The template deploys Amazon EventBridge rules that trigger an AWS Lambda function to update the node count of an instance group. Each rule is scheduled with a cron expression: one rule scales up and another scales down.
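For readers who want a concrete picture of the flow described above, here is a minimal sketch of what such a Lambda handler could look like. It is not the function shipped in this PR. It assumes the instance group belongs to a SageMaker HyperPod cluster, that the handler uses the boto3 SageMaker `describe_cluster` and `update_cluster` calls, and that each EventBridge rule passes a constant JSON input. The event keys (`cluster_name`, `instance_group`, `target_count`), the helper `build_update_request`, and the choice of which fields to copy are all hypothetical.

```python
from typing import Any, Dict, Optional

# Assumption: these fields are accepted by the UpdateCluster instance group spec.
_REQUIRED_FIELDS = ("InstanceGroupName", "InstanceType", "LifeCycleConfig", "ExecutionRole")
_OPTIONAL_FIELDS = ("ThreadsPerCore", "InstanceStorageConfigs")


def build_update_request(cluster: Dict[str, Any], group_name: str, target_count: int) -> Dict[str, Any]:
    """Build update_cluster keyword arguments that change one group's node count.

    Every group is resubmitted with its existing TargetCount so that only the
    named group changes. Read-only fields such as CurrentCount are dropped.
    """
    specs = []
    found = False
    for group in cluster["InstanceGroups"]:
        spec = {key: group[key] for key in _REQUIRED_FIELDS}
        spec.update({key: group[key] for key in _OPTIONAL_FIELDS if key in group})
        if group["InstanceGroupName"] == group_name:
            spec["InstanceCount"] = target_count
            found = True
        else:
            spec["InstanceCount"] = group["TargetCount"]
        specs.append(spec)
    if not found:
        raise ValueError(f"instance group {group_name!r} not found in cluster")
    return {"ClusterName": cluster["ClusterName"], "InstanceGroups": specs}


def handler(event: Dict[str, Any], context: Any, client: Optional[Any] = None) -> Dict[str, Any]:
    """Entry point invoked by the scale-up or scale-down EventBridge rule.

    Hypothetical event shape, set as the constant input of each rule:
    {"cluster_name": "...", "instance_group": "...", "target_count": 4}
    """
    if client is None:
        import boto3  # imported lazily so the helper can be exercised without AWS access

        client = boto3.client("sagemaker")
    cluster = client.describe_cluster(ClusterName=event["cluster_name"])
    request = build_update_request(cluster, event["instance_group"], int(event["target_count"]))
    client.update_cluster(**request)
    return {"instance_group": event["instance_group"], "target_count": int(event["target_count"])}
```

Under these assumptions, the scale-up rule and the scale-down rule would differ only in their schedule and in the `target_count` they send. An EventBridge schedule such as `cron(0 8 ? * MON-FRI *)` is one illustrative possibility, not a value taken from the template.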

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

KeitaW previously requested changes Jun 5, 2025

@KeitaW KeitaW left a comment

@paragao paragao left a comment

Moved the file and updated README.md as requested.

@KeitaW KeitaW left a comment

maybe the previous file still remains in the original directory


## HyperPod cluster status change / node health event notifications

This template deploys a stack to receive human-readable email notifications for HyperPod cluster status changes and node health events. See the [workshop page](https://catalog.workshops.aws/sagemaker-hyperpod/en-US/07-tips-and-tricks/26-event-bridge) for more details.

Kindly remove this file + directory.

@KeitaW these files were already there and are not part of this PR.

If you want, I can open a separate PR to move the files that were originally there, since they have not been modified here. I did not move them in this PR because other assets, such as workshops, might link to those files, and moving them would break those links.

ah okay, my bad.

@KeitaW KeitaW self-requested a review June 9, 2025 22:39
@KeitaW KeitaW dismissed their stale review June 9, 2025 22:40

The file location LGTM.

@KeitaW KeitaW requested review from amanshanbhag and nghtm June 9, 2025 22:41

nghtm commented Aug 22, 2025

Let's close or merge this PR, please. Status?

4 participants