Skip to content

Conversation

@BlueCrescent
Copy link
Collaborator

What does this PR do?

Adds support for multi stage pipeline parallelism schedules, in particular interleaved 1F1B.

General Changes

  • TBD

Breaking Changes

  • TBD

Checklist before submitting final PR

  • My PR is minimal and addresses one issue in isolation
  • I have merged the latest version of the target branch into this feature branch
  • I have reviewed my own code w.r.t. correct implementation, missing type hints, proper documentation, etc.
  • I have run a sample config for model training
  • I have checked that all tests run through (python tests/tests.py)
  • I have updated the internal changelog (CHANGELOG_DEV.md)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants