
chore: make DistributedRunner use random ports by default #919

Open
Gothos wants to merge 6 commits into main from chore/make-distributedrunner-use-random-ports

@Gothos Gothos commented Mar 11, 2026

Contributor

Makes DistributedRunner use random ports by default: a random port is chosen whenever the master port or a worker port is set to 0. This has a minor TOCTOU (time-of-check to time-of-use) race, since a port can be taken by another process between being selected and being bound, but we retry up to three times, so it should not be a problem in practice. The current default port of 29500 conflicts with the defaults of other packages, such as PyTorch distributed and the accelerate launch command.
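For reviewers, here is a minimal sketch of the pattern described above, not the code in this PR. It assumes ports are picked by binding a throwaway socket to port 0 and then re-binding the chosen port, which is where the TOCTOU window comes from. The names `find_free_port` and `bind_with_retry`, and the `max_attempts` parameter, are hypothetical and only illustrate the retry idea.

```python
import socket


def find_free_port(host: str = "127.0.0.1") -> int:
    """Ask the OS for a currently unused TCP port by binding to port 0."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as probe:
        probe.bind((host, 0))
        # The port is released when the probe closes, so another process
        # could grab it before we bind again (the TOCTOU window).
        return probe.getsockname()[1]


def bind_with_retry(
    requested_port: int = 0,
    host: str = "127.0.0.1",
    max_attempts: int = 3,
) -> socket.socket:
    """Return a listening socket; port 0 means 'pick a random free port'.

    Hypothetical helper: retries only make sense for random ports, so an
    explicitly requested port fails on the first conflict.
    """
    last_error = None
    for _ in range(max_attempts):
        port = find_free_port(host) if requested_port == 0 else requested_port
        sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        try:
            sock.bind((host, port))
            sock.listen()
            return sock
        except OSError as exc:
            # Someone else took the port between selection and bind.
            sock.close()
            last_error = exc
            if requested_port != 0:
                break
    raise RuntimeError(
        f"could not bind a port after {max_attempts} attempts"
    ) from last_error


if __name__ == "__main__":
    server = bind_with_retry(0)
    print("listening on port", server.getsockname()[1])
    server.close()
```

Binding the final socket directly to port 0 and reading the port back with `getsockname()` would avoid the race entirely, but that only works when the process that picks the port is also the one that listens on it.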

@Gothos Gothos requested a review from alex-remade March 20, 2026 11:31
