Issues: microsoft/DeepSpeed
DeepSpeed with ZeRO3 strategy cannot build 'fused_adam'
bug
Something isn't working
training
#6892
opened Dec 18, 2024 by
LeonardoZini
How can DeepSpeed be configured to prevent the merging of parameter groups
#6878
opened Dec 16, 2024 by
CLL112
How do I know if stage-3 is a success by using deepspeed?
training
#6877
opened Dec 16, 2024 by
hwhyyds
[BUG] Cannot use --hostfile to start multi-node training in Docker.
bug
Something isn't working
training
#6875
opened Dec 16, 2024 by
Ind1x1
Windows wheel build error - Tried everything with all requirements you have
build
Improvements to the build and testing systems.
windows
#6871
opened Dec 14, 2024 by
FurkanGozukara
[BUG] Invalidate trace cache @ step 10: expected module 11, but got module 19
bug
Something isn't working
training
#6870
opened Dec 14, 2024 by
yafuly
[BUG] Mismatch of model parameters when using Sequence Parallel
bug
Something isn't working
training
#6868
opened Dec 13, 2024 by
chetwin-character
Unable to Install DeepSpeed on Windows using pip
windows
#6865
opened Dec 13, 2024 by
H4CK3R-5M4CK3R
[BUG] When fine-tuning an LLM, the following error occurs after training for some time: self.optimizer.param_groups[param_group_id]['params'] = [] IndexError: list index out of range
bug
Something isn't working
training
#6857
opened Dec 12, 2024 by
tdtgi
[BUG] Unable to Use quantization_setting for Customizing MoQ in DeepSpeed Inference
bug
Something isn't working
compression
#6853
opened Dec 11, 2024 by
cyx96
[QUESTIONS]:Some questions about running Domino
enhancement
New feature or request
#6851
opened Dec 11, 2024 by
yingtongxiong
Opinion on Refactoring Ulysses
enhancement
New feature or request
#6843
opened Dec 9, 2024 by
Eugene29
[BUG] inference ops unit tests are failing
bug
Something isn't working
inference
#6839
opened Dec 9, 2024 by
oelayan7
[REQUEST] domino integration to nanotron
enhancement
New feature or request
#6835
opened Dec 7, 2024 by
NouamaneTazi
zero-3 cpuadam is so slow
enhancement
New feature or request
#6834
opened Dec 7, 2024 by
SeunghyunSEO
[BUG] offload optimizer states in zero3
bug
Something isn't working
training
#6833
opened Dec 7, 2024 by
Hanqer
[BUG] using deepspeed slower inference time
bug
Something isn't working
inference
#6818
opened Dec 4, 2024 by
williamlin0518
[BUG] DeepSpeed accuracy issue for torch.compile if activation checkpoint function not compiler disabled
bug
Something isn't working
training
#6811
opened Dec 1, 2024 by
NirSonnenschein