Adding _set_gradient_checkpointing for compatibility

#22

by vriveras - opened Sep 19, 2023

base: refs/heads/main ← from: refs/pr/22

Adding _set_gradient_checkpointing for compatibility (commit a30a9312)

vriveras

Sep 19, 2023

Adding _set_gradient_checkpointing for compatibility when finetuning the model.
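For readers unfamiliar with the hook named in the title: in `transformers` releases from around the time of this PR, `gradient_checkpointing_enable()` walked the module tree with `self.apply(...)` and called a model-defined `_set_gradient_checkpointing(module, value)` on each submodule. The thread does not show the actual diff, so the following is only a minimal sketch of that pattern in plain PyTorch. `Block` and `TinyModel` are hypothetical stand-ins, not the classes in `modeling_mixformer_sequential.py`.

```python
from functools import partial

import torch
from torch import nn
from torch.utils.checkpoint import checkpoint


class Block(nn.Module):
    """Hypothetical stand-in for one transformer block."""

    def __init__(self, dim: int) -> None:
        super().__init__()
        self.linear = nn.Linear(dim, dim)
        self.gradient_checkpointing = False  # toggled by the hook below

    def _inner(self, x: torch.Tensor) -> torch.Tensor:
        return torch.relu(self.linear(x))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # The flag alone does nothing: forward must actually route
        # through torch.utils.checkpoint for activations to be dropped.
        if self.gradient_checkpointing and self.training:
            return checkpoint(self._inner, x, use_reentrant=False)
        return self._inner(x)


class TinyModel(nn.Module):
    """Hypothetical model exposing the hook by the name used in the PR title."""

    supports_gradient_checkpointing = True

    def __init__(self, dim: int = 8, n_layers: int = 2) -> None:
        super().__init__()
        self.layers = nn.ModuleList([Block(dim) for _ in range(n_layers)])

    def _set_gradient_checkpointing(self, module: nn.Module, value: bool = False) -> None:
        # Called once per submodule; only blocks that know the flag are touched.
        if isinstance(module, Block):
            module.gradient_checkpointing = value

    def gradient_checkpointing_enable(self) -> None:
        # Assumed to mirror the older transformers behaviour: apply the hook
        # to every submodule with value=True.
        self.apply(partial(self._set_gradient_checkpointing, value=True))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            x = layer(x)
        return x


model = TinyModel()
model.gradient_checkpointing_enable()
model.train()
model(torch.randn(3, 8)).sum().backward()
```

Note that the hook only sets a flag. Whether memory is actually saved depends on the forward pass consulting that flag and wrapping the block in `checkpoint`.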

vriveras changed pull request title from Update modeling_mixformer_sequential.py to Adding _set_gradient_checkpointing for compatibility Sep 19, 2023

ziniuli

Sep 21, 2023

Hi,

I find that the updated code for gradient checkpointing does not work in my case, i.e., the memory usage is not reduced.

I wonder whether this code is tested in practice.

Best regards,
Ziniu
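The thread does not record how memory usage was measured or why it did not drop. One rough way to check whether checkpointing is active at all, independent of GPU memory counters, is to count forward calls: a checkpointed layer is re-run during the backward pass, so its forward should execute more than once per training step. The sketch below uses a hypothetical `CountingLayer` in plain PyTorch and is not taken from the PR.

```python
import torch
from torch import nn
from torch.utils.checkpoint import checkpoint


class CountingLayer(nn.Module):
    """Hypothetical layer that records how often its forward runs."""

    def __init__(self, dim: int) -> None:
        super().__init__()
        self.linear = nn.Linear(dim, dim)
        self.calls = 0

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        self.calls += 1
        return torch.tanh(self.linear(x))


def forward_calls_per_step(use_checkpointing: bool) -> int:
    """Run one forward/backward step and return the number of forward calls."""
    layer = CountingLayer(4)
    x = torch.randn(2, 4)
    if use_checkpointing:
        # Activations are not stored; they are recomputed in backward.
        out = checkpoint(layer, x, use_reentrant=False)
    else:
        out = layer(x)
    out.sum().backward()
    return layer.calls


print("without checkpointing:", forward_calls_per_step(False))
print("with checkpointing:", forward_calls_per_step(True))
```

If the count is the same in both configurations for a real model, the checkpointing flag is probably being set without the forward pass acting on it.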

teknium

Sep 22, 2023

A fix for this would be very nice, please.

hiyouga

Sep 27, 2023

Requesting a fix for this, @gugarosa.

gugarosa

Microsoft org Oct 3, 2023

Could you please update your file against the latest commit? As soon as the merge conflict is resolved, I will merge this PR.

vriveras

Oct 3, 2023

I have rebased the PR.

vriveras

Oct 17, 2023

@gugarosa will you be able to merge this?

gugarosa changed pull request status to merged Oct 17, 2023
