Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[shardformer] merge shardformer to main #4152

Merged
merged 49 commits into from
Jul 4, 2023
Merged

Commits on Jun 26, 2023

  1. [shardformer] init shardformer code structure (#3731)

    * init shardformer code structure
    
    * add implement of sharder (inject and replace)
    
    * add implement of replace layer to colossal layer
    
    * separate different layer policy, add some notion
    
    * implement 1d and 2d slicer, can tell col or row
    
    * fix bug when slicing and inject model
    
    * fix some bug; add inference test example
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    604a213 View commit details
    Browse the repository at this point in the history
  2. [shardformer]: Feature/shardformer, add some docstring and readme (#3816

    )
    
    * init shardformer code structure
    
    * add implement of sharder (inject and replace)
    
    * add implement of replace layer to colossal layer
    
    * separate different layer policy, add some notion
    
    * implement 1d and 2d slicer, can tell col or row
    
    * fix bug when slicing and inject model
    
    * fix some bug; add inference test example
    
    * add share weight and train example
    
    * add train
    
    * add docstring and readme
    
    * add docstring for other files
    
    * pre-commit
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    ffacf0f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    69d3daa View commit details
    Browse the repository at this point in the history
  4. [shardformer] refactored the user api (#3828)

    * [shardformer] refactored the user api
    
    * polish code
    FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    0470f1b View commit details
    Browse the repository at this point in the history
  5. [shardformer] update readme with modules implement doc (#3834)

    * update readme with modules content
    
    * remove img
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    051e970 View commit details
    Browse the repository at this point in the history
  6. [shardformer] add Dropout layer support different dropout pattern (#3856

    )
    
    * add dropout layer, add dropout test
    
    * modify seed manager as context manager
    
    * add a copy of col_nn.layer
    
    * add dist_crossentropy loss; separate module test
    
    * polish the code
    
    * fix dist crossentropy loss
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    3e840f7 View commit details
    Browse the repository at this point in the history
  7. update README (#3909)

    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    bf9c2fd View commit details
    Browse the repository at this point in the history
  8. [shardformer] add gpt2 policy and modify shard and slicer to support (#…

    …3883)
    
    * add gpt2 policy and modify shard and slicer to support
    
    * remove unused code
    
    * polish code
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    551fec3 View commit details
    Browse the repository at this point in the history
  9. [shardformer] Align bert value (#3907)

    * add bert align test, fix dist loss bug
    
    * forward and backward align
    
    * add ignore index
    
    * add shardformer CI
    
    * add gather_output optional for user in shardconfig
    
    * update readme with optional gather_ouput
    
    * add dist crossentropy loss test, remove unused files
    
    * remove unused file
    
    * remove unused file
    
    * rename the file
    
    * polish code
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    e5bc7e3 View commit details
    Browse the repository at this point in the history
  10. [shardformer] Unit test (#3928)

    * fix bug in slicer, add slicer unit test
    
    * add dropout test
    
    * use pid as dropout seed
    
    * updata dropout test with local pattern
    
    * ad todo
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    661dc3b View commit details
    Browse the repository at this point in the history
  11. [shardformer] Add dropout layer in shard model and refactor policy api (

    #3949)
    
    * add dist dropout in model
    
    * update docstring and bert policy with dropout
    
    * refactor basepolicy and sharded, update bert
    
    * update format
    
    * update gpt2 policy
    
    * update bert policy
    
    * remove unused code
    
    * update readme for new policy usage
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    702513a View commit details
    Browse the repository at this point in the history
  12. [shardformer] support llama model using shardformer (#3969)

    adjust layer attr
    wukong1992 authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    17d1607 View commit details
    Browse the repository at this point in the history
  13. [shardformer] shardformer support t5 model (#3994)

    test t5
    wukong1992 authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    e849d1b View commit details
    Browse the repository at this point in the history
  14. [Shardformer] Downstream bert (#3979)

    * add dist dropout in model
    
    * update docstring and bert policy with dropout
    
    * refactor basepolicy and sharded, update bert
    
    * update format
    
    * update gpt2 policy
    
    * update bert policy
    
    * remove unused code
    
    * update readme for new policy usage
    
    * add downstream model of bert
    
    * remove unused code
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    735e44b View commit details
    Browse the repository at this point in the history
  15. [shardformer] fix an error in readme (#3988)

    * fix an error in readme
    
    * simplify code
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    73cacb7 View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    45a3110 View commit details
    Browse the repository at this point in the history
  17. [shardformer] Refactor shardformer api (#4001)

    * fix an error in readme
    
    * simplify code
    
    * refactor shardformer
    
    * add todo
    
    * remove slicer
    
    * resolve code review
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    18396e7 View commit details
    Browse the repository at this point in the history
  18. [shardformer] integrated linear 1D with dtensor (#3996)

    * [shardformer] integrated linear 1D with dtensor
    
    * polish code
    FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    579b617 View commit details
    Browse the repository at this point in the history
  19. integrate with dist layer (#4011)

    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    bdc405e View commit details
    Browse the repository at this point in the history
  20. [shardformer] refactored embedding and dropout to parallel module (#4013

    )
    
    * [shardformer] refactored embedding and dropout to parallel module
    
    * polish code
    FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    2c366e3 View commit details
    Browse the repository at this point in the history
  21. Configuration menu
    Copy the full SHA
    eaa46d7 View commit details
    Browse the repository at this point in the history
  22. add vocabembedding layer

    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    60eb380 View commit details
    Browse the repository at this point in the history
  23. support bert with new api

    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    90e1a0a View commit details
    Browse the repository at this point in the history
  24. Configuration menu
    Copy the full SHA
    38ceded View commit details
    Browse the repository at this point in the history
  25. [shardformer] fix bert and gpt downstream with new api (#4024)

    * fix bert downstream with new api
    
    * remove comment line
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    c982769 View commit details
    Browse the repository at this point in the history
  26. Configuration menu
    Copy the full SHA
    b2c5dd0 View commit details
    Browse the repository at this point in the history
  27. Configuration menu
    Copy the full SHA
    8219d96 View commit details
    Browse the repository at this point in the history
  28. [shardformer] add gpt2 test and layer class refactor (#4041)

    * add gpt2 test and layer class refactor
    
    * add dropout in gpt2 policy
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    0113097 View commit details
    Browse the repository at this point in the history
  29. [shardformer] adapted T5 and LLaMa test to use kit (#4049)

    * [shardformer] adapted T5 and LLaMa test to use kit
    
    * polish code
    FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    ac3aef3 View commit details
    Browse the repository at this point in the history
  30. Configuration menu
    Copy the full SHA
    e5d4a87 View commit details
    Browse the repository at this point in the history
  31. support kit use for bert/gpt test (#4055)

    * support kit use for bert test
    
    * support kit test for gpt2
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    d5d9178 View commit details
    Browse the repository at this point in the history
  32. [shardformer] support module saving and loading (#4062)

    * [shardformer] support module saving and loading
    
    * polish code
    FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    9436f73 View commit details
    Browse the repository at this point in the history
  33. [shardformer] add linearconv1d test (#4067)

    * add linearconv1d test
    
    * add linearconv1d test
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    8108c35 View commit details
    Browse the repository at this point in the history
  34. Configuration menu
    Copy the full SHA
    a484c71 View commit details
    Browse the repository at this point in the history
  35. [shardformer] Add layernorm (#4072)

    * add layernorm to bert
    
    * add layernorm test
    
    * add layernorm test with load state dict
    
    * add use_mixedfusedLN in shard config
    
    * refactor policy to support fused_layernorm
    FoolPlayer authored and FrankLeeeee committed Jun 26, 2023
    Configuration menu
    Copy the full SHA
    12801e8 View commit details
    Browse the repository at this point in the history
  36. [test] fixed tests failed due to dtensor change (#4082)

    * [test] fixed tests failed due to dtensor change
    
    * polish code
    FrankLeeeee authored Jun 26, 2023
    Configuration menu
    Copy the full SHA
    d88844c View commit details
    Browse the repository at this point in the history
  37. Configuration menu
    Copy the full SHA
    4e0db99 View commit details
    Browse the repository at this point in the history

Commits on Jun 27, 2023

  1. [shardformer] shardformer support opt models (#4091)

    * [shardformer] shardformer support opt models
    
    * [shardformer] shardformer support opt models, fix
    
    * [shardformer] shardformer support opt models, fix
    
    * [shardformer] shardformer support opt models, fix
    flybird11111 authored Jun 27, 2023
    Configuration menu
    Copy the full SHA
    a7433a0 View commit details
    Browse the repository at this point in the history

Commits on Jun 28, 2023

  1. [shardformer] support vision transformer (#4096)

    * first v of vit shardformer
    
    * keep vit
    
    * update
    
    * vit shard add vitattention vitlayer
    
    * update num head shard para
    
    * finish test for vit
    
    * add new_model_class & postprocess
    
    * add vit readme
    
    * delete old files & fix the conflict
    
    * fix sth
    klhhhhh authored Jun 28, 2023
    Configuration menu
    Copy the full SHA
    ad604f7 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    8b0930c View commit details
    Browse the repository at this point in the history

Commits on Jun 30, 2023

  1. Configuration menu
    Copy the full SHA
    92e669e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    8d3f077 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    60d2cad View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    26ecfd7 View commit details
    Browse the repository at this point in the history
  5. [shardformer] write an shardformer example with bert finetuning (#4126)

    * [shardformer] add benchmark of shardformer
    
    * [shardformer] add benchmark of shardformer
    flybird11111 authored Jun 30, 2023
    Configuration menu
    Copy the full SHA
    b6f4e05 View commit details
    Browse the repository at this point in the history

Commits on Jul 3, 2023

  1. [shardformer] refactored some doc and api (#4137)

    * [shardformer] refactored some doc and api
    
    * polish code
    FrankLeeeee authored Jul 3, 2023
    Configuration menu
    Copy the full SHA
    1b4a901 View commit details
    Browse the repository at this point in the history

Commits on Jul 4, 2023

  1. [shardformer] made tensor parallelism configurable (#4144)

    * [shardformer] made tensor parallelism configurable
    
    * polish code
    FrankLeeeee authored Jul 4, 2023
    Configuration menu
    Copy the full SHA
    f8dcf9d View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d1db043 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    dd9fe39 View commit details
    Browse the repository at this point in the history