Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve FP6-LLM 2+4bit weight splitting + user API #279

Merged
merged 24 commits into from
May 26, 2024

Commits on May 22, 2024

  1. add annotation

    gau-nernst committed May 22, 2024
    Configuration menu
    Copy the full SHA
    36add71 View commit details
    Browse the repository at this point in the history

Commits on May 25, 2024

  1. Configuration menu
    Copy the full SHA
    18824a7 View commit details
    Browse the repository at this point in the history
  2. add weight splitting logic

    gau-nernst committed May 25, 2024
    Configuration menu
    Copy the full SHA
    58bcf2f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    dd84cc2 View commit details
    Browse the repository at this point in the history
  4. update from fp6_quant

    gau-nernst committed May 25, 2024
    Configuration menu
    Copy the full SHA
    5e44f1c View commit details
    Browse the repository at this point in the history

Commits on May 26, 2024

  1. merge to_tc_float6_e3m2

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    bccc4f6 View commit details
    Browse the repository at this point in the history
  2. add more optimized version

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    cfa304c View commit details
    Browse the repository at this point in the history
  3. add some notes

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    ed9cb1d View commit details
    Browse the repository at this point in the history
  4. add from_tc_float6_e3m2

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    f609b6f View commit details
    Browse the repository at this point in the history
  5. add some docs

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    dec40af View commit details
    Browse the repository at this point in the history
  6. make fp6_llm.py

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    5e5dfdc View commit details
    Browse the repository at this point in the history
  7. add test for linear

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    5bdcd50 View commit details
    Browse the repository at this point in the history
  8. fix fp6 llm

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    708d485 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    ce0ffc1 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    59e39ce View commit details
    Browse the repository at this point in the history
  11. simplify further

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    cb43a7b View commit details
    Browse the repository at this point in the history
  12. fix typing for old python

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    b90938a View commit details
    Browse the repository at this point in the history
  13. add test

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    66cbe1d View commit details
    Browse the repository at this point in the history
  14. Configuration menu
    Copy the full SHA
    5ed6767 View commit details
    Browse the repository at this point in the history
  15. skip fp6_llm on cpu

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    fa08a3d View commit details
    Browse the repository at this point in the history
  16. improve error message

    gau-nernst committed May 26, 2024
    Configuration menu
    Copy the full SHA
    6945498 View commit details
    Browse the repository at this point in the history
  17. Configuration menu
    Copy the full SHA
    70b5a4c View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    d6c6b6a View commit details
    Browse the repository at this point in the history
  19. Configuration menu
    Copy the full SHA
    d798eaf View commit details
    Browse the repository at this point in the history