add int4 gptq and eval #116

Merged
HDCharles merged 6 commits into main on Apr 3, 2024

HDCharles
Contributor

Summary: Adds int4 GPTQ quantization and eval support. Also fixes a few bugs in how the activation is quantized, both during the GPTQ calculation and when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

ghstack-source-id: d29b6d73c90dec5171e12938afee25e5f42e042d
Pull Request resolved: #115
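
For readers unfamiliar with the term, the int4 part of the summary can be illustrated with a minimal sketch of group-wise 4-bit quantization. This is not the code added in this PR and it does not implement GPTQ's error compensation; it only shows plain round-to-nearest quantization to the 16 levels an int4 value can hold. The function names, the asymmetric min/max scheme, and the default group size are assumptions chosen for illustration.

```python
def quantize_int4_groupwise(weights, group_size=4):
    """Quantize a flat list of floats to 4-bit codes (0..15).

    Hypothetical helper for illustration only: each group of `group_size`
    values gets its own scale and zero point (asymmetric min/max scheme).
    """
    if group_size <= 0 or len(weights) % group_size != 0:
        raise ValueError("len(weights) must be a multiple of group_size")
    qmin, qmax = 0, 15  # the 16 levels representable in 4 bits
    codes, scales, zeros = [], [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        lo, hi = min(group), max(group)
        # Fall back to 1.0 when the group is constant, to avoid dividing by zero.
        scale = (hi - lo) / (qmax - qmin) or 1.0
        scales.append(scale)
        zeros.append(lo)
        for w in group:
            q = round((w - lo) / scale)
            codes.append(max(qmin, min(qmax, q)))  # clamp into the int4 range
    return codes, scales, zeros


def dequantize_int4_groupwise(codes, scales, zeros, group_size=4):
    """Map 4-bit codes back to approximate float values."""
    return [
        codes[i] * scales[i // group_size] + zeros[i // group_size]
        for i in range(len(codes))
    ]


weights = [0.0, 0.5, 1.0, 1.5, -2.0, -2.0, -2.0, -2.0]
codes, scales, zeros = quantize_int4_groupwise(weights, group_size=4)
restored = dequantize_int4_groupwise(codes, scales, zeros, group_size=4)
```

With round-to-nearest, each restored value differs from the original by at most half of its group's scale. GPTQ goes further by adjusting the remaining weights to compensate for that rounding error, which this sketch deliberately leaves out.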

@HDCharles requested a review from jerryzh168 on Apr 3, 2024 06:56
@facebook-github-bot added the CLA Signed label on Apr 3, 2024
@HDCharles requested a review from cpuhrsch on Apr 3, 2024 06:56
@HDCharles merged commit b0a333c into main on Apr 3, 2024
7 checks passed
dbyoung18 pushed a commit to dbyoung18/ao that referenced this pull request Jul 31, 2024
* add int4 gptq and eval

Summary: adding int4 gptq and eval support. Also fixed a few bugs
relating to quantizing the activation both during gptq calculation and
when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

Reviewers:

Subscribers:

Tasks:

Tags:

ghstack-source-id: d29b6d73c90dec5171e12938afee25e5f42e042d
Pull Request resolved: pytorch#115

* add int4 gptq and eval

Summary: adding int4 gptq and eval support. Also fixed a few bugs
relating to quantizing the activation both during gptq calculation and
when calculating the output.

Test Plan: python test/quantization/test_quant_api.py

Reviewers:

Subscribers:

Tasks:

Tags:

* remove debug from GPTQ

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:
yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024
Update README.md

Update README.md (pytorch#118)

Update README.md

Update README.md (pytorch#121)

Update README based on pytorch#107