Skip to content

Commit

Permalink
Merge branch 'ArmDeveloperEcosystem:main' into demo-functionality
Browse files Browse the repository at this point in the history
  • Loading branch information
zachlasiuk authored Sep 13, 2024
2 parents d7e18eb + b57ce1b commit 66dda2a
Show file tree
Hide file tree
Showing 3 changed files with 87 additions and 67 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -7,12 +7,12 @@ who_is_this_for: This is an introductory topic for developers interested in runn

learning_objectives:
- Download and build llama.cpp on your Arm server.
- Download a pre-quantized Llama 2 model from Hugging Face.
- Download a pre-quantized Llama 3.1 model from Hugging Face.
- Re-quantize the model weights to take advantage of Arm improvements.
- Compare the pre-quantized Llama 2 model weights performance to the re-quantized weights on your Arm CPU.
- Compare the pre-quantized Llama 3.1 model weights performance to the re-quantized weights on your Arm CPU.

prerequisites:
- An AWS Graviton3 c7g.2xlarge instance to test Arm performance optimizations, or any [Arm based instance](/learning-paths/servers-and-cloud-computing/csp/) from a cloud service provider or an on-premise Arm server.
- An AWS Graviton3 c7g.16xlarge instance to test Arm performance optimizations, or any [Arm based instance](/learning-paths/servers-and-cloud-computing/csp/) from a cloud service provider or an on-premise Arm server.

author_primary: Pareena Verma, Jason Andrews, and Zach Lasiuk

Expand Down
Loading

0 comments on commit 66dda2a

Please sign in to comment.