Performance on Apple silicon #5
@marella I haven't been able to use the LLM again; once we manage to solve the StarCoder quantize issue, I can post the M1 Pro 64GB performance numbers.
Hi, I'm guessing the issue might be related to performance: it is running too slowly, so it takes a long time to print the output. By default:

```python
for token in llm('def fibo(', max_new_tokens=5, stream=True):
    print(token, end='', flush=True)
```

The code used by the starcoder example and this library is the same. The only difference is in how it is built, so building the library from source can help validate whether building locally improves performance, and may prevent the library from getting stuck:

```sh
git clone --recurse-submodules https://github.com/marella/ctransformers
cd ctransformers
./scripts/build.sh
```

```python
llm = AutoModelForCausalLM.from_pretrained(..., lib='/path/to/ctransformers/build/lib/libctransformers.dylib')
```
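To tell whether the locally built library is actually faster, it helps to measure tokens per second around the streaming loop. Here is a minimal sketch of such a helper; the `llm` call in the comment assumes the `from_pretrained` model from above, and the stand-in generator at the bottom exists only so the snippet runs on its own:

```python
import time
from typing import Iterable, Tuple

def tokens_per_second(token_stream: Iterable[str]) -> Tuple[int, float]:
    """Consume a token iterator and return (token_count, tokens/sec)."""
    start = time.perf_counter()
    count = 0
    for _ in token_stream:
        count += 1
    elapsed = time.perf_counter() - start
    return count, count / elapsed if elapsed > 0 else 0.0

# With ctransformers this would look like:
#   count, tps = tokens_per_second(llm('def fibo(', max_new_tokens=64, stream=True))
# Stand-in generator so the helper is runnable without a model:
count, tps = tokens_per_second(iter(['def', ' fibo', '(', 'n', '):']))
print(count, tps)
```

Running this once against the prebuilt wheel and once against the locally built `.dylib` gives a direct comparison.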
See #8
Context: #1 (comment)
@bgonzalezfractal did you notice any performance improvement just by changing the `threads` parameter? If you don't have the latest quantized models, you can go back to the previous commit using:
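As a rough way to experiment with the `threads` parameter, one can derive a thread count from the available cores and pass it at generation time. This is only a sketch: the reservation heuristic below is an assumption, not a recommendation from the library, and the commented-out `llm(...)` call assumes the model object from the examples above:

```python
import os

def pick_threads(reserve: int = 2) -> int:
    """Pick a thread count, leaving some cores for the system.

    Heuristic only (assumption): on Apple silicon, values near the
    number of performance cores tend to work best; benchmark to confirm.
    """
    total = os.cpu_count() or 1
    return max(1, total - reserve)

threads = pick_threads()
# Assumed usage with ctransformers:
#   for token in llm('def fibo(', max_new_tokens=5, stream=True, threads=threads):
#       print(token, end='', flush=True)
print(threads)
```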
Here you can run the build commands and check:

```sh
cmake -S . -B build
cmake --build build
```