
How to use fp16 version of the model file? #676

Closed
cyio opened this issue Apr 2, 2024 · 2 comments
Labels
question Further information is requested

Comments

cyio commented Apr 2, 2024

Question

How can I use the fp16 version of the model file? Example files: https://huggingface.co/Xenova/modnet/tree/main/onnx

@cyio cyio added the question Further information is requested label Apr 2, 2024
xenova (Collaborator) commented Apr 2, 2024

This requires the experimental v3 branch (#545), where you can specify dtype:

import { pipeline } from '@xenova/transformers';

// Create feature extraction pipeline
const extractor = await pipeline('feature-extraction', 'Xenova/all-MiniLM-L6-v2', {
    // device: 'webgpu', // optional
    dtype: 'fp32', // or 'fp16'
});
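Applied to the model from the question, a sketch might look like this. It assumes the v3 branch is installed and that `Xenova/modnet` runs under the `image-segmentation` task; the task name and input URL are assumptions, not confirmed in this thread:

```javascript
import { pipeline } from '@xenova/transformers';

// Load the fp16 weights instead of the default fp32 file.
// Assumption: modnet is served through the 'image-segmentation' task.
const segmenter = await pipeline('image-segmentation', 'Xenova/modnet', {
    dtype: 'fp16',
});

// Hypothetical input image URL.
const output = await segmenter('https://example.com/portrait.png');
```

With `dtype: 'fp16'`, the library should fetch the `model_fp16.onnx` variant from the repo's `onnx/` folder rather than the full-precision file.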

cyio (Author) commented Apr 3, 2024

Steps:

  1. npm install xenova/transformers.js#v3
  2. For a Vite project, resolve the top-level await build error (`Transform failed with 3 errors - Top-level await is not available in the configured target environment`) as described in remix-run/remix#7969 (comment)
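For step 2, the usual fix is raising the build target so the bundler accepts top-level await. A minimal sketch of a Vite config under that assumption (the exact target values may need adjusting for your supported browsers):

```javascript
// vite.config.js
import { defineConfig } from 'vite';

export default defineConfig({
    build: {
        // Allow top-level await in the final bundle.
        target: 'esnext',
    },
    optimizeDeps: {
        esbuildOptions: {
            // Apply the same target during dependency pre-bundling.
            target: 'esnext',
        },
    },
});
```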
