Fix conversion between F64 and F32 #726

stduhpf · 2025-07-08T19:49:34Z

Quantization/dequantization between F64 and F32 wasn't being handled properly, this was causing some issues.

F64 support in most GGML backends isn't really complete right now, so models using F64 weights can still make the program crash during inference. At least it's possible to convert F64 models to F32 with these changes.

leejet · 2025-07-23T17:04:29Z

Thank you for your contribution. But I think it would be better to convert f64 to f32 and i64 to i32 directly when loading from file. You can check out my latest commit.

improve f64 support (for convert mostly)

5a508b0

stduhpf mentioned this pull request Jul 8, 2025

Fix loading diffusers model (+support F64/I64 types) #681

Merged

stduhpf marked this pull request as draft July 8, 2025 20:01

f64<->quant

0aa6ca7

stduhpf marked this pull request as ready for review July 8, 2025 21:56

leejet closed this Sep 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix conversion between F64 and F32 #726

Fix conversion between F64 and F32 #726

Uh oh!

stduhpf commented Jul 8, 2025

Uh oh!

leejet commented Jul 23, 2025

Uh oh!

Uh oh!

Fix conversion between F64 and F32 #726

Fix conversion between F64 and F32 #726

Uh oh!

Conversation

stduhpf commented Jul 8, 2025

Uh oh!

leejet commented Jul 23, 2025

Uh oh!

Uh oh!