
The MeshTransformer does not generate coherent results #18

@Kurokabe

I have trained the MeshTransformer on 200 different meshes from the ShapeNet chair category, after decimation and after filtering to meshes with fewer than 400 vertices and faces. The MeshTransformer reached a loss very close to 0:
[screenshot: training loss curve]
But when I call the generate method of the MeshTransformer, I get very bad results.
From left to right: ground truth, autoencoder output, then MeshTransformer-generated meshes at temperatures 0, 0.1, 0.7, and 1. This is with meshgpt-pytorch version 0.3.3.
[screenshot: generated meshes at each temperature]
Note: the MeshTransformer was not conditioned on text or anything else, so the output is not supposed to look exactly like the sofa, but it barely looks like a chair. We can guess the backrest and the legs, but that's it.
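For readers unfamiliar with the temperature knob used above: temperature 0 means greedy argmax decoding, and higher values flatten the softmax so sampling gets more random. A minimal, self-contained sketch of that behavior (not meshgpt-pytorch's actual implementation; logits here are made up):

```python
import numpy as np

def sample_with_temperature(logits, temperature, rng):
    """Pick a token id from raw logits.

    temperature == 0 -> greedy argmax; higher temperatures flatten
    the distribution so low-probability tokens get sampled more often.
    """
    logits = np.asarray(logits, dtype=np.float64)
    if temperature == 0:
        return int(np.argmax(logits))
    scaled = logits / temperature
    probs = np.exp(scaled - scaled.max())  # stable softmax
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))

rng = np.random.default_rng(0)
print(sample_with_temperature([2.0, 1.0, 0.1], 0, rng))  # -> 0 (greedy picks the largest logit)
```

So the four generations above span the range from fully deterministic (0) to unmodified sampling (1).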

Initially I thought there might be an error with the KV cache, so here are the results with cache_kv=False:
[screenshot: results with cache_kv=False]

And this one is with meshgpt-pytorch version 0.2.11:
[screenshot: results on version 0.2.11]

When I trained on a single chair with a version earlier than 0.2.11, the generate method was able to create a coherent chair (from left to right: ground truth, autoencoder output, meshtransformer.generate()):

[screenshot: single-chair comparisons]

Why are the generated results so bad even though the transformer loss was very low?
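One back-of-envelope that makes this gap at least plausible: the training loss is measured with teacher forcing (one step at a time, conditioned on ground-truth history), while generation is free-running, so per-step errors compound over the whole token sequence. Both numbers below are illustrative assumptions, not measurements from this run:

```python
# Illustrative only: assumed per-token accuracy and sequence length.
per_token_acc = 0.99   # assumed teacher-forced next-token accuracy
seq_len = 800          # assumed tokens per mesh (faces x codes per face)

# Probability that an entire free-running generation makes no mistake,
# treating per-token errors as independent.
p_all_correct = per_token_acc ** seq_len
print(f"P(all {seq_len} tokens correct) ~ {p_all_correct:.1e}")
```

Under these assumptions, a model that looks near-perfect per step still almost never produces a flawless long sequence, which is one reason a near-zero teacher-forced loss does not guarantee coherent samples.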

I have uploaded the autoencoder and MeshTransformer checkpoints (on version 0.3.3), as well as 10 data samples, here: https://file.io/nNsfTyHX4aFB

Also, a quick question: why rewrite the transformer from scratch rather than use the HuggingFace GPT2 transformer?
