
Why is MobileLLaMA 2.7B faster than OpenLLaMA, as reported in the v1 paper? #46

Open
XiaohuJoshua opened this issue Apr 15, 2024 · 2 comments

Comments

@XiaohuJoshua

Thanks for the impressive work.
Have I overlooked any modifications to MobileLLaMA's architecture in the v1 paper that could enhance its inference speed? It appears that LDP is not applied to the LLM component.

@huyiming2018

You can take a look at Section 3.3 of the MobileVLM paper; in fact, the model size of MobileLLaMA is smaller than that of OpenLLaMA.

@XiaohuJoshua
Author

So it is because MobileLLaMA 2.7B has about 0.3B fewer parameters than OpenLLaMA 3B? Thanks for your reply.
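
For intuition, here is a rough back-of-the-envelope sketch of where the size gap comes from. The configs below (hidden size, layer count, FFN width) are assumptions based on the two models' public `config.json` files, not values stated in this thread, so treat the estimates as illustrative. Fewer parameters mean fewer weights to read and multiply per decoded token, which is what makes the smaller model faster at the same precision and hardware.

```python
# Back-of-the-envelope parameter count for a LLaMA-style decoder-only model.
# The configs below are assumptions (hidden size, layer count, FFN width);
# treat the exact numbers as illustrative, not as values from the paper.

def llama_params(vocab, d_model, n_layers, d_ffn):
    """Approximate parameter count for a LLaMA-style transformer decoder."""
    embed = vocab * d_model        # input token embedding
    lm_head = vocab * d_model      # output projection (untied, as in LLaMA)
    attn = 4 * d_model * d_model   # q, k, v, o projections per layer
    ffn = 3 * d_model * d_ffn      # gate, up, down projections (SwiGLU MLP)
    norms = 2 * d_model            # two RMSNorm weight vectors per layer
    return embed + lm_head + n_layers * (attn + ffn + norms) + d_model  # + final norm

# Assumed configs (illustrative):
mobilellama_2_7b = llama_params(vocab=32000, d_model=2560, n_layers=32, d_ffn=6912)
openllama_3b = llama_params(vocab=32000, d_model=3200, n_layers=26, d_ffn=8640)

print(f"MobileLLaMA 2.7B ~ {mobilellama_2_7b / 1e9:.2f}B parameters")
print(f"OpenLLaMA 3B     ~ {openllama_3b / 1e9:.2f}B parameters")
```

Under these assumed configs the sketch lands near the published sizes (roughly 2.7B vs 3.4B), so the per-token compute and memory traffic of MobileLLaMA 2.7B are noticeably lower even though no LDP-style change is applied to the LLM itself.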
