RLHF: Reinforcement Learning from Human Feedback
LLM: Large Language Model
INT4/8量化
Embedding
knowledge database expert
fine-tuned digital person
- https://medium.com/geekculture/list-of-open-sourced-fine-tuned-large-language-models-llm-8d95a2e0dc76
- https://arxiv.org/pdf/2303.18223.pdf
- https://mp.weixin.qq.com/s/kjzRzoUenP0NYa1A9lS7Aw
- https://mp.weixin.qq.com/s/_9JevS70pRqEmPRbVVM9Vw
- https://zhuanlan.zhihu.com/p/614766286
- https://mp.weixin.qq.com/s/M-ToNk8SABoP2JG0xLUBxQ
- https://github.com/zhengzangw/awesome-huge-models
- https://github.com/eugeneyan/open-llms