- 主页 > 生活百科 > >
ChatGPT/InstructGPT详解( 六 )
^Radford, A., Wu, J., Child, R., Luan, D., Amodei, D. and Sutskever, I., 2019. Language models are unsupervised multitask learners. *OpenAI blog*, *1*(8), p.9. https://life-extension.github.io/2020/05/27/GPT%E6%8A%80%E6%9C%AF%E5%88%9D%E6%8E%A2/language-models.pdf ^Brown, Tom B., Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan et al. “Language models are few-shot learners.” *arXiv preprint arXiv:2005.14165* (2020). https://proceedings.neurips.cc/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf ^Wei, Jason, et al. "Finetuned language models are zero-shot learners." *arXiv preprint arXiv:2109.01652* (2021). https://arxiv.org/pdf/2109.01652.pdf ^Christiano, Paul F., et al. "Deep reinforcement learning from human preferences." *Advances in neural information processing systems* 30 (2017). https://arxiv.org/pdf/1706.03741.pdf ^Schulman, John, et al. "Proximal policy optimization algorithms." *arXiv preprint arXiv:1707.06347* (2017). https://arxiv.org/pdf/1707.06347.pdf?
推荐阅读
-
-
-
霸气的小猪佩琦|生死战23分惨案!湖人连赢4场太狠,休城面临重建
-
「久期财经」展望“稳定”,惠誉:确认蓝色光标(300058.SZ)“B+”长期本外币发行人评级
-
|84岁老爷爷参加高考,考前自估630,得知成绩后欲哭无泪
-
-
-
好声音■《好声音》导师阵容确定,没了沈腾贾玲搭档,他能适应吗?
-
利利讲快乐|东方卫视新综艺亮相,放弃《欢乐喜剧人》了吗,90%喜剧大咖聚齐
-
特斯拉|金钱在燃烧 马斯克:特斯拉柏林、德州超级工厂正亏损数十亿美元
-
开屏|杨丽萍到公园偶遇孔雀,孔雀纷纷开屏迎接,万物皆有灵!
-
-
尤文图斯|虽然来得晚些,尤文图斯仍如愿捧杯,C罗续写着他的辉煌
-
睡前常做4种行为,容易诱发失眠,做的越多,失眠的情况越严重
-
演唱会|经典音乐人:开演唱会都不用自己开口的人!全是观众在唱——伍佰
-
-
-
-
-