- 主页 > 生活百科 > >
ChatGPT/InstructGPT详解( 六 )
^Radford, A., Wu, J., Child, R., Luan, D., Amodei, D. and Sutskever, I., 2019. Language models are unsupervised multitask learners. *OpenAI blog*, *1*(8), p.9. https://life-extension.github.io/2020/05/27/GPT%E6%8A%80%E6%9C%AF%E5%88%9D%E6%8E%A2/language-models.pdf ^Brown, Tom B., Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan et al. “Language models are few-shot learners.” *arXiv preprint arXiv:2005.14165* (2020). https://proceedings.neurips.cc/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf ^Wei, Jason, et al. "Finetuned language models are zero-shot learners." *arXiv preprint arXiv:2109.01652* (2021). https://arxiv.org/pdf/2109.01652.pdf ^Christiano, Paul F., et al. "Deep reinforcement learning from human preferences." *Advances in neural information processing systems* 30 (2017). https://arxiv.org/pdf/1706.03741.pdf ^Schulman, John, et al. "Proximal policy optimization algorithms." *arXiv preprint arXiv:1707.06347* (2017). https://arxiv.org/pdf/1707.06347.pdf?
推荐阅读
-
西奇博物馆|连续公务多日泰王又腿软,王后苏提达浓妆遮盖疲惫,努力搀扶泰王
-
-
-
家电消费网|分析:OPPO为何进入亏损的彩电业?,OPPO电视将发布
-
-
-
-
-
-
#饭饭妈妈育儿#实则却在悄悄损害孩子听力,你别大意,掏耳朵看似一种正常的行为
-
海外网|前"港独"组织召集人涉违香港国安法被捕 保释申请被拒
-
-
世界体育圈|穆雷心态崩了,帕金斯正式表态!,西部半决赛来了!掘金喊话快船
-
[晓哥聊游戏]千万不要进去,因为里面有人在钓鱼!,和平精英:在门口看到这个东西
-
-
-
-
科技快报网|Waymo将在6月恢复自动驾驶货运服务
-
居然令人难以想象地实现欧冠赛场全胜夺冠,更向世人宣布,世界足坛已发生巨变,德甲早已满血归来
-
【太平洋电脑网】华为 nova 7 Pro 发布会后:关晓彤、易烊千玺将开箱展示