ChatGPT/InstructGPT Explained (6)
Radford, Alec, et al. "Language models are unsupervised multitask learners." *OpenAI blog* 1.8 (2019): 9. https://life-extension.github.io/2020/05/27/GPT%E6%8A%80%E6%9C%AF%E5%88%9D%E6%8E%A2/language-models.pdf

Brown, Tom B., Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, et al. "Language models are few-shot learners." *arXiv preprint arXiv:2005.14165* (2020). https://proceedings.neurips.cc/paper/2020/file/1457c0d6bfcb4967418bfb8ac142f64a-Paper.pdf

Wei, Jason, et al. "Finetuned language models are zero-shot learners." *arXiv preprint arXiv:2109.01652* (2021). https://arxiv.org/pdf/2109.01652.pdf

Christiano, Paul F., et al. "Deep reinforcement learning from human preferences." *Advances in neural information processing systems* 30 (2017). https://arxiv.org/pdf/1706.03741.pdf

Schulman, John, et al. "Proximal policy optimization algorithms." *arXiv preprint arXiv:1707.06347* (2017). https://arxiv.org/pdf/1707.06347.pdf