
由伯克利大学主导团队 LMSYS Org 近日发布了一个针对大语言模型的基准平台 Chatbot Arena。该平台采用匿名、随机的方式让不同的大模型产品进行对抗评测,基于国际象棋等竞技游戏中广泛使用的 埃洛等级分系统,通过用户投票产生,系统每次会随机选择两个不同的大模型机器人和用户聊天,并让用户在匿名的情况下选择哪款大模型产品的表现更好一些。最后系统根据用户的选择判定大模型产品的积分,以排行榜的形式出现在首页中。在上线一周后, Chatbot Arena 便吸引了超过4700次匿名投票,并有越来越多人开始在该平台为不同的大模型产品投票。


","gnid":"987be2d1add4f152f","img_data":[{"flag":2,"img":[{"desc":"","height":"720","title":"","url":"http://p2.img.360kuai.com/t01cc2acfaf08414fc6.jpg","width":"1080"},{"desc":"","height":"1920","title":"","url":"http://p2.img.360kuai.com/t0156959cf05830e769.jpg","width":"1080"},{"desc":"","height":"748","title":"","url":"http://p2.img.360kuai.com/t016331fc30f147ee96.jpg","width":"1080"}]}],"original":0,"pat":"art_src_1,fts0,sts0","powerby":"pika","pub_time":1686279127000,"pure":"","rawurl":"http://zm.news.so.com/0465fd21f76ee7ee01c6a4e95ceadd8a","redirect":0,"rptid":"06a77e3652238c94","rss_ext":[],"s":"t","src":"品玩","tag":[{"clk":"ktechnology_1:伯克利大学","k":"伯克利大学","u":""}],"title":"大模型打擂台?竞技平台 Chatbot Arena已上线","type":"zmt","wapurl":"http://zm.news.so.com/0465fd21f76ee7ee01c6a4e95ceadd8a","ytag":"科技:人工智能:AI技术","zmt":{"brand":{},"cert":"优质科技领域创作者","desc":"有品好玩的科技,一切与你有关。","fans_num":9264,"id":"2991151609","is_brand":"0","name":"品玩","new_verify":"7","pic":"http://p5.img.360kuai.com/t019112a1b3e04850a2.jpg","real":1,"textimg":"http://p9.img.360kuai.com/bl/0_3/t017c4d51e87f46986f.png","verify":"0"},"zmt_status":0}","errmsg":"","errno":0}