DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL (arxiv.org)
1338 points by gradus_ad 6 days ago | 1051 comments
1511338 points by gradus_ad 6 days ago | 1051 comments
15165 points by naggie 3 days ago | 26 comments
1529 points by rbanffy a day ago | 0 comments
153140 points by rrampage a day ago | 63 comments
154519 points by AnhTho_FR 2 days ago | 287 comments
15583 points by thunderbong 5 days ago | 25 comments
156137 points by tsadoq 4 days ago | 50 comments
15756 points by gmays 2 days ago | 6 comments
15829 points by rookie123 20 hours ago | 10 comments
159160 points by boriskourt 4 days ago | 27 comments
16092 points by todsacerdoti 5 days ago | 34 comments
16172 points by pseudolus 5 days ago | 32 comments
162177 points by naggie 8 days ago | 47 comments
16313 points by Philpax 17 hours ago | 0 comments
164400 points by ada1981 4 days ago | 350 comments
165876 points by sbarre 5 days ago | 260 comments
16688 points by cwillu 7 days ago | 6 comments
167155 points by arti_chaud 3 days ago | 45 comments
168329 points by todsacerdoti 4 days ago | 97 comments
1696 points by Fajar_Rahmad a day ago | 3 comments
1707 points by johnneville 12 hours ago | 0 comments
171117 points by wulujia 3 days ago | 35 comments
17211 points by throwaway019254 8 hours ago | 3 comments
173169 points by kawera 5 days ago | 41 comments
17486 points by zetalyrae 7 days ago | 120 comments
175604 points by todsacerdoti 5 days ago | 483 comments
176471 points by Einenlum 3 days ago | 137 comments
177359 points by todsacerdoti 4 days ago | 243 comments
178175 points by FinnLobsien 4 days ago | 77 comments
179154 points by hrpnk a day ago | 70 comments
180