Recent Posts

DeepSeek-R1

less than 1 minute read

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning