How to Contribute in GitHub

Building Intelligent Assistants with Semantic Kernel: A Comprehensive Guide from Theory to Practice

In the past few articles, we have delved into the various components of the Semantic Kernel, including Kernels, Plugins, Agents, and multi-agent systems. These tools not only make artificial ...

16h

Tsinghua's Latest Research! How to Theoretically Unify SFT and RL, and the Efficient Adaptive Algorithm Hybrid Post-Training

Post-training of large language models has long been clearly divided into two paradigms: supervised fine-tuning (SFT) centered on imitation and reinforcement learning (RL) driven by exploration.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Building Intelligent Assistants with Semantic Kernel: A Comprehensive Guide from Theory to Practice

Tsinghua's Latest Research! How to Theoretically Unify SFT and RL, and the Efficient Adaptive Algorithm Hybrid Post-Training

Trending now