Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 199
Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 247
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2 • 61