Select - Your Community
Select
Get Mobile App

Stream of Goodies

avatar

Hacker News: Front Page

shared a link post in group #Stream of Goodies

Feed Image

datadreamer.dev

DataDreamer

Aligning a LLM with Human Preferences# In order to better align the responses instruction-tuned LLMs generate to what humans would prefer, we can train LLMs against a reward model or a dataset of hum

Comment here to discuss with all recipients or tap a user's profile image to discuss privately.

Embed post to a webpage :
<div data-postid="mzwoeqm" [...] </div>
A group of likeminded people in Stream of Goodies are talking about this.