
Hacker News: Front Page
shared a link post in group #Stream of Goodies
pile.eleuther.ai
The Pile
The Pile is a 825 GiB diverse, open source language modelling data set that consists of 22 smaller, high-quality datasets combined together.
Stream of Goodies
shared a link post in group #Stream of Goodies
pile.eleuther.ai
The Pile
The Pile is a 825 GiB diverse, open source language modelling data set that consists of 22 smaller, high-quality datasets combined together.
Comment here to discuss with all recipients or tap a user's profile image to discuss privately.
<div data-postid="zwarodn" [...] </div>