-
I do have a simple question: If we train a llm with 10e100000 repetitions of the sentence: 'there is a cat on the sofa' Is my thought accurate or not? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
This is actually a really nice description of the next-word prediction task in pretraining! In practice, that's why it's so important to have a large and diverse dataset. But yes, your understanding is spot on there. |
Beta Was this translation helpful? Give feedback.
This is actually a really nice description of the next-word prediction task in pretraining! In practice, that's why it's so important to have a large and diverse dataset. But yes, your understanding is spot on there.