
Update README to include chat templating #1372

Open · wants to merge 6 commits into base: main
Conversation

cpfiffer (Contributor):

The existing README has underwhelming or incorrect results ("Example is underwhelming" #1347) due to a lack of chat templating for instruct models.

This PR adds the special tokens to each instruct-model call, and provides comments on how to obtain/produce them.
torymur requested a review from yvan-sraka on January 13, 2025.
```
@@ -107,7 +129,7 @@ generator = outlines.generate.choice(model, Sentiment)
answer = generator(prompt)
```
Contributor:

Seems this part also needs to be adjusted, but maybe we can show only the difference:

```python
from enum import Enum

class Food(str, Enum):
    pizza = "Pizza"
    pasta = "Pasta"
    salad = "Salad"
    dessert = "Dessert"

...

generator = outlines.generate.choice(model, Food)
```

Contributor Author:

IMO every example in a README should be completely copy-able with no slice-and-dice on the user's part. This is of course personal preference, so up to y'all.

Contributor:

If one day we generate the documentation with mdbook, it offers a feature for hiding code lines from the reader. I realized there isn't a similar feature in GitHub Flavored Markdown (yet?); the closest thing I'm aware of is collapsed sections...
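For reference, a sketch of how mdbook's line-hiding looks, if memory serves: mdbook hides `#`-prefixed lines in Rust blocks by default, and the `hidelines` fence attribute extends this to other languages with a prefix of your choosing.

````markdown
```python,hidelines=~
~import outlines  # this line is hidden on the rendered page
model = outlines.models.transformers("HuggingFaceTB/SmolLM2-360M-Instruct")
```
````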

Contributor:

@yvan-sraka yeah, sadly collapsed sections don't work inside code snippets. It would be nice if visually repetitive code could be hidden by collapsing while copying still grabbed everything, hidden code included but without the tags. That way the "copy-paste and it works" magic would not be lost, and "what you see is what you get" predictability would also be served.

In mdbook, hiding code lines works nicely for things like running doc tests, but on the "copying the code" side it still copies only what's visible, which might not work as-is without the hidden parts. In the same vein there are also HTML comments, but those are equally unhelpful for copying code snippets.
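For context, a GFM collapsed section looks like the sketch below; it can wrap a whole code block, but can't hide individual lines inside one:

````markdown
<details>
<summary>Click to expand the full example</summary>

```python
import outlines
```

</details>
````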

@cpfiffer fair point!

But this enum example still needs to be updated. Considering that we're just showing different ways of doing multiple choice, maybe we can extend the original section to use an Enum in the first place and mention the plain list of strings as an alternative, wdyt?:

```python
import outlines
from enum import Enum

model_name = "HuggingFaceTB/SmolLM2-360M-Instruct"
model = outlines.models.transformers(model_name)

# You must apply the chat template tokens to the prompt!
# See below for an example.
prompt = """
<|im_start|>system
You extract information from text.
<|im_end|>

<|im_start|>user
What food does the following text describe?

Text: I really really really want pizza.
<|im_end|>
<|im_start|>assistant
"""

class Food(str, Enum):
    pizza = "Pizza"
    pasta = "Pasta"
    salad = "Salad"
    dessert = "Dessert"

generator = outlines.generate.choice(model, Food)

# You can also pass these choices simply as a list:
# generator = outlines.generate.choice(model, ["Pizza", "Pasta", "Salad", "Dessert"])

answer = generator(prompt)
# Likely answer: Pizza
```

Contributor Author:

Awesome, I like the updated example. I'll add it. In general, it sounds like we'll need to overhaul the docs, which I suspect will be more fruitful after the 1.0 release.

yvan-sraka (Contributor):

Rendered

Co-authored-by: Victoria Terenina <[email protected]>
cpfiffer (Contributor Author):

Great, appreciate the comments!

yvan-sraka (Contributor) left a comment:

This overall LGTM. I like that we provide a lot of examples in the README.md, and I understand why the Positive/Negative one felt counterintuitive... That said, I found most of the examples a bit abstract/useless and would prefer more concrete, real-world use cases!

The example that probably fits this category best is character generation, though it would be even more interesting if one of the character-generation fields were an unconstrained (or less constrained) string, something like a description field (possibly with a limit, e.g., fewer than 500 characters of lore)! Otherwise, I think it could be nice to hide internet-culture jokes in the examples because, hey, we're cool kids! Of course, these comments are for future improvement; the PR is already great and ready to be merged as is!
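For illustration, such a field might look like the sketch below. This is a hedged example of my own, assuming the pre-1.0 `outlines.generate.json` API and that outlines honors Pydantic's `max_length` constraint; the `Character` model and prompt are made up:

```python
import outlines
from pydantic import BaseModel, Field

class Character(BaseModel):
    name: str
    strength: int
    # Less constrained: free-form lore, capped at 500 characters
    description: str = Field(max_length=500)

model = outlines.models.transformers("HuggingFaceTB/SmolLM2-360M-Instruct")
generator = outlines.generate.json(model, Character)

# Chat template tokens omitted here for brevity; see the templating section below.
character = generator("Generate a fantasy character with a short backstory.")
```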

Comment on lines +420 to +448
You can find the chat template tokens in the model's HuggingFace repo or documentation. As an example, the SmolLM2-360M-Instruct special tokens can be found [here](https://huggingface.co/HuggingFaceTB/SmolLM2-360M-Instruct/blob/main/special_tokens_map.json).

A convenient way to do this is to use the `tokenizer` from the `transformers` library:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("HuggingFaceTB/SmolLM2-360M-Instruct")
prompt = tokenizer.apply_chat_template(
    [
        {"role": "system", "content": "You extract information from text."},
        {"role": "user", "content": "What food does the following text describe?"},
    ],
    tokenize=False,
    add_bos=True,
    add_generation_prompt=True,
)
```

yields

```
<|im_start|>system
You extract information from text.<|im_end|>
<|im_start|>user
What food does the following text describe?<|im_end|>
<|im_start|>assistant
```

Member:

We can leave the warning, but this section should be in the documentation, and we can add a link to it from here.

Member:

I would go as far as putting this warning before the first example instead of having a separate section.

Contributor Author:

Agreed. I'm not sure what the best place for it is. We could add it to the prompting reference page or to a new page; not sure which is better. It wouldn't be too hard to cook up a new page for the chat templating issue, since we can provide lots of little bits of context.
