
bug-fix: snprintf prints NULL in place of the last character #10419

Open
wants to merge 2 commits into base: master
Conversation

@kallewoof kallewoof commented Nov 20, 2024

We need to give snprintf enough space to print the last character and the null character, thus we allocate one extra byte and then ignore it when converting to std::string.

char buf[5];
snprintf(buf, 5, "hello"); // buf now contains "hell\0"
printf("%s", buf);         // prints "hell"

Because of this, when copying the C string into the std::string, we get a \u0000 at the end of it, which can cause issues.


ngxson commented Nov 20, 2024

@slaren The return value of snprintf excludes the terminating null byte. Should we add one byte before returning from llama_model_meta_val_str, or it's up to the user?

In any case, I think we should leave a comment in llama.h to clarify this behavior.


kallewoof commented Nov 20, 2024

Edited: never mind, this is C code.

Added a comment. I think a cleaner approach might be to have this allocate and return a char*, which is then strlen()'d instead. Chat templates are not so large that the extra length check would have a noticeable impact.


kallewoof commented Nov 20, 2024

I pushed an alternative PR in #10424 which simplifies the interface but requires a free() call.
Rewritten to not require free() as #10430.


slaren commented Nov 20, 2024

Should we add one byte before returning from llama_model_meta_val_str, or it's up to the user?

It's up to the user. They do not need to use NUL-terminated strings.


kallewoof commented Nov 20, 2024

Unless there are architecture subtleties that I'm not aware of, I think #10430 is a cleaner solution, but keeping both up until a judgement is made.


ngxson commented Nov 20, 2024

Should we also apply this patch to every place where llama_chat_apply_template is called? (We don't have many of those)

@kallewoof

Should we also apply this patch to every places where llama_chat_apply_template is called? (We don't have many of those)

I looked and it doesn't seem like it is affected since the allocated buffer is basically guaranteed to be bigger than the chat template (including NULL term).


ngxson commented Nov 21, 2024

Sorry I mean changing all other places using llama_model_meta_val_str (there is one inside llama_chat_apply_template IIRC)


kallewoof commented Nov 22, 2024

Sorry I mean changing all other places using llama_model_meta_val_str (there is one inside llama_chat_apply_template IIRC)

Right -- the only other place is in server.cpp, specifically in

bool validate_model_chat_template() const {
    std::vector<char> model_template(2048, 0); // longest known template is about 1200 bytes
    std::string template_key = "tokenizer.chat_template";
    int32_t res = llama_model_meta_val_str(model, template_key.c_str(), model_template.data(), model_template.size());
    if (res >= 0) {
        llama_chat_message chat[] = {{"user", "test"}};
        std::string tmpl = std::string(model_template.data(), model_template.size());
        int32_t chat_res = llama_chat_apply_template(model, tmpl.c_str(), chat, 1, true, nullptr, 0);
        return chat_res > 0;
    }
    return false;
}

which also preallocates a big enough buffer, like llama_chat_apply_template.
