Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Converting to UTF-8 code units? #8

Open
MarieTay opened this issue Jan 24, 2020 · 3 comments
Open

Converting to UTF-8 code units? #8

MarieTay opened this issue Jan 24, 2020 · 3 comments

Comments

@MarieTay
Copy link

Hi! What's the processing behind converting to UTF-8 code units?

I'm trying to accomplish this same thing, but not sure what the conversion process needs to be.

Any assistance is appreciated!

Thanks!

@MarieTay
Copy link
Author

This is a wonderful page, by the way! Thanks for making it!

@MarieTay
Copy link
Author

And let me explain a little more about what I'm trying to accomplish.

I have a known Decimal NCR value, 119808 for a utf-8 character I want to represent as (convert to) "F0 9D 90 80" which is what your utility does brilliantly.

I'd like to be able to accomplish that programatically. How does one go about doing that? And I don't mean within a specific language like JavaScript, but what are the general calculations that need to be done in any language to accomplish this?

Thanks so much!

@netjeff
Copy link

netjeff commented Jan 27, 2020

@MarieTay wrote:

What's the processing behind converting to UTF-8 code units?

RFC-3629 section 3 is a good starting point, with examples in section 7. The wikipedia page for UTF-8 also has a description and examples, plus lots of background.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants