Allow a simplified syntax without some of the symbols #131
Replies: 5 comments
-
Thank you for the suggestion. Right now we are bringing up the core language and type system; syntax optimizations are not a priority at the moment.
-
To be honest, it is nicer for AI token usage, but it's much less clear what everything is. I much prefer spending a few more tokens to be able to read my own code and code created by an AI (even more so with the AI).
-
The problem is not the cost of using a few more tokens; it is that the number of tokens an AI can process is limited, so each additional token means one less association and less complexity in the code. Today's publicly available OpenAI token limits are 4000 for GPT-3.5 Turbo and 8000 for GPT-4. If you use a free LLM to generate code, which lets you keep an AI always running on your server (looking for bugs in each PR, for example), the limit is 2000 tokens. These tokens are split between question and answer, so if you write a good, detailed prompt, you lose room for generated code. Now think about two languages to choose from:
After one or two years of compounded 2% improvements, how much better would a piece of software be if it used fewer tokens? How would the entire ecosystem of that language be affected if all its software were 2% better?
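To make the budget argument concrete, here is a minimal sketch of how a fixed context window is shared between the prompt and the answer. The limits are the approximate figures quoted in this comment, and the 500-token prompt is a hypothetical value chosen only for illustration.

```python
# Approximate context limits quoted in the comment above (not official figures).
CONTEXT_LIMITS = {"GPT-3.5 Turbo": 4000, "GPT-4": 8000, "free LLM": 2000}


def tokens_left_for_answer(context_limit: int, prompt_tokens: int) -> int:
    """Return how many tokens remain for generated code after the prompt."""
    return max(context_limit - prompt_tokens, 0)


# Hypothetical prompt size, used only to illustrate the trade-off.
PROMPT_TOKENS = 500

for model, limit in CONTEXT_LIMITS.items():
    remaining = tokens_left_for_answer(limit, PROMPT_TOKENS)
    print(f"{model}: {remaining} tokens left for the answer")
```

Every token spent on the prompt, including any code pasted in for review, comes out of the same budget, which is why a syntax that tokenizes more compactly leaves more room for the answer.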
-
The improvement doesn't even need to be at the user level, just support for a simpler syntax, something that an IDE can translate back to the form you prefer. The idea is to fine-tune an AI on the simpler version, but even if I do it myself, the cascade effect on the language's ecosystem will not be as big as if it came from the creator.
-
I don't think this should be in scope for the official language. If you want to design a syntax specifically for LLMs to train on, why not compress it even more? It doesn't need to be human readable, just LLM readable. There are many different ways you could define compression schemes for this, so I think it would be most appropriate as separate community projects.
-
We are in the age of AI, and generating code with AI is a path of no return; in the long run, the number of tokens a language uses is an edge.
Python is already one of the cheaper languages to generate code in.
But it could be even cheaper if we removed unnecessary characters.
This is a sample of Mojo code:
Estimated tokens: 114
https://gpttools.com/estimator
Here is a simplified version without some of the symbols, which I got from ChatGPT:
Estimated tokens: 101
That is a saving of roughly 11% in tokens (114 down to 101). In the long run, this could allow LLMs to generate code that is about 10% more complex, review errors in functions with about 10% more code, and so on.
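The saving can be checked with a couple of lines of Python. This is only a sketch of the arithmetic on the two estimates quoted above; the function name is made up for illustration.

```python
def token_saving(original_tokens: int, simplified_tokens: int) -> float:
    """Fraction of tokens saved by the simplified version."""
    return (original_tokens - simplified_tokens) / original_tokens


# Token estimates quoted in the post for the two code samples.
saving = token_saving(114, 101)
print(f"Tokens saved: {114 - 101} ({saving:.1%})")  # Tokens saved: 13 (11.4%)
```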
I tried to iterate and reduce it further, but I'm not really a Python developer, so I'm not the best person to say which symbols could be cut without the code losing its usefulness.
I'm a C# developer thinking of migrating to Mojo because it looks more like C# than Python does.
What I'm proposing is not to drop support for these symbols but to make them optional.
Any IDE can change the color of the parameters; even the parentheses could be optimized.