What is it that you think would make a certain non-Python language "more optimal" for an LLM? Is there something inherently LLM-friendly about certain language patterns or is "huge sets of training examples" and "a robust standard library" (the latter to conserve tokens/attention vs having to spit out super-verbose 20x longer assembly all day) all "optimality" means?
It's fair to point out that I didn't define exactly what we're optimizing for, but I can have the LLM generate assembly if I ask. It'll be faster than python, at the expense of readability, but if we're no longer writing code, then why not straight up use assembly? ARM vs x86 becomes an issue, but are there other reasons not to use assembly?