The "1kyu" and "3kyu" problems on Codewars were beyond all three models: each one either timed out or produced an incomplete solution. The creator noted that Qwen2.5-Coder-7B-Instruct seemed to be the most capable model overall, but even it could not handle the most complex algorithmic challenges.
Ultimately, these tests highlight both the impressive capabilities and the current limitations of state-of-the-art AI coding models. They can handle straightforward programming tasks and even generate functional games, but the lack of internet access and the complexity of certain coding challenges proved to be significant hurdles.