I just wanted to throw something fun and pretty awesome out there!
I’ve been using Google Gemini 2.0 for Free the last few days on a few projects that I’ve been working on. And heresy I know, but I’ve actually been using Cline for VS Code as it offers a lot of awesome features still lacking in Bolt.diy (but I still see it as the future of AI coding). The problem with Cline is that it uses A LOT of tokens and was going through my Claude credits pretty fast…
So I tried to use LLama3.3-70B-Instruct, because it’s < 20¢ / MTok through Openrouter and pretty snappy too. Crazy, being that Claude is 15x+ that. But I wasn’t having much luck, and then I remembered Google offers Gemini Flash 2.0 (exp-1219 currently) for free and I heard there are no caps or limits. Microsoft also offers ChatGPT 4o through GitHub for Free as well, but with my testing it didn’t seem to work well for coding and I tried it once with Cline and ran out of my daily token limit within minutes… absurd.
Sadly, LLama3.3-70B-Instruct seemed to get stuck in logic loops and just rack up token usage even worse than Claude but it really didn’t cost much to test, so that’s fine. The real interesting part was when trying Gemini exp-1219, it seemed to do the best and by round two of back and forth, everything I asked for (in steps) was working. Impressive!
Gemini for me seems to work better than Claude even, is currently free to use and didn’t seem to have any limits. This was my token usage in one task session…
If anyone else wants to sign up and try it out, please visit Google AI Studio to get an API_KEY.
P.S. If anyone else has any useful tidbits, I’d love to hear them. This is all experimental at this point and I just want to improve my workflow. Thanks and best!