X's Grok Surpasses Expectations in AI Coding Tests

When X first launched its chatbot, it was tucked behind a paywall. But, as the saying goes, there ain't no such thing as a free lunch (TANSTAAFL), until recently when X decided to open up Grok to everyone. Curious about its capabilities, I decided to put it through my programming tests.
I've always had a soft spot for Grok, thanks to its name, which was coined by Robert Heinlein, one of my all-time favorite sci-fi authors. Heinlein's works played a significant role in shaping my young mind. My parents, who were quite strict about the media I consumed, allowed me to dive into science fiction at our local library, under the assumption that anything labeled as 'science' must be educational.
Heinlein's stories were not just entertaining; they were thought-provoking, challenging societal norms and weaving in scientific themes with social commentary. The term "grok," introduced in *Stranger in a Strange Land*, embodies a deep, fundamental understanding, making it a fitting name for an AI chatbot.
However, there's a catch...
When I inquired about the large language model (LLM) Grok uses, it mentioned being inspired by the wit and rebelliousness of *Hitchhiker's Guide to the Galaxy*. While *Hitchhiker's* certainly has its charm, it doesn't actually use the term "grok." But let's move on to the programming tests.
1. Writing a WordPress Plugin
This test required the AI to demonstrate PHP programming skills and knowledge of WordPress plugin development. It stemmed from a real-life request from my wife, who needed a tool to randomize names for her e-commerce site's monthly involvement device. The twist was that some users could have multiple entries, so the randomizer needed to ensure these names weren't placed side by side.
The code also had to be user-friendly, allowing her to simply paste names, click a button, and get her list. Grok passed this test with flying colors. The interface was clean, functional, and did exactly what it was supposed to do.
2. Rewriting a String Function
The second test involved fixing a user-reported issue with a function meant to validate dollar and cent amounts. My original code only accepted integers, so $5 was valid, but $5.25 wasn't. Grok rewrote the regular expression, coming close to a win. However, it failed to recognize numbers like .5 as valid currency, and it used an inefficient method with double conversions. So, it's a loss on this one.
3. Finding an Annoying Bug
This test required understanding the WordPress framework and API to pinpoint a subtle bug. Many LLMs, including myself initially, struggled with this. But Grok nailed it, providing a correct and useful solution. That's two wins out of three.
4. Writing a Script
The final test was a challenging one, requiring knowledge of Keyboard Maestro, a niche Mac scripting tool, and the ability to write code for multiple environments simultaneously: Keyboard Maestro, Chrome, and AppleScript. Only Google Gemini and ChatGPT with GPT-4 or higher had passed this test previously. Grok, however, aced it, securing three wins out of four.
Final Thoughts
Grok held up well in these tests. If it had just allowed currency values without a leading zero, it would have been perfect. Despite my mixed feelings about the changes at X since it replaced Twitter, Grok has proven to be a robust chatbot, especially in terms of programming skills.
What's your take on Grok? Have you tried it out? And what about *Stranger in a Strange Land* or *Hitchhiker's Guide to the Galaxy*? Share your thoughts in the comments below. So long, and thanks for all the fish!
Related article
AI Tools Transform Text into Free Sound Effects for Creative Projects
Producing sound effects once demanded costly equipment and expert sound designers. Now, AI-powered tools are reshaping audio creation by generating sounds from simple text descriptions. This article h
AI Comic Factory: Create Stunning Comics with Ease Using AI
Artificial intelligence has transformed comic creation, making it simpler and more accessible than ever. With tools like the AI Comic Factory, anyone can craft captivating comics without advanced arti
TechCrunch Disrupt 2025: Save Up to $900 on Tickets Before May 25 Deadline
Hurry! Save up to $900 on TechCrunch Disrupt 2025 passes before prices increase. Grab an Early Bird ticket now and get a second at 90% off — limited time offer.These exclusive deals end May 25 at 11:5
Comments (21)
0/200
EdwardJackson
July 27, 2025 at 9:20:21 PM EDT
Grok's coding skills blew me away! 😮 I tossed some tricky Python problems at it, and it nailed them faster than my old prof could grade papers. X opening it up for free feels like a game-changer—wonder how long it'll stay this good before they slap a paywall back on?
0
WalterLee
April 20, 2025 at 7:43:55 AM EDT
ग्रॉक की कोडिंग क्षमता अद्भुत है! ऐसा लगता है जैसे मेरे पास एक सुपर स्मार्ट दोस्त है जो इंसानों से बेहतर कोड करता है। मैंने अपने टेस्ट से इसे चेक किया और यह सभी में पास हो गया, बिना किसी परेशानी के! बस काश यह कभी-कभी जल्दी जवाब देता। फिर भी, किसी भी कोडर के लिए जरूरी है! 🚀
0
JonathanKing
April 20, 2025 at 6:14:26 AM EDT
¡Las habilidades de codificación de Grok son increíbles! Es como tener un amigo superinteligente que programa mejor que la mayoría de las personas. Lo probé con mis tests y pasó todos sin problemas. Solo desearía que respondiera más rápido a veces. Aún así, esencial para cualquier programador! 🚀
0
BruceClark
April 19, 2025 at 7:37:49 AM EDT
Grokのコード能力は驚異的です!まるで人間のコードを超える友達がいるようです。自分のテストで試してみたら、全て完璧にこなしました。ただ、返事がもう少し早ければいいのに。でも、コーダーには必須のアプリですね!🚀
0
WalterWhite
April 19, 2025 at 3:26:45 AM EDT
Grokのコーディングテスト結果にびっくり!簡単なスクリプト書いてもらったけど、めっちゃ速くて正確。AIの進化、ちょっと怖いね😅
0
KennethKing
April 19, 2025 at 2:13:21 AM EDT
O Grok da X está impressionante! Lida com problemas complexos como um profissional e suas sugestões são quase sempre precisas. Às vezes é um pouco detalhista demais, mas de forma geral, é uma ferramenta excelente para desenvolvedores!
0
When X first launched its chatbot, it was tucked behind a paywall. But, as the saying goes, there ain't no such thing as a free lunch (TANSTAAFL), until recently when X decided to open up Grok to everyone. Curious about its capabilities, I decided to put it through my programming tests.
I've always had a soft spot for Grok, thanks to its name, which was coined by Robert Heinlein, one of my all-time favorite sci-fi authors. Heinlein's works played a significant role in shaping my young mind. My parents, who were quite strict about the media I consumed, allowed me to dive into science fiction at our local library, under the assumption that anything labeled as 'science' must be educational.
Heinlein's stories were not just entertaining; they were thought-provoking, challenging societal norms and weaving in scientific themes with social commentary. The term "grok," introduced in *Stranger in a Strange Land*, embodies a deep, fundamental understanding, making it a fitting name for an AI chatbot.
However, there's a catch...
When I inquired about the large language model (LLM) Grok uses, it mentioned being inspired by the wit and rebelliousness of *Hitchhiker's Guide to the Galaxy*. While *Hitchhiker's* certainly has its charm, it doesn't actually use the term "grok." But let's move on to the programming tests.
1. Writing a WordPress Plugin
This test required the AI to demonstrate PHP programming skills and knowledge of WordPress plugin development. It stemmed from a real-life request from my wife, who needed a tool to randomize names for her e-commerce site's monthly involvement device. The twist was that some users could have multiple entries, so the randomizer needed to ensure these names weren't placed side by side.
The code also had to be user-friendly, allowing her to simply paste names, click a button, and get her list. Grok passed this test with flying colors. The interface was clean, functional, and did exactly what it was supposed to do.
2. Rewriting a String Function
The second test involved fixing a user-reported issue with a function meant to validate dollar and cent amounts. My original code only accepted integers, so $5 was valid, but $5.25 wasn't. Grok rewrote the regular expression, coming close to a win. However, it failed to recognize numbers like .5 as valid currency, and it used an inefficient method with double conversions. So, it's a loss on this one.
3. Finding an Annoying Bug
This test required understanding the WordPress framework and API to pinpoint a subtle bug. Many LLMs, including myself initially, struggled with this. But Grok nailed it, providing a correct and useful solution. That's two wins out of three.
4. Writing a Script
The final test was a challenging one, requiring knowledge of Keyboard Maestro, a niche Mac scripting tool, and the ability to write code for multiple environments simultaneously: Keyboard Maestro, Chrome, and AppleScript. Only Google Gemini and ChatGPT with GPT-4 or higher had passed this test previously. Grok, however, aced it, securing three wins out of four.
Final Thoughts
Grok held up well in these tests. If it had just allowed currency values without a leading zero, it would have been perfect. Despite my mixed feelings about the changes at X since it replaced Twitter, Grok has proven to be a robust chatbot, especially in terms of programming skills.
What's your take on Grok? Have you tried it out? And what about *Stranger in a Strange Land* or *Hitchhiker's Guide to the Galaxy*? Share your thoughts in the comments below. So long, and thanks for all the fish!




Grok's coding skills blew me away! 😮 I tossed some tricky Python problems at it, and it nailed them faster than my old prof could grade papers. X opening it up for free feels like a game-changer—wonder how long it'll stay this good before they slap a paywall back on?




ग्रॉक की कोडिंग क्षमता अद्भुत है! ऐसा लगता है जैसे मेरे पास एक सुपर स्मार्ट दोस्त है जो इंसानों से बेहतर कोड करता है। मैंने अपने टेस्ट से इसे चेक किया और यह सभी में पास हो गया, बिना किसी परेशानी के! बस काश यह कभी-कभी जल्दी जवाब देता। फिर भी, किसी भी कोडर के लिए जरूरी है! 🚀




¡Las habilidades de codificación de Grok son increíbles! Es como tener un amigo superinteligente que programa mejor que la mayoría de las personas. Lo probé con mis tests y pasó todos sin problemas. Solo desearía que respondiera más rápido a veces. Aún así, esencial para cualquier programador! 🚀




Grokのコード能力は驚異的です!まるで人間のコードを超える友達がいるようです。自分のテストで試してみたら、全て完璧にこなしました。ただ、返事がもう少し早ければいいのに。でも、コーダーには必須のアプリですね!🚀




Grokのコーディングテスト結果にびっくり!簡単なスクリプト書いてもらったけど、めっちゃ速くて正確。AIの進化、ちょっと怖いね😅




O Grok da X está impressionante! Lida com problemas complexos como um profissional e suas sugestões são quase sempre precisas. Às vezes é um pouco detalhista demais, mas de forma geral, é uma ferramenta excelente para desenvolvedores!












