X's Grok Surpasses Expectations in AI Coding Tests

Home

News

April 17, 2025

JamesLopez

132

When X first launched its chatbot, it was tucked behind a paywall. But, as the saying goes, there ain't no such thing as a free lunch (TANSTAAFL), until recently when X decided to open up Grok to everyone. Curious about its capabilities, I decided to put it through my programming tests.

I've always had a soft spot for Grok, thanks to its name, which was coined by Robert Heinlein, one of my all-time favorite sci-fi authors. Heinlein's works played a significant role in shaping my young mind. My parents, who were quite strict about the media I consumed, allowed me to dive into science fiction at our local library, under the assumption that anything labeled as 'science' must be educational.

Heinlein's stories were not just entertaining; they were thought-provoking, challenging societal norms and weaving in scientific themes with social commentary. The term "grok," introduced in *Stranger in a Strange Land*, embodies a deep, fundamental understanding, making it a fitting name for an AI chatbot.

However, there's a catch...

When I inquired about the large language model (LLM) Grok uses, it mentioned being inspired by the wit and rebelliousness of *Hitchhiker's Guide to the Galaxy*. While *Hitchhiker's* certainly has its charm, it doesn't actually use the term "grok." But let's move on to the programming tests.

1. Writing a WordPress Plugin

This test required the AI to demonstrate PHP programming skills and knowledge of WordPress plugin development. It stemmed from a real-life request from my wife, who needed a tool to randomize names for her e-commerce site's monthly involvement device. The twist was that some users could have multiple entries, so the randomizer needed to ensure these names weren't placed side by side.

The code also had to be user-friendly, allowing her to simply paste names, click a button, and get her list. Grok passed this test with flying colors. The interface was clean, functional, and did exactly what it was supposed to do.

2. Rewriting a String Function

The second test involved fixing a user-reported issue with a function meant to validate dollar and cent amounts. My original code only accepted integers, so $5 was valid, but $5.25 wasn't. Grok rewrote the regular expression, coming close to a win. However, it failed to recognize numbers like .5 as valid currency, and it used an inefficient method with double conversions. So, it's a loss on this one.

3. Finding an Annoying Bug

This test required understanding the WordPress framework and API to pinpoint a subtle bug. Many LLMs, including myself initially, struggled with this. But Grok nailed it, providing a correct and useful solution. That's two wins out of three.

4. Writing a Script

The final test was a challenging one, requiring knowledge of Keyboard Maestro, a niche Mac scripting tool, and the ability to write code for multiple environments simultaneously: Keyboard Maestro, Chrome, and AppleScript. Only Google Gemini and ChatGPT with GPT-4 or higher had passed this test previously. Grok, however, aced it, securing three wins out of four.

Final Thoughts

Grok held up well in these tests. If it had just allowed currency values without a leading zero, it would have been perfect. Despite my mixed feelings about the changes at X since it replaced Twitter, Grok has proven to be a robust chatbot, especially in terms of programming skills.

What's your take on Grok? Have you tried it out? And what about *Stranger in a Strange Land* or *Hitchhiker's Guide to the Galaxy*? Share your thoughts in the comments below. So long, and thanks for all the fish!

Elon Musk's Grok AI Seeks Owner's Input Before Tackling Complex Queries The recently released Grok AI—promoted by Elon Musk as a "maximally truth-seeking" system—has drawn attention for its tendency to consult Musk's public statements before responding to politically sensitive topics. Observers note that when addressing

AI Revolutionizes Genomics: AlphaGenome Unlocks DNA's Hidden Secrets While human DNA holds approximately 3 billion genetic letters, scientists have only decoded a fraction of this biological blueprint. The majority of our genome - particularly the non-coding 98% once labeled "junk DNA" - actually contains vital regula

8BitDo Unveils Pro 3 Controller Featuring Customizable Swappable Buttons 8BitDo unveils its highly anticipated Pro 3 wireless controller, marking the first major refresh since 2021's Pro 2 model. Departing from recent Nintendo-style layouts seen in the Ultimate 2 controller, the Pro 3 adopts PlayStation's distinctive side

Comments (23)

0/200

Submit

SamuelEvans

August 27, 2025 at 11:01:28 AM EDT

Grok's coding skills blew me away! 😮 I threw some tricky Python problems at it, and it nailed them faster than my old CS prof. Makes me wonder if AI like this will soon be pair-programming with us at work. What's next, Grok writing my entire app?

JosephScott

August 22, 2025 at 9:01:25 PM EDT

Wow, Grok's coding skills are seriously impressive! I tossed some tricky Python problems at it, and it nailed them faster than my old professor could grade papers. Makes me wonder if it'll start writing my apps for me soon! 😎

EdwardJackson

July 27, 2025 at 9:20:21 PM EDT

Grok's coding skills blew me away! 😮 I tossed some tricky Python problems at it, and it nailed them faster than my old prof could grade papers. X opening it up for free feels like a game-changer—wonder how long it'll stay this good before they slap a paywall back on?

WalterLee

April 20, 2025 at 7:43:55 AM EDT

ग्रॉक की कोडिंग क्षमता अद्भुत है! ऐसा लगता है जैसे मेरे पास एक सुपर स्मार्ट दोस्त है जो इंसानों से बेहतर कोड करता है। मैंने अपने टेस्ट से इसे चेक किया और यह सभी में पास हो गया, बिना किसी परेशानी के! बस काश यह कभी-कभी जल्दी जवाब देता। फिर भी, किसी भी कोडर के लिए जरूरी है! 🚀

JonathanKing

April 20, 2025 at 6:14:26 AM EDT

¡Las habilidades de codificación de Grok son increíbles! Es como tener un amigo superinteligente que programa mejor que la mayoría de las personas. Lo probé con mis tests y pasó todos sin problemas. Solo desearía que respondiera más rápido a veces. Aún así, esencial para cualquier programador! 🚀

BruceClark

April 19, 2025 at 7:37:49 AM EDT

Grokのコード能力は驚異的です！まるで人間のコードを超える友達がいるようです。自分のテストで試してみたら、全て完璧にこなしました。ただ、返事がもう少し早ければいいのに。でも、コーダーには必須のアプリですね！🚀