Date:

DeepSeek’s Coding Skills: Not All Doomed (Yet)

Here is the rewritten article:

DeepSeek: A Promising New AI Chatbot

DeepSeek, a new AI chatbot from China, has been making waves in the tech world. While it’s clear that the US government has concerns about Chinese companies, we’ll focus on DeepSeek’s abilities rather than politics. In this article, we’ll put both DeepSeek V3 and DeekSeek R1 through a series of tests to evaluate their programming skills.

Test 1: Writing a WordPress Plugin

This test is a classic, designed to evaluate the AI’s ability to write a WordPress plugin that sorts a list of names, separating duplicates and sorting the remaining names. Only about half of the AIs I’ve tested can fully pass this test, but DeepSeek V3 managed to create the user interface and program logic exactly as specified.

R1’s Unusual Approach

DeepSeek R1 took a different approach, generating 4502 words of analysis before sharing the code. The UI looked different, with wider input areas, but the logic worked as expected.

Test 2: Rewriting a String Function

In this test, we ask the AI to rewrite a routine that allows for both dollars and cents in a donation entry field. DeepSeek V3 generated code that worked, although it could be improved. The reasoning before generating the code in R1 was long and repetitive. Both models failed to explicitly round numbers, and R1 used JavaScript’s Number conversion without testing for edge cases.

Test Case

R1 presented a list of tests to validate against, but the failure to handle bad data inputs causes the code to crash.

Test 3: Finding an Annoying Bug

This test requires the AI to understand an error message, identify the incorrect answer, and find the actual bug. DeepSeek V3 and R1 both passed this test with similar answers.

Test 4: Writing a Script

In this challenging test, we ask the AI to understand the interplay between AppleScript, the Chrome object model, and a Mac scripting tool called Keyboard Maestro. Unfortunately, neither DeepSeek V3 nor R1 knew how to handle this task, making incorrect assumptions and failing to split the task between instructions.

Final Thoughts

DeepSeek seems to be overly loquacious in terms of the code it generates. Both models have their strengths and weaknesses, and we’re impressed with DeepSeek’s abilities despite being at the GPT-3.5 level. We hope to see continued improvement from the team.

FAQs:

Q: What is DeepSeek?
A: DeepSeek is a new AI chatbot from China.

Q: Is DeepSeek trustworthy?
A: We focus on DeepSeek’s abilities rather than politics, as the US government has concerns about Chinese companies.

Q: What are the key features of DeepSeek?
A: DeepSeek is open source, uses less infrastructure than other AI tools, and has a unique approach to coding.

Q: How did DeepSeek perform in the tests?
A: DeepSeek V3 performed well, passing three out of four tests, while R1 failed two tests and passed two others.

Q: Is DeepSeek a viable alternative to ChatGPT?
A: While DeepSeek has some impressive abilities, it’s clear that it has room for improvement, and ChatGPT is still the current leader in the field.

Latest stories

Read More

Docker Exercises: Part 1

Table of Contents Questions Create a Dockerfile that installs...

Chipmakers Qualcomm and Arm post sales rise on smartphone strength

Qualcomm and Arm Post Strong Quarterly Sales Growth Qualcomm and...

DeepSeek A.I. Is a Danger to Party Control

China's Ambitious AI Plan In 2017, China watched in awe...

AI Discovers Hidden Cancer Markers

AI Tool Finds Cancer Signs Missed by Doctors Pioneering Research...

Amazon Nova: Cost-Effective and Performant Cloud Computing Options

Security Teams Leverage Amazon Nova Micro to Automate Threat...

Elegoo Mercury Plus V3: Neater 3D Printing

Resin 3D Printing: The Elegoo Mercury V3 Wash and...

LEAVE A REPLY

Please enter your comment!
Please enter your name here