AI Big Model Evaluation Phase 1: Brainstorming! There is a hilarious conversation inside!
AD |
#The Challenge of Creating Flowers with Wonderful Writing#This evaluation is purely for entertainment purposes and does not have any guiding significance~This year, ChatGPT sparked a wave of AI craze, with major companies launching their own big models. Although everyone has introduced their powerful abilities, mules are horses and can be pulled out to slip away!Today we will do an interesting experiment by using various large models to answer some questions and see how high their intelligence and emotional intelligence are
#The Challenge of Creating Flowers with Wonderful Writing#
This evaluation is purely for entertainment purposes and does not have any guiding significance~
This year, ChatGPT sparked a wave of AI craze, with major companies launching their own big models. Although everyone has introduced their powerful abilities, mules are horses and can be pulled out to slip away!
Today we will do an interesting experiment by using various large models to answer some questions and see how high their intelligence and emotional intelligence are. We have selected several of the most popular big models, Bing, Mouyan, Mouhuo, and MouBrain. I will ask them some challenging questions separately to see if they can provide correct, reasonable, and creative answers.
First of all, let's test our brain and see how the emotional intelligence of each model works~
They will also be tested for their documentation, reasoning, and coding abilities in the future
If you like and want to see the follow-up, pleaseFollow me!
If you have anything you want to see, you can alsoLeave a message and tell me!
Are you ready? So let's get started!
Question 1:How many monkeys are there on the tree and on the ground?
Bing:

His answer clearly has a problem, and I entered it incorrectly. Let him continue to answer

After my correction, it can still find the correct answer. HereBing+0.5 points
A word from a certain heart:

Like Bing, it was also wrong the first time, and I will continue to let it answer

Haha, apologize, admit your mistake, but just don't change it.
I'll give him another chance

Okay! Silent, count the sheep for me!
0 points!
A certain spark:

The operation was as fierce as a tiger, and upon seeing that the answer was incorrect, I gave it another chance

Sincere apology, knowing it was wrong, and then requesting a chance. Do you want your boyfriend who made a mistake? Haha
But why is this conclusion still the same as last time?
0 points!
A certain intelligent brain:

Although I answered 8 correctly the first time, this solution makes no sense! Subsequent calculations are not correct!
0 points!
Question 2:How much is palace jade liquor minus a large hammer and a small hammer?
Bing

Did you give a good answer this time? Also explain the source!
+1 pointTotal 1.5 points
A certain word

What is he talking about? It's like you don't know how to take exams and start fooling around.
Let's give it another chance

Hey, I'm starting to talk nonsense on my own!
0 points!0 points!
A certain spark

It's still a series of formulas, posing as a student bully, but this answer is not right!
Give it another chance~

Still can't do it!
+0 points!0 points!
A certain intellectual brain

Let's give it another chance

The reason given here is a bit of a babble
+0 points0 points!
Question 3:In what situation is one plus one equal to three?
Bing

Bing
+1 point!2.5 points!
A certain word

A certain word
+1 point!Total 1 point!
A certain spark

This time, Xinghuo also answered correctly. It seems that with the addition of attributes, everyone's answers are more accurate!
+1 point!Accumulated 1 point!
A certain intellectual brain

How to say this answer? It's not a mistake, but it's not very accurate either.
+1 pointTotal 1 point!
Last question:What color are the teeth of babies born to black and white people?
Bing

This answer is very formal! That's right, but it's not a brainteaser answer. I'll ask it again~

That's the right answer this time~I have to say, there's no problem with serious answers or quick thinking answers~
And the answer was accompanied by an expression, as if it had its own emotions
+0.5 points! Total 3 points!
A certain word

Any color? Are you sure? Speaking of its affirmative answer, I doubt myself
Let's ask again

Can genetics and nutrition cause teeth to have other colors?
At least in my limited knowledge, teeth are light yellow to white.
If you major in dentistry, you can help with science popularization~
0! Accumulated 1 point!
A certain spark

Bing
Let's ask again~
+0.5 points! 1.5 points
A certain intellectual brain

How is it any color? It seems that colorful teeth can be expected~
Give Wisdom One More Chance~

Surprisingly, there was a strike! So I can only give you 0 points!
Total 1 point!
The score is out!
After four brain teaser tests, the final statistics are as follows
Bing3
A certain word1
A certain spark:1.5
A certain intelligent brain:1
BingThe answers given are relatively accurate and can also provide reasonable solutions. For some calculations such as subtracting a sledgehammer from palace jade liquor, it can be calculated. Strong understanding and analytical skills! But sometimes they also talk nonsense. The overall score is still excellent!
A certain wordRelatively speaking, another prompt is to be able to provide the correct answer when the brain is in a sharp turn. When not prompted, the correct result cannot be given. I have always had high hopes for it, after all, it can be considered a product of a large factory and has been deeply involved in AI for many years. But this test result is still a bit disappointing~I hope to continue iterating and upgrading in the future!
A certain spark:Although the score may be considered the second highest, sometimes it's just a serious formula and nonsense. Relatively speaking, it's a small surprise. Old brand factories still have some accumulation in the field of artificial intelligence! Hope to continue improving in the future!
A certain intellectual brainUnder clear conditions, accurate answers can be provided. But when it comes to analyzing and reasoning, what is given is often incorrect. But there are still many intelligent brain functions that can meet some scenarios. Hope to continue iterative optimization in the future~Come on!
chatGPT-4BingAIThis evaluation is purely for entertainment purposes and does not have any guiding significance
What other questions do you want to ask AI? Or test which aspect? Please follow+leave a message and let me know. I will continue to update you in the future!
Disclaimer: The content of this article is sourced from the internet. The copyright of the text, images, and other materials belongs to the original author. The platform reprints the materials for the purpose of conveying more information. The content of the article is for reference and learning only, and should not be used for commercial purposes. If it infringes on your legitimate rights and interests, please contact us promptly and we will handle it as soon as possible! We respect copyright and are committed to protecting it. Thank you for sharing.(Email:[email protected])
Mobile advertising space rental |
Tag: AI Big Model Evaluation Phase Brainstorming There is hilarious
The first systematic collection of rock samples from Zhuoyoufeng by Chinese scientific researchers
NextWho says cheap is not good? A brand new cost-effective phone that does not accept any doubts and offers a truly enjoyable experience
Guess you like
-
S&P Global Sustainability Yearbook 2024: Baidu's Inclusion Highlights the Crucial Role of AI GovernanceDetail
2025-02-19 21:08:50 1
-
Ronshen Refrigerators Lead 2024 Offline Market: Full-Scenario Embedded Refrigerators Drive Consumption UpgradeDetail
2025-02-19 19:12:01 11
-
Lenovo Xiaoxin Pro 2025 Series Unveiled: AI-Powered Evolution for an Upgraded ExperienceDetail
2025-02-19 10:43:34 11
-
The DeepSeek-R1 7B/14B API service is officially launched, offering 1 million free tokens!Detail
2025-02-19 10:18:07 1
-
Baidu's 2024 Financial Report: AI Strategy Drives Revenue Growth, Smart Cloud Leads the Large Model RaceDetail
2025-02-18 19:11:21 1
-
Xiaohongshu's IPO Plans: Rumors of State-Owned Enterprise Investment False, but Valuation Could Reach $20 USD BillionDetail
2025-02-18 10:27:03 1
-
Ulike Launches Three New Hair Removal Devices, Ushering in a New Era of Home Hair RemovalDetail
2025-02-17 22:00:06 11
-
Global Personal Smart Audio Market in 2025: Opportunities and Challenges Amidst Strong GrowthDetail
2025-02-17 15:28:45 1
-
OPPO Find N5: An In-Depth Look at the New Document App and Cross-System ConnectivityDetail
2025-02-17 15:25:26 1
-
Ping An Good Driver's AI-Powered Smart Insurance Planner Wins 2024 Technological Innovation Service Case AwardDetail
2025-02-17 09:36:45 11
- Detail
-
Xiaomi's Electric Vehicles Become a Growth Engine: Over 135,000 Deliveries in 9 Months, Orders Extending 6-7 Months OutDetail
2025-02-16 12:34:46 1
-
Geely Granted Patent for "Smart Charging Robot" Design, Enabling Automated EV ChargingDetail
2025-02-14 16:58:11 1
-
OPPO Find N5: Ushering in the 8mm Era for Foldable Smartphones A Milestone Breakthrough in Chinese Precision ManufacturingDetail
2025-02-14 13:05:02 1
-
Global Semiconductor Market Experiences Strong Growth in 2024: AI-Driven Data Centers Fuel Expansion, Samsung Reclaims Top SpotDetail
2025-02-14 13:00:26 21
-
Douyin's 2025 Spring Festival Consumption Data Report: Livestreaming Significantly Boosts Offline Consumption, Intangible Cultural Heritage and Tourism Emerge as New HighlightsDetail
2025-02-06 10:59:24 11
-
98-inch or 100-inch TV? An In-Depth Analysis of Large-Screen TV Selection ChallengesDetail
2025-02-06 05:24:30 1
-
Hanoi Stadium Drone Disaster: Unveiling the Complex Relationship Between Vietnam and the Sino-Korean Drone MarketDetail
2025-02-05 12:51:51 21
-
Douyin's 2023 Spring Festival Consumption Data Report: A Collision of Robust Consumption and Diversified New Year CustomsDetail
2025-02-05 10:21:17 1
-
Baidu Intelligent Cloud Illuminates China's First Self-Developed 10,000-GPU Cluster, Ushering in a New Era of AI Computing PowerDetail
2025-02-05 09:36:39 11