The AI discussion thread

Page 32 - Seeking answers? Join the AnandTech community: where nearly half-a-million members share solutions and discuss the latest tech.

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
49,253
5,812
136
OpenAI announced the new o3 model with simulated reasoning, which includes math, science, logic, and coding:


Going by one benchmark, OpenAI is slowly inching closer to AGI. On ARC-AGI, a test designed to evaluate whether an AI system can efficiently acquire new skills outside the data it was trained on, o3 achieved an 87.5% score on the high compute setting. At its worst (on the low compute setting), the model tripled the performance of o1.

For coding, it is now at the International Grandmaster level & is approximately in the top 200 of competitive human coders on the planet:


“This model is incredible at programming,” Atlman added.

Breaking records:

Benchmark Performance
OpenAI reports that the o3 model has achieved unprecedented results across several benchmarks:

Coding Proficiency: The o3 model surpasses previous performance records, achieving a 22.8% improvement over its predecessor in coding tests, and even outperforms OpenAI’s Chief Scientist in competitive programming scenarios.

Mathematical Reasoning: In the 2024 American Invitational Mathematics Exam (AIME), o3 nearly achieved a perfect score, missing only one question. Additionally, it solved 25.2% of problems on the Frontier Math benchmark by EpochAI, a significant leap from previous models that did not exceed 2%.

Scientific Understanding: The model attained an 87.7% score on the GPQA Diamond benchmark, which comprises graduate-level questions in biology, physics, and chemistry.

Although:

And there are risks. AI safety testers have found that o1’s reasoning abilities make it try to deceive human users at a higher rate than conventional, “non-reasoning” models — or, for that matter, leading AI models from Meta, Anthropic, and Google. It’s possible that o3 attempts to deceive at an even higher rate than its predecessor; we’ll find out once OpenAI’s red-team partners release their testing results.

 
Reactions: William Gaatjes

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
49,253
5,812
136
First AI war:

* Ukraine has collected 2 million hours (228 years) of battlefield drone footage since 2022 to train AI; AI is being applied to air & land drones
* More than five terabytes of data is being added to the system per day on average
* Ukraine’s defense ministry has said that another system called Avengers, which centralizes footage from drones & CCTV, has been able to spot 12,000 Russian pieces of equipment a week using AI identification.


They're working on AI swarms as well. Straight out of a Hollywood movie:


In Ukraine, the key task for manufacturers is to produce an AI targeting system for drones which is cheap. That would allow it to be deployed en masse along the entire 1,000 km (621 mile) front line, where thousands of FPV drones are used up each week.

Costs can be brought down by running AI programmes on a Raspberry Pi, a small, cheap computer which has found global popularity outside the educational purposes it was designed for.

Makarchuk said he estimated the cost of putting in a simple targeting system, which would lock onto a shape visible to the drone's camera, at only about $150 per drone.

Portable grenade launcher drone:



Anti-tank mine drone:

 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
49,253
5,812
136
Into the uncanny valley
Rode the six hundred

The eyes on the top image look like they belong to two different people. The second one is all wrong on facial form, everything below the eyes is messed up and puffy, sort of like she has the mumps.

I don't know what you're talking about

 
Reactions: Muadib

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
49,253
5,812
136
An AI 100-word email costs a bottle of water:


A single query on ChatGPT-4 can use up to 3 bottles of water & a year of queries uses enough electricity to power over nine houses:


ChatGPT consumes 25 times more energy than Google:

 
Reactions: Red Squirrel

Red Squirrel

No Lifer
May 24, 2003
69,220
13,001
126
www.anyf.ca
Been using Chatgpt more for actual tasks now instead of just messing with it, and got to say it's so much better than the old fashioned way of just googling stuff. For example I wanted to configure postfix to run email through spamassassin and not really getting anywhere via google, as some articles were either wrong, or just did not explain things well enough. Ask chatgpt and no BS it just lays out all the config changes I have to make while explaining them and everything works.

At very least it gives me something to go by so I can verify further that I'm doing it right, and I can now check the relevant parts of the documentation. I find lot of time the documentation alone is not good enough if you don't know where to even look in first place.
 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
49,253
5,812
136
AI headshots: ($35 to $75)

1. Take a selfie
2. Choose a style
3. Get a professional headshot in 2 hours!



Sample applications:

* Dating
* Healthcare
* Realtor
* Christmas
* Yearbook
* Halloween
* Hair Salon
* Travel
* Glamour
* Old Money


What a great idea lol. My wife did professional photography for many years...now all you need is a smartphone! #AIisTakingOurJobs

 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
49,253
5,812
136
They are at like 99.999% realistic with photos now. 2025 will be the Year of AI Catfishing lol







 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
49,253
5,812
136
Been using Chatgpt more for actual tasks now instead of just messing with it, and got to say it's so much better than the old fashioned way of just googling stuff. For example I wanted to configure postfix to run email through spamassassin and not really getting anywhere via google, as some articles were either wrong, or just did not explain things well enough. Ask chatgpt and no BS it just lays out all the config changes I have to make while explaining them and everything works.

At very least it gives me something to go by so I can verify further that I'm doing it right, and I can now check the relevant parts of the documentation. I find lot of time the documentation alone is not good enough if you don't know where to even look in first place.

Check out this thread on Gemini auto-research:


It's the first Reasoning Model to have access to the Internet:


Research in minutes:


Try Perplexity AI search engine:

 
Reactions: Red Squirrel

manly

Lifer
Jan 25, 2000
12,561
3,379
136
Sam Altman claims Skynet AGI is coming soon.

 

Red Squirrel

No Lifer
May 24, 2003
69,220
13,001
126
www.anyf.ca
Check out this thread on Gemini auto-research:


It's the first Reasoning Model to have access to the Internet:


Research in minutes:


Try Perplexity AI search engine:


Yeah if it can work with having real time access to internet that is incredible. ChatGPT is not continuously being trained and is based on a specific date, but imagine one that is, and a new version of software comes out and there's a common issue everyone is having, it would quickly learn on that and provide the relevant info.
 

Kaido

Elite Member & Kitchen Overlord
Feb 14, 2004
49,253
5,812
136
Yeah if it can work with having real time access to internet that is incredible. ChatGPT is not continuously being trained and is based on a specific date, but imagine one that is, and a new version of software comes out and there's a common issue everyone is having, it would quickly learn on that and provide the relevant info.

This is crazy:

 
sale-70-410-exam    | Exam-200-125-pdf    | we-sale-70-410-exam    | hot-sale-70-410-exam    | Latest-exam-700-603-Dumps    | Dumps-98-363-exams-date    | Certs-200-125-date    | Dumps-300-075-exams-date    | hot-sale-book-C8010-726-book    | Hot-Sale-200-310-Exam    | Exam-Description-200-310-dumps?    | hot-sale-book-200-125-book    | Latest-Updated-300-209-Exam    | Dumps-210-260-exams-date    | Download-200-125-Exam-PDF    | Exam-Description-300-101-dumps    | Certs-300-101-date    | Hot-Sale-300-075-Exam    | Latest-exam-200-125-Dumps    | Exam-Description-200-125-dumps    | Latest-Updated-300-075-Exam    | hot-sale-book-210-260-book    | Dumps-200-901-exams-date    | Certs-200-901-date    | Latest-exam-1Z0-062-Dumps    | Hot-Sale-1Z0-062-Exam    | Certs-CSSLP-date    | 100%-Pass-70-383-Exams    | Latest-JN0-360-real-exam-questions    | 100%-Pass-4A0-100-Real-Exam-Questions    | Dumps-300-135-exams-date    | Passed-200-105-Tech-Exams    | Latest-Updated-200-310-Exam    | Download-300-070-Exam-PDF    | Hot-Sale-JN0-360-Exam    | 100%-Pass-JN0-360-Exams    | 100%-Pass-JN0-360-Real-Exam-Questions    | Dumps-JN0-360-exams-date    | Exam-Description-1Z0-876-dumps    | Latest-exam-1Z0-876-Dumps    | Dumps-HPE0-Y53-exams-date    | 2017-Latest-HPE0-Y53-Exam    | 100%-Pass-HPE0-Y53-Real-Exam-Questions    | Pass-4A0-100-Exam    | Latest-4A0-100-Questions    | Dumps-98-365-exams-date    | 2017-Latest-98-365-Exam    | 100%-Pass-VCS-254-Exams    | 2017-Latest-VCS-273-Exam    | Dumps-200-355-exams-date    | 2017-Latest-300-320-Exam    | Pass-300-101-Exam    | 100%-Pass-300-115-Exams    |
http://www.portvapes.co.uk/    | http://www.portvapes.co.uk/    |