News


From the editor's desk: Intelligence is needed to train AI

22 November 2023 News


Peter Howells, Editor

As I sit in front of my computer and ponder the year that is swiftly coming to a close, I again marvel at everything that has happened. It has definitely been a busy year. There have been plenty of announcements in the tech space, flooding my inbox, especially when it comes to artificial intelligence and machine learning.

No matter who you speak to, they will have heard something about AI and the effects it will have on our everyday life. And they all have their own opinions too. Many of them, however, are not based on any sort of scientific fact.

One opinion piece that did cross my desk this month was an article on training of AI models. Many people do not realise that AI is trained on human-generated information. This mostly comes from information that already exists on the internet in some form: articles that have been written, responses to questions, shopping habits, browsing history; pretty much anything is fair game for use in the training of AI models.

But there is a finite amount of non-repetitive information that exists. Let me clarify what I mean by this. Much of the enormous amount of information created each day is a copy of data/information that already exists. People reposting text, images and other multimedia, copying of other data from one platform to another; all this counts towards the total amount of information created.

Yes, there is still a massive amount of information available for training, but AI models are becoming more and more powerful, and are able to be trained on larger volumes of data. This year, even though the total amount of data generated globally was in the order of 120 zettabytes, much of this cannot be used for training models. ChatGPT was trained on 570 gigabytes of data, which amounted to around 300 million words. The more data used to train these AI models, the more accurate the models’ responses will become.

And this is where the concern starts to kick in for AI researchers; the volume of datasets needed to train AI models is growing much more rapidly than the growth of online data stocks. If the current training trend continues, in a paper published in 2022 it was predicted that we will run out of high-quality data before 2026. If models then turn to the remaining low-quality data, this will also be exhausted, sometime between 2030 and 2050.

But do we really want to train our models on low-quality data? We all know the bad decisions that can be made when only poor-quality data is available. After all, the internet is full of examples of ‘average’ people doing stupid things based on lack of insight or forethought. Do we really want our artificial intelligences to be only as smart as the average person?

One hope is that newer AI models will have a lower data overhead, that is, to be able to be trained suitably well using less data than their predecessors. I believe this would be similar to how many people get to conclusions nowadays – they are able to make quite reasonable decisions even when they do not know everything about a subject.

The one overriding thing I have taken away from all this talk about AI during this year is that we are certainly all living in an interesting and exciting era, even if it can be quite concerning at times.

To all our readers I would like to take this opportunity to wish you all a joyous and restful season. May your new year be filled with new goals, new achievements and above all, happiness.


Credit(s)



Share this article:
Share via emailShare via LinkedInPrint this page

Further reading:

From the editor's desk: AI – a double-edged sword
Technews Publishing News
As with any powerful tool, AI presents challenges, some of which, if not carefully managed, threaten to undo the potential that it can offer.

Read more...
Global semiconductor sales increase
News
The Semiconductor Industry Association (SIA) has announced global semiconductor sales were $57,0 billion during the month of April 2025, an increase of 2,5% compared to the March 2025.

Read more...
Avnet Abacus announced new president
Avnet Abacus News
Avnet Abacus has announced that Mario Merino will succeed Rudy Van Parijs as president of Avnet Abacus, effective 1 July 2025.

Read more...
Avnet Abacus wins multiple prestigious awards
Avnet Abacus News
The awards from Molex recognise outstanding performance, collaboration, and significant growth in the challenging market conditions of 2024.

Read more...
From the editor's desk: Is the current AI really what we want?
Technews Publishing Editor's Choice
The companies that develop LLMs need to change direction and concentrate on freeing up our time, not so that we can have more time to do the tasks we don’t want to do in the first place, but rather to allow us more time to do what we love.

Read more...
Components distribution slowdown Q1 2025
News
European components distribution (DMASS) experienced a continued slowdown in the first quarter 2025.

Read more...
Semiconductor sales increase 17% YoY
News
The Semiconductor Industry Association (SIA) recently announced global semiconductor sales were $54,9 billion during the month of February 2025, an increase of 17,1% compared to the February 2024 total.

Read more...
Silicon Labs – Q1 results
News
Silicon Labs, a leading innovator in low-power wireless, recently reported financial results for the first quarter, which ended April 5, 2025.

Read more...
Strengthening industry through strategic partnerships at KITE 2025
Specialised Exhibitions News
The KwaZulu-Natal Industrial Technology Exhibition is not just an exhibition, it is a powerhouse of industry collaboration where visitors and exhibitors gain access to authoritative insights, technical expertise, and high-impact networking opportunities.

Read more...
Solar Youth Project calls on industry to step up
News
With the second cohort completed training and the first cohort returning for their final module, host companies are urgently needed to turn the training into a long-term opportunity.

Read more...









While every effort has been made to ensure the accuracy of the information contained herein, the publisher and its agents cannot be held responsible for any errors contained, or any loss incurred as a result. Articles published do not necessarily reflect the views of the publishers. The editor reserves the right to alter or cut copy. Articles submitted are deemed to have been cleared for publication. Advertisements and company contact details are published as provided by the advertiser. Technews Publishing (Pty) Ltd cannot be held responsible for the accuracy or veracity of supplied material.




© Technews Publishing (Pty) Ltd | All Rights Reserved