That would be GPT-3, which is short form for Generative Pre-Trained Transformer 3. It is the third in its series, developed by the Elon Musk co-founded (among others) start-up, OpenAI. It is a Natural Language Processing (NLP) model based on the transformer models introduced in 2017 (quite recently). It uses neural networks and applies deep learning algorithms to write coherent and sensible human text (multiple languages). Well, it goes beyond just text. It is capable of more generalized, second-order applications such as writing programs that create websites, write blog articles, generate images using text etc. It potentially has the output quality to fool human beings (at least most of them) into believing that the text was human-written, provided that the conditions were set right. If you don’t believe this, have a look at this blog article by The Guardian. Spoiler alert: The article was written by none other than GPT-3.
Why is GPT-3 key to artificial intelligence?
For a transformer model to be task agnostic and require minimal learning-iterations (known as few-shot learning) at this scale is unheard of. I’ve been researching the scientific paper published by the development team. GPT-3 has achieved this by not reinventing the base technology, but by scaling and tuning the data-sets that were used to train the base algorithm. Put in simple terms, it uses current state of the art methods, but just went bananas-brute-force with the scale of data used. To compare, the latest and biggest Microsoft NLP model, the Turing-NLG, released in February 2020 uses about 17-billion parameters (these can be imagined as weighted neurons). In comparison, GPT-3 was fed with both open internet data, and specifically chosen data-sets that amounted to 175 billion parameters. The training data sets are estimated at a cost of around 5 million US dollars (USD), and the running costs of the model are estimated at 87000 USD on a yearly basis.
The Good
GPT-3 simply has such a wide array of applications. It is the most holistic statistical model of the human language we have yet. If there is something that has been recorded in the human language on the internet, GPT-3 has a good chance of knowing it. And the more frequently it has occurred, the better the chance of GPT-3 retaining it. We have the possibility to benefit directly from this, i.e., thousands of cumulative years of human text. I would define this as augmented intelligence, and not artificial intelligence. Regardless of how we choose to call this, it means that we may be able to generate program codes without being programmers, or generate images based out of descriptions without being forensic experts, and so on.
From my foraging into the scientific paper, what I understand is that GPT-3 functions like a grandiose auto-complete algorithm. From all of the vastness of its datasets, and more precisely, the neural network it has retained from the datasets, it computes the conditional probability of the text that accomplishes the task requested of it. This simply means that GPT-3’s world is just a whole big bunch of strings that have no inherent meaning to the code. All it does is pattern match, and gets continually better at it. Before we bash this approach, I’d like to share that I learnt a foreign language in recent years (German). I intuitively followed a similar approach. Therefore, as rude as it sounds, GPT-3’s mimic and pattern match approach is remarkably similar to humans. However, beyond pattern matching, GPT-3 has no causal logic or reasoning skills, which is precisely why I refuse to call it artificial intelligence. Again, this is based on my understanding of the scientific paper. I have no access to the source code yet (that’s apparently worth a billion dollars). Anyway, since it doesn’t do any reasoning, it leads to some strange results like the image shown from twitter. Without reasoning logic, by changing the weights of parameters or groups of parameters, it is possible to generate completely authentic-looking, and coherent text that is manipulative, and potentially false. Therefore, in this sense, GPT-3 is outright dangerous.
The Ugly
Since OpenAI know that GPT-3 is dangerous, they chose not to release the source code. Their argument was that since GPT-3 requires a lot of resources to run, the group of people who are most likely to benefit from the code are big organisations, and they have an incentive to manipulate their target audience. OpenAI still plans to make the API available to normal users like you and me, for a small fee. Using the API, one can request GPT-3 to perform a task, and get the output. I have been on the waiting list for the API for months now. If anyone from the dev. team is reading is, I’m right here guys! This sounds like a genuinely ethical move from OpenAI, right?
Except, OpenAI announced last month, that Microsoft (also part of the original supporting group) would get exclusive rights to license the source code for a meagre fee of 1 billion USD. As the name suggests, OpenAI was originally meant to be about open source code. They have published the source code for all of their work until this point. For them to taken a U-turn now, and function as a for-profit organisation seems suspect to say the least. Besides, let’s give exclusive access to such a powerful model to one single organisation. What could possibly go wrong? For instance, imagine what the possibilities are, if a biased, causal, reasoning model is paired with GPT-3. What could possibly go wrong?
I’d love to hear your thoughts on this in the comments section below.
I hope you found this article interesting and useful. If you’d like to get notified when interesting content gets published here, consider subscribing.
We use cookies on our website to give you the most relevant experience by remembering your preferences and repeat visits. By clicking “Accept”, you consent to the use of ALL the cookies.
This website uses cookies to improve your experience while you navigate through the website. Out of these, the cookies that are categorized as necessary are stored on your browser as they are essential for the working of basic functionalities of the website. We also use third-party cookies that help us analyze and understand how you use this website. These cookies will be stored in your browser only with your consent. You also have the option to opt-out of these cookies. But opting out of some of these cookies may affect your browsing experience.
Necessary cookies are absolutely essential for the website to function properly. These cookies ensure basic functionalities and security features of the website, anonymously.
Cookie
Duration
Description
cookielawinfo-checkbox-advertisement
1 year
Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional
11 months
The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance
11 months
This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
CookieLawInfoConsent
1 year
Records the default button state of the corresponding category & the status of CCPA. It works only in coordination with the primary cookie.
viewed_cookie_policy
11 months
The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
Functional cookies help to perform certain functionalities like sharing the content of the website on social media platforms, collect feedbacks, and other third-party features.
Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.
Cookie
Duration
Description
_gat
1 minute
This cookie is installed by Google Universal Analytics to restrain request rate and thus limit the collection of data on high traffic sites.
Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics the number of visitors, bounce rate, traffic source, etc.
Cookie
Duration
Description
__gads
1 year 24 days
The __gads cookie, set by Google, is stored under DoubleClick domain and tracks the number of times users see an advert, measures the success of the campaign and calculates its revenue. This cookie can only be read from the domain they are set on and will not track any data while browsing through other sites.
_ga
2 years
The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_ga_R5WSNS3HKS
2 years
This cookie is installed by Google Analytics.
_gat_gtag_UA_131795354_1
1 minute
Set by Google to distinguish users.
_gid
1 day
Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
CONSENT
2 years
YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
Advertisement cookies are used to provide visitors with relevant ads and marketing campaigns. These cookies track visitors across websites and collect information to provide customized ads.
Cookie
Duration
Description
IDE
1 year 24 days
Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
test_cookie
15 minutes
The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE
5 months 27 days
A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC
session
YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices
never
YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id
never
YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
Comments