The Article You Are About to Read May Have Been Written by Artificial Intelligence

7 years ago

By Super Neuro
When AI can write brilliantly, do you feel the breath of a new era?

Here comes an AI that is better at programming than humans

Giving a beginning and asking the other person to write the rest of the story may be difficult for some people, but if we give it to AI, how good can they do?

Recently, OpenAI released an automatic text generation model that can write articles that are full of "realism".

With a human-generated beginning, this AI model can quickly complete the manuscript. As for the readability and fluency of the text, if you don’t tell it in advance, you might not guess that it was written by AI.

For example, give him a beginning like this:Scientists have made a shocking discovery: a herd of unicorns lives in a remote, untouched valley in the Andes Mountains. Even more surprising is that these unicorns speak perfect English.

The article generated by this AI model is as follows (partial):

These creatures have unique horns, so scientists named them Ovid's Unicorn. The silvery-white creature with four horns was unknown to the scientific community before.

…

Although the origins of these creatures are unclear, some believe they were the result of a human and a unicorn mating before human civilization existed. "Such phenomena are common in South America," said Professor Pérez.

…

If they are to be confirmed as descendants of a vanished race, DNA testing may be the only way.

In addition to being able to write realistic manuscripts, it also has the capabilities of reading comprehension, question and answer, generating article summaries, and translating texts.

Translation: From French to English

Dataset: WMT-14 Fr-En

Original sentence	Unforgettable experience with opération gratuite and avatar subie pour soigner with hernie lui permet travailler à nouveau.
Artificial	One man explained that the free hernia surgery he'd received will allow him to work again.
AI Translation	A man told me that the operation gratuity he had been promised would not allow him to travel.

This AI is a bit strong

This AI model is called GPT-2, which is an "upgraded version" of GPT. What's more brutal is that this time it uses more training data. The principle is the same as the previous version, but GPT-2 is a direct amplification of the GPT model. It is trained on 10 times more data and has 10 times more parameters.

By analyzing the input text, GPT-2 can perform basic text processing functions. It excels at language modeling tasks, which is the ability for the program to predict the next word in a sentence. Give it a title and the AI can write the rest of the article perfectly, even attaching fake quotes and statistics.

Someone said about it, "Want a short story? Just give it the first line and you'll get an unexpected and wonderful story. With the right prompts, it can even write a novel."

The goal of training GPT-2 is simple: given the previous words in the text, predict the next words. The diversity of the training data set allows it to generate text in a large number of different fields.

Although there is nothing new in terms of technology, people have received mining-level training, which is why they have created monster-level new tools.

OpenAI researchers said that GPT-2 has achieved excellent evaluation scores in language modeling tests on various domain-specific datasets. As a model that has not been specially trained on data in any field, its performance is better than those specially built models.

The era of the rise of NLP?

A few months ago, the language model BERT launched by Google attracted widespread attention in the industry. It was constantly on the screen for a while, and its achievement of breaking 11 records with 300 million parameters was praised by people. But the GPT-2 launched by OpenAI this time is even more deadly, with 1.5 billion parameters.

Compared to the previous state-of-the-art AI model, the GPT2 model is "12 times larger, with a 15 times larger dataset and a wider range." It was trained on a dataset of about 10 million articles, which were selected through news links that had more than 3 votes on Reddit. The text data used for training is as much as 40GB!

Before BERT swept all the top NLP (natural language processing) indicators, OpenAI's GTP was already among the top experts, and the amount of data trained by the newly released GPT-2 has directly brought this field to a new height.

With BERT and GPT-2, the road of NLP will definitely be prosperous. As for how to better benefit mankind, this is still a cautious topic.

Ani Kembhavi, a researcher at the Allen Institute for Artificial Intelligence, said one reason to be excited about GPT-2 is that predicting text can be thought of as a "super task" for computers, and once this challenge is solved, it will open the door to intelligence.

Could it be Pandora's box?

Unfortunately, such a powerful tool cannot be released for the time being. The consideration behind this is that it may bring hidden dangers, such as generating fake news, malicious comments, creating spam, etc. If such a weapon is used in illegal ways, the consequences will be disastrous.

Developers are also worried about this aspect. OpenAI researchers said they can't predict what will come. They are still exploring. For various reasons, they are very cautious about what they share about the project, and currently do not make the main basic code and training data public.

Another reason for caution, they point out, is that it could be dangerous if someone feeds GPT-2 racist, violent, misogynistic, or abusive text. After all, it relies on the internet for training.

There is no denying that this technology will bring about tremendous changes, but any tool, in the hands of someone with bad intentions, can have disastrous consequences.

Moreover, since the texts written by GPT-2 are newly generated and there is no problem of copying and pasting, it is more difficult to detect and check using previous detection methods, which will be a potential threat.

So, here comes the key question: was this article written by AI?

The Article You Are About to Read May Have Been Written by Artificial Intelligence

7 years ago

Headlines

Recommended List

By Super Neuro
When AI can write brilliantly, do you feel the breath of a new era?

Here comes an AI that is better at programming than humans

Giving a beginning and asking the other person to write the rest of the story may be difficult for some people, but if we give it to AI, how good can they do?

Recently, OpenAI released an automatic text generation model that can write articles that are full of "realism".

The article generated by this AI model is as follows (partial):

These creatures have unique horns, so scientists named them Ovid's Unicorn. The silvery-white creature with four horns was unknown to the scientific community before.

…

If they are to be confirmed as descendants of a vanished race, DNA testing may be the only way.

In addition to being able to write realistic manuscripts, it also has the capabilities of reading comprehension, question and answer, generating article summaries, and translating texts.

Translation: From French to English

Dataset: WMT-14 Fr-En

Original sentence	Unforgettable experience with opération gratuite and avatar subie pour soigner with hernie lui permet travailler à nouveau.
Artificial	One man explained that the free hernia surgery he'd received will allow him to work again.
AI Translation	A man told me that the operation gratuity he had been promised would not allow him to travel.

This AI is a bit strong

Someone said about it, "Want a short story? Just give it the first line and you'll get an unexpected and wonderful story. With the right prompts, it can even write a novel."

Although there is nothing new in terms of technology, people have received mining-level training, which is why they have created monster-level new tools.

The era of the rise of NLP?

With BERT and GPT-2, the road of NLP will definitely be prosperous. As for how to better benefit mankind, this is still a cautious topic.

Could it be Pandora's box?

Another reason for caution, they point out, is that it could be dangerous if someone feeds GPT-2 racist, violent, misogynistic, or abusive text. After all, it relies on the internet for training.

There is no denying that this technology will bring about tremendous changes, but any tool, in the hands of someone with bad intentions, can have disastrous consequences.

So, here comes the key question: was this article written by AI?

Command Palette

The Article You Are About to Read May Have Been Written by Artificial Intelligence

Here comes an AI that is better at programming than humans

This AI is a bit strong

The era of the rise of NLP?

Could it be Pandora's box?

Command Palette

The Article You Are About to Read May Have Been Written by Artificial Intelligence

Here comes an AI that is better at programming than humans

This AI is a bit strong

The era of the rise of NLP?

Could it be Pandora's box?

Related News

OpenAI Releases GeneBench-Pro, Which Assesses AI Research Capabilities Across 129 Questions and 10 domains.

Tutorial Summary | Open-source Small Models Achieve Overall Intelligence Comparable to GPT-5; one-stop Evaluation of Popular Models Such As Qwen 3.5/Gemma 4.

MIT/IBM Has Released ChartNet, the Largest Synthetic Chart Dataset to Date, Generating 1.5 Million Diverse Chart samples.

A Locally Runnable Privacy Detection Model: Privacy Filter Achieves high-quality PII Filtering at Low Cost; Hardcore Open Source! Covering the Transfermarkt Structured Football Dataset With Over 80,000 matches.

Paper Weekly Report | ProgramBench Enables AI to Write Software From Scratch, With 9 Major Models Failing En Masse; ExoActor Demonstrates Strong Scene Generalization Ability Without Additional real-world Data… A Quick Overview of the week's cutting-edge AI Papers

The National University of Singapore Proposes an AI-computational Chemistry Collaborative Process to Accelerate the Repositioning of Drugs for Diabetic Wound Healing, Reducing the R&D Cycle by Over 701 TP3T!

EnergAIzer, a GPU Power Estimation Framework Developed by MIT and Others, Completes Predictions in an Average of 1.8 Seconds With an Error of Approximately 81 TP3T.

In Just 30 Minutes, the Biological multi-agent Robin Successfully Integrated 550 Research Papers, Establishing an Autonomous Research Loop and Identifying dAMD Candidate therapies.

Paper Weekly Report | Microsoft MAI-Thinking Explores self-evolution of Pure RL, Achieving an AIME Accuracy of 97%; VLM³ Achieves 3D Task Generalization Using Plain Text Coordinates Without Architectural Modifications… A Quick Overview of the week's cutting-edge AI Papers

Command Palette

The Article You Are About to Read May Have Been Written by Artificial Intelligence

Here comes an AI that is better at programming than humans

This AI is a bit strong

The era of the rise of NLP?

Could it be Pandora's box?

Related News

OpenAI Releases GeneBench-Pro, Which Assesses AI Research Capabilities Across 129 Questions and 10 domains.

Tutorial Summary | Open-source Small Models Achieve Overall Intelligence Comparable to GPT-5; one-stop Evaluation of Popular Models Such As Qwen 3.5/Gemma 4.

MIT/IBM Has Released ChartNet, the Largest Synthetic Chart Dataset to Date, Generating 1.5 Million Diverse Chart samples.

A Locally Runnable Privacy Detection Model: Privacy Filter Achieves high-quality PII Filtering at Low Cost; Hardcore Open Source! Covering the Transfermarkt Structured Football Dataset With Over 80,000 matches.

Paper Weekly Report | ProgramBench Enables AI to Write Software From Scratch, With 9 Major Models Failing En Masse; ExoActor Demonstrates Strong Scene Generalization Ability Without Additional real-world Data… A Quick Overview of the week's cutting-edge AI Papers

The National University of Singapore Proposes an AI-computational Chemistry Collaborative Process to Accelerate the Repositioning of Drugs for Diabetic Wound Healing, Reducing the R&D Cycle by Over 701 TP3T!

EnergAIzer, a GPU Power Estimation Framework Developed by MIT and Others, Completes Predictions in an Average of 1.8 Seconds With an Error of Approximately 81 TP3T.

In Just 30 Minutes, the Biological multi-agent Robin Successfully Integrated 550 Research Papers, Establishing an Autonomous Research Loop and Identifying dAMD Candidate therapies.

Paper Weekly Report | Microsoft MAI-Thinking Explores self-evolution of Pure RL, Achieving an AIME Accuracy of 97%; VLM³ Achieves 3D Task Generalization Using Plain Text Coordinates Without Architectural Modifications… A Quick Overview of the week's cutting-edge AI Papers

Related News

OpenAI Releases GeneBench-Pro, Which Assesses AI Research Capabilities Across 129 Questions and 10 domains.

Tutorial Summary | Open-source Small Models Achieve Overall Intelligence Comparable to GPT-5; one-stop Evaluation of Popular Models Such As Qwen 3.5/Gemma 4.

MIT/IBM Has Released ChartNet, the Largest Synthetic Chart Dataset to Date, Generating 1.5 Million Diverse Chart samples.

A Locally Runnable Privacy Detection Model: Privacy Filter Achieves high-quality PII Filtering at Low Cost; Hardcore Open Source! Covering the Transfermarkt Structured Football Dataset With Over 80,000 matches.

Paper Weekly Report | ProgramBench Enables AI to Write Software From Scratch, With 9 Major Models Failing En Masse; ExoActor Demonstrates Strong Scene Generalization Ability Without Additional real-world Data… A Quick Overview of the week's cutting-edge AI Papers

The National University of Singapore Proposes an AI-computational Chemistry Collaborative Process to Accelerate the Repositioning of Drugs for Diabetic Wound Healing, Reducing the R&D Cycle by Over 701 TP3T!

EnergAIzer, a GPU Power Estimation Framework Developed by MIT and Others, Completes Predictions in an Average of 1.8 Seconds With an Error of Approximately 81 TP3T.

In Just 30 Minutes, the Biological multi-agent Robin Successfully Integrated 550 Research Papers, Establishing an Autonomous Research Loop and Identifying dAMD Candidate therapies.

Paper Weekly Report | Microsoft MAI-Thinking Explores self-evolution of Pure RL, Achieving an AIME Accuracy of 97%; VLM³ Achieves 3D Task Generalization Using Plain Text Coordinates Without Architectural Modifications… A Quick Overview of the week's cutting-edge AI Papers

Related News

OpenAI Releases GeneBench-Pro, Which Assesses AI Research Capabilities Across 129 Questions and 10 domains.

Tutorial Summary | Open-source Small Models Achieve Overall Intelligence Comparable to GPT-5; one-stop Evaluation of Popular Models Such As Qwen 3.5/Gemma 4.

MIT/IBM Has Released ChartNet, the Largest Synthetic Chart Dataset to Date, Generating 1.5 Million Diverse Chart samples.

A Locally Runnable Privacy Detection Model: Privacy Filter Achieves high-quality PII Filtering at Low Cost; Hardcore Open Source! Covering the Transfermarkt Structured Football Dataset With Over 80,000 matches.

Paper Weekly Report | ProgramBench Enables AI to Write Software From Scratch, With 9 Major Models Failing En Masse; ExoActor Demonstrates Strong Scene Generalization Ability Without Additional real-world Data… A Quick Overview of the week's cutting-edge AI Papers

The National University of Singapore Proposes an AI-computational Chemistry Collaborative Process to Accelerate the Repositioning of Drugs for Diabetic Wound Healing, Reducing the R&D Cycle by Over 701 TP3T!

EnergAIzer, a GPU Power Estimation Framework Developed by MIT and Others, Completes Predictions in an Average of 1.8 Seconds With an Error of Approximately 81 TP3T.

In Just 30 Minutes, the Biological multi-agent Robin Successfully Integrated 550 Research Papers, Establishing an Autonomous Research Loop and Identifying dAMD Candidate therapies.

Paper Weekly Report | Microsoft MAI-Thinking Explores self-evolution of Pure RL, Achieving an AIME Accuracy of 97%; VLM³ Achieves 3D Task Generalization Using Plain Text Coordinates Without Architectural Modifications… A Quick Overview of the week's cutting-edge AI Papers