/ An inside look at the business of digital content
Can machine learning detect “fake news” ?
March 6, 2017 | By Francesco Marconi, Strategy Manager—The Associated Press@fpmarconiWhile artificial intelligence can help newsrooms build a consistent fake news detector, it can also empower others to disseminate and even create new forms of misinformation.
Fake news is nothing new. The Roman Emperor Augustus led a campaign of misinformation against Mark Antony, a rival politician and general. The KGB used disinformation throughout the Cold War to enhance its political standing. Today fake news continues to serve as a political tool around the world, and new technologies are enabling individuals to propagate that fake news at unprecedented rates.
One of those new developments, artificial intelligence, can help journalists build a consistent fake news detector, but AI can also empower others to disseminate and even create new forms of misinformation. To understand how, we need to take a quick detour and explain machine learning — one of the most important sub-domains of artificial intelligence
Detecting Fake News with Machine Learning
Machine learning is, in the most basic sense, a system that learns from its actions and makes decisions accordingly, and it relies in turn on a process called deep learning which breaks down a single complex idea into a series of smaller, more approachable tasks.
Thus, conceptually, machine learning can help detect fake news! An intelligent system that takes news stories as its input and a big ol’ ‘Fake’ or ‘Not Fake’ sticker as output.
Machine learning (and deep-learning) relies mostly on algorithms, a set of rules that when followed leads to a desired output. But constructing algorithms is exceptionally difficult and the results can be catastrophic, especially when we rely on them to determine what news stories should be broadcast to our readers.
Algorithms Make Mistakes
The two most common errors in this sort of machine learning are terms that we borrow from statisticians — Type I (false negative) and Type II (false positive) errors.
A false negative would mean that your machine labels a fake news item as not fake. We don’t want that.
A false positive means your machine labels a real news story as fake. We don’t want that, either.
What we want is a system that can, with a high level of accuracy, label fake news as being fake, and real stories as not being fake. Again, we ask: How?
Using news articles to train the AI
AI machines can best make decisions like what is and is not fake news when you define fake and real news — you do so by showing the machine tens of thousands of examples of each.
For that, you will need a data set of high quality journalism, as well as another collection with a sample of fake news, which could be sourced from a predetermined list of fake news.
The AI Journalist
Remember, algorithms are written by humans, and humans make errors. Therefore, our AI machine may well make an error, especially in its early stages.
There’s also an editorial decision to be made here. No system is going to be 100 percent accurate, so which would you rather tend towards, false positives or false negatives? Would you rather a fake story be labeled as real or would you rather have a real story labeled fake?
Algorithmic errors aside, AI can help detect fake news. But as we mentioned earlier AI can also help disseminate it (all the more reason to understand AI, then!)
A New Wave of “Fake-news”?
If you’ve worked in the journalism industry long enough you’ve probably been fooled by a doctored video, photo or sound bite. And every day the technology available to produce those fake news items is becoming easier to use and more publicly accessible. For instance, Adobe recently announced an AI project that is able to replicate the same tone of voice by simply analyzing a sample of a speech, while a project developed by Stanford University researchers enables the manipulation of someone’s face in a video in real time.
In other words, the same sorts of machine learning and sub-domains of AI that can be used to fight fake news can also be used by others to propagate new types of misinformation.
The conclusion: Journalists need to understand AI.