Meta wants to supercharge Wikipedia with an AI upgrade

Wikipedia has a problem. And Meta, the not-too-long-ago rebranded Facebook, may just have the answer. Let’s back up.

Wikipedia is one of the largest-scale collaborative projects in human history, with more than 100,000 volunteer human editors contributing to the construction and maintenance of a mind-bogglingly large, multi-language encyclopedia consisting of millions of articles. Upward of 17,000 new articles are added to Wikipedia each month, while tweaks and modifications are continuously made to its existing corpus of articles. The most popular Wiki articles have been edited thousands of times, reflecting the very latest research, insights, and up-to-the-minute information.

The challenge, of course, is accuracy. The very existence of Wikipedia is proof positive that large numbers of humans can come together to create something positive. But in order to be genuinely useful and not a sprawling graffiti wall of unsubstantiated claims, Wikipedia articles must be backed up by facts.

This is where citations come in. The idea – and for the most part this works very well – is that Wikipedia users and editors alike can confirm facts by adding or clicking hyperlinks that track statements back to their source.

Citation needed

Say, for example, I want to confirm the entry on President Barack Obama’s Wikipedia article stating that Obama traveled to Europe and then Kenya in 1988, where he met many of his paternal relatives for the first time. All I have to do is to look at the citations for the sentence and, sure enough, there are three separate book references that seemingly confirm that the fact checks out.

By contrast, “citation needed” are probably the two most damning words in all of Wikipedia, precisely because they suggest that there’s no evidence that the author didn’t conjure the words out of the digital ether. Affixing “citation needed” to a Wikipedia claim is the equivalent of telling someone a fact while making finger quotes in the air.

Citations don’t tell us everything, though. If I were to tell you that, last year, I was the 23rd highest-earning tech journalist in the world and that I once gave up a lucrative modeling career to write articles for Digital Trends, it would appear superficially plausible because there are hyperlinks to support my delusions.

The fact that the hyperlinks don’t support my alternative facts at all, but rather lead to unrelated pages on Digital Trends, is only revealed when you click them. The 99.9 percent of readers who have never met me might leave this article with a slew of false impressions, not the least of which is the surprisingly low barrier to entry to the world of modeling. In a hyperlinked world of information overload, in which we increasingly splash around in what Nicholas Carr refers to as “The Shallows,” the mere existence of citations appears to be a factual endorsement.

Meta wades in

But what happens when Wikipedia editors add citations that don’t link to pages that actually support the claims?

As an illustration, a recent Wikipedia article on Blackfeet Tribe member Joe Hipp described how Hipp was the first Native American boxer to challenge for the WBA World Heavyweight title and linked to what seemed to be an appropriate webpage. However, the webpage in question mentioned neither boxing nor Joe Hipp. In the case of the Joe Hipp claim, the Wikipedia factoid was accurate, even if the citation was inappropriate.

Nonetheless, it’s easy to see how this could be used, either deliberately or otherwise, to spread misinformation.

It’s here that Meta thinks that it’s come up with a way to help. Working with the Wikimedia Foundation, Meta AI (that’s the AI research and development lab for the social media giant) has developed what it claims is the first machine learning model able to automatically scan hundreds of thousands of citations at once to check if they support the corresponding claims. While this would be far from the first bot Wikipedia uses, it could be among the most impressive.

“I think we were driven by curiosity at the end of the day,” Fabio Petroni, research tech lead manager for the FAIR (Fundamental AI Research) team of Meta AI, told Digital Trends. “We wanted to see what was the limit of this technology. We were absolutely not sure if [this AI] could do anything meaningful in this context. No one had ever tried to do something similar [before].”

Understanding meaning

Trained using a dataset consisting of 4 million Wikipedia citations, Meta’s new tool is able to analyze the claim attached to a citation and then cross-reference it with the supporting evidence in the cited source.

And this isn’t just a straightforward text string comparison, either. “There is a component like that, [looking at] the lexical similarity between the claim and the source, but that’s the easy case,” Petroni said. “With these models, what we have done is to build an index of all these webpages by chunking them into passages and providing an accurate representation for each passage … That is not representing word-by-word the passage, but the meaning of the passage. That means that two chunks of text with similar meanings will be represented in a very close position in the resulting n-dimensional space where all these passages are stored.”
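To make the idea of passages living in a shared vector space a little more concrete, here is a minimal sketch of the retrieval mechanics Petroni describes: chunk a page into passages, turn each passage into a vector, and find the one closest to the claim. This is not Meta's code. The function names are made up for illustration, the page text is invented, and the `embed` function is a deliberately crude word-count stand-in, which means it only captures the "easy case" of lexical overlap. A real system would swap in a learned encoder that maps passages with similar meanings to nearby dense vectors.

```python
import math
import re
from collections import Counter


def chunk(text, size=12):
    """Split a page into passages of at most `size` words."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(passage):
    """Toy stand-in for a learned encoder (an assumption for illustration).

    Returns a sparse word-count vector, so it measures lexical overlap only.
    A production system would return a dense vector that encodes meaning.
    """
    return Counter(re.findall(r"[a-z]+", passage.lower()))


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def best_passage(claim, page_text, size=12):
    """Return (score, passage) for the passage closest to the claim."""
    claim_vec = embed(claim)
    scored = [(cosine(claim_vec, embed(p)), p) for p in chunk(page_text, size)]
    return max(scored, default=(0.0, ""))


# Invented example text, loosely based on the Joe Hipp case above.
claim = "Joe Hipp challenged for the WBA heavyweight title"
page = ("The museum opens daily at nine in the morning. "
        "Joe Hipp fought for the WBA heavyweight title.")
score, passage = best_passage(claim, page, size=9)
print(round(score, 3), "|", passage)
```

A citation checker built along these lines would flag a citation when even the best-matching passage on the cited page scores poorly against the claim.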

Just as impressive as the ability to spot fraudulent citations, however, is the tool’s potential for suggesting better references. Deployed as a production model, this tool could helpfully suggest references that would best illustrate a certain point. While Petroni balks at it being likened to a factual spellcheck, flagging errors and suggesting improvements, that’s an easy way to think about what it might do.

But as Petroni explains, there is still much more work to be done before it reaches this point. “What we have built is a proof of concept,” he said. “It’s not really usable at the moment. In order for this to be usable, you need to have a fresh index that indexes much more data than what we currently have. It needs to be constantly updated, with new information coming every day.”

This could, at least in theory, include not just text, but multimedia as well. Perhaps there’s a great authoritative documentary that’s available on YouTube the system could direct users toward. Maybe the answer to a particular claim is hidden in an image somewhere online.

A question of quality

There are other challenges, too.

Notable in its absence, at least at present, is any attempt to independently grade the quality of sources cited. This is a thorny area in itself. As a simple illustration, would a brief, throwaway reference to a subject in, say, the New York Times prove a more suitable, high-quality citation than a more comprehensive, but less-renowned source?

Should a mainstream publication rank more highly than a non-mainstream one? Google’s trillion-dollar PageRank algorithm – certainly the most famous algorithm ever built around citations – had this built into its model by, in essence, equating a high-quality source with one that had a high number of incoming links. At present, Meta’s AI has nothing like this.
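For readers curious how "more incoming links means higher quality" turns into arithmetic, the following is a small sketch of the textbook power-iteration form of PageRank. It is an illustration under simplifying assumptions, not Google's production system and not part of Meta's tool. The four-page link graph is invented, and the damping factor of 0.85 is the value commonly used in descriptions of the algorithm.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Textbook PageRank by power iteration.

    `links` maps each page to the list of pages it links out to.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with a small baseline share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its rank evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Invented toy graph: three pages link to "c", and "c" links back to "a".
toy_links = {"a": ["c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(toy_links)
for page, value in sorted(ranks.items(), key=lambda kv: -kv[1]):
    print(page, round(value, 3))
```

In this toy graph the page with the most incoming links ends up ranked highest, and the page it links to inherits much of that standing, which is the sense in which links act as votes of confidence.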

If this AI were to work as an effective tool, it would need to have something like that. As a very obvious example of why, imagine that one were to set out to “prove” the most egregious, reprehensible opinion for inclusion on a Wikipedia page. If the only evidence needed to confirm that something is true is whether similar sentiments could be found published elsewhere online, then virtually any claim could technically prove correct, no matter how wrong it might be.

“[One area we are interested in] is trying to model explicitly the trustworthiness of a source, the trustworthiness of a domain,” Petroni said. “I think Wikipedia already has a list of domains that are considered trustworthy, and domains that are considered not. But instead of having a fixed list, it would be nice if we can find a way to promote these algorithmically.”
