By John McCormick

Facebook Inc. on Thursday made publicly available a dataset designed to help artificial-intelligence researchers evaluate their computer-vision and audio models for potential algorithmic bias.

The dataset, called Casual Conversations, consists of videos of some 3,000 participants of various skin tones sharing their age and gender.

Cristian Canton Ferrer, Facebook AI's research manager who supervised the effort, said the dataset tries to address two problems: "The critical need within the AI community to [improve] the fairness of AI systems" and "the lack of high-quality data sets that are designed to help measure this fairness in AI." Facebook AI is the social network's artificial-intelligence organization.

Artificial-intelligence systems are trained on large sets of data. Facial-recognition systems, for instance, are fed mountains of facial images, which allow the system to find patterns in faces that it can use to make a match. If the dataset used to train a system consists mostly of photos of people from one demographic group, the system may be less accurate at identifying people from other groups. AI systems have been shown to be less accurate at identifying the faces of dark-skinned women, for example.

Facebook said that having participants provide their own ages and genders for labeling, rather than having a third party or a computer system estimate that information, yields a relatively unbiased dataset of people's actual ages and genders.

The Casual Conversations dataset also includes labels of participants' apparent skin tones that were developed by trained annotators using the Fitzpatrick scale, a skin classification system. The annotators also marked videos with ambient lighting conditions, which can help measure how AI systems treat different skin tones under low-light conditions.
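As a rough illustration of what such labels might look like in practice, here is a minimal Python sketch of a per-video annotation record. The field names and structure are assumptions for illustration, not Facebook's actual schema; only the six Fitzpatrick types (I through VI) and the lighting flag come from the scale and the description above.

    from dataclasses import dataclass
    from enum import IntEnum


    class FitzpatrickType(IntEnum):
        """The six skin types of the Fitzpatrick scale, lightest (I) to darkest (VI)."""
        I = 1
        II = 2
        III = 3
        IV = 4
        V = 5
        VI = 6


    @dataclass
    class VideoAnnotation:
        """Hypothetical per-video label record; field names are illustrative."""
        video_id: str
        self_reported_age: int               # provided by the participant
        self_reported_gender: str            # provided by the participant
        apparent_skin_type: FitzpatrickType  # assigned by a trained annotator
        low_light: bool                      # ambient-lighting flag added by annotators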

In all, participants made an average of 15 videos each, engaging in unscripted conversations, for a total of more than 45,000 videos. The videos were originally gathered as part of an earlier project Facebook participated in called the Deepfake Detection Challenge, which was set up to accelerate research for detecting and preventing manipulated media.

Many companies have released tools in recent years designed to check algorithms for bias. The LinkedIn Fairness Toolkit, introduced last year by Microsoft Corp.'s professional social-network unit, analyzes the attributes of a dataset, such as its racial and gender makeup, and compares its findings with an algorithm's results. If a dataset is split nearly evenly by gender, for example, but a search algorithm built on that dataset returns results in which only a quarter are women, the system will flag the gap.
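For readers who want a concrete picture of that kind of check, the short Python sketch below compares the gender makeup of a dataset with the makeup of an algorithm's results and flags any large gap. It illustrates only the general idea described above, not the LinkedIn toolkit's actual code or API, and the 20-percentage-point threshold is an arbitrary assumption.

    from collections import Counter

    def gender_share(records, key="gender"):
        """Return the fraction of records carrying each gender label."""
        counts = Counter(r[key] for r in records)
        total = sum(counts.values())
        return {g: n / total for g, n in counts.items()}

    def flag_disparity(dataset, results, max_gap=0.2):
        """Report genders whose share of the results falls more than
        `max_gap` below their share of the dataset (illustrative threshold)."""
        data_share = gender_share(dataset)
        result_share = gender_share(results)
        return {
            g: (data_share[g], result_share.get(g, 0.0))
            for g in data_share
            if data_share[g] - result_share.get(g, 0.0) > max_gap
        }

    # Example: a dataset split roughly 50/50 by gender, but search results
    # that are only about a quarter women, gets flagged.
    dataset = [{"gender": "woman"}] * 50 + [{"gender": "man"}] * 50
    results = [{"gender": "woman"}] * 25 + [{"gender": "man"}] * 75
    print(flag_disparity(dataset, results))  # {'woman': (0.5, 0.25)}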

The standard way of evaluating AI models' performance today is to measure them against a test set after the models have been trained and validated, Facebook said. But that test set, the company said, may contain the same shortcomings as the training sets because it may be collected from similar sources.

Svetlana Sicular, a vice president and analyst at technology research and advisory firm Gartner Inc., said such a second set of eyes can help AI developers validate the fairness of their systems.

The dataset, for instance, allows a company building a product with a facial-recognition feature to perform additional algorithmic-bias testing. In an initial test, the system might appear to perform equally well across ages and genders. Run against Facebook's Casual Conversations dataset, however, where actual ages, genders and annotated skin tones are known, a second test might reveal that the system performs less consistently for people with a certain skin tone.
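A minimal Python sketch of what that second, subgroup-level test could look like appears below. The `model.predict` call, the field names and the 5-percentage-point tolerance are placeholders chosen for illustration, not Facebook's evaluation code.

    from collections import defaultdict

    def accuracy_by_group(model, samples, group_key="skin_type"):
        """Compute accuracy separately for each labeled subgroup.

        `samples` is a list of dicts with an input, a ground-truth label and a
        demographic label such as annotated skin type; `model.predict` stands in
        for whatever inference call the system exposes.
        """
        correct, total = defaultdict(int), defaultdict(int)
        for s in samples:
            group = s[group_key]
            total[group] += 1
            if model.predict(s["input"]) == s["label"]:
                correct[group] += 1
        return {g: correct[g] / total[g] for g in total}

    def lagging_groups(per_group_accuracy, tolerance=0.05):
        """Flag subgroups whose accuracy trails the best-performing group
        by more than `tolerance` (an illustrative threshold)."""
        best = max(per_group_accuracy.values())
        return {g: a for g, a in per_group_accuracy.items() if best - a > tolerance}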

"That's where these data sets might be helpful -- to allow you to measure how fair you are with respect to a bunch of different categories, " Mr. Ferrer said.

If a problem is identified, he said, the developer could add more images of people with that skin tone to the software's training set to improve the AI system's ability to recognize them.
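One simple way to rebalance a training set along those lines is to oversample the underrepresented group until every group is the same size, as in the Python sketch below. This is purely an illustration under assumed field names; in practice, teams would more likely collect genuinely new images than duplicate existing ones.

    import random
    from collections import defaultdict

    def oversample_underrepresented(training_set, group_key="skin_type", seed=0):
        """Duplicate examples from smaller skin-tone groups (with replacement)
        so every group is as large as the biggest one. Illustrative only."""
        rng = random.Random(seed)
        by_group = defaultdict(list)
        for example in training_set:
            by_group[example[group_key]].append(example)

        target = max(len(examples) for examples in by_group.values())
        balanced = []
        for examples in by_group.values():
            balanced.extend(examples)
            balanced.extend(rng.choices(examples, k=target - len(examples)))
        rng.shuffle(balanced)
        return balanced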

But the dataset is just a step, Mr. Ferrer said. The company is allowing outside developers to access the dataset to find ways to improve it. For instance, he said, the dataset's videos were all captured in the U.S.; an outsider, he suggested, could enrich it by adding videos of people outside the U.S.

Write to John McCormick at john.mccormick@wsj.com

(END) Dow Jones Newswires

04-08-21 0914ET