三级aa视频在线观看-三级国产-三级国产精品一区二区-三级国产三级在线-三级国产在线

USEUROPEAFRICAASIA 中文雙語Fran?ais
Opinion
Home / Opinion / Op-Ed Contributors

Better manage risks inherent in Big Data

By Ernest Davis | China Daily | Updated: 2017-02-13 08:00

Better manage risks inherent in Big Data

A man tries out a VR (virtual reality) device during the ongoing Big Data Expo 2016 in Guiyang, capital of Southwest China's Guizhou province, May 25, 2016. [Photo/Xinhua]

In the last 15 years, we have witnessed an explosion in the amount of digital data available-from the Internet, social media, scientific equipment, smart phones, surveillance cameras, and many other sources-and in the computer technologies used to process it. "Big Data", as it is known, will undoubtedly deliver important scientific, technological, and medical advances. But Big Data also poses serious risks if it is misused or abused.

But having more data is no substitute for having high-quality data. For example, a recent article in Nature reports that election pollsters in the United States are struggling to obtain representative samples of the population, because they are legally permitted to call only landline telephones, whereas Americans increasingly rely on cellphones. And while one can find countless political opinions on social media, these aren't reliably representative of voters, either. In fact, a substantial share of tweets and Facebook posts about politics are computer-generated.

A Big Data program that used this search result to evaluate hiring and promotion decisions might penalize black candidates who resembled the pictures in the results for "unprofessional hairstyles," thereby perpetuating traditional social biases. And this isn't just a hypothetical possibility. Last year, a ProPublica investigation of "recidivism risk models" demonstrated that a widely used methodology to determine sentences for convicted criminals systematically overestimates the likelihood that black defendants will commit crimes in the future, and underestimates the risk that white defendants will do so.

Another hazard of Big Data is that it can be gamed. When people know that a data set is being used to make important decisions that will affect them, they have an incentive to tip the scales in their favor. For example, teachers who are judged according to their students' test scores may be more likely to "teach to the test," or even to cheat.

Similarly, college administrators who want to move their institutions up in the US News and World Reports rankings have made unwise decisions, such as investing in extravagant gyms at the expense of academics. Worse, they have made grotesquely unethical decisions, such as the effort by Mount Saint Mary's University to boost its "retention rate" by identifying and expelling weaker students in the first few weeks of school.

A third hazard is privacy violations, because so much of the data now available contains personal information. In recent years, enormous collections of confidential data have been stolen from commercial and government sites; and researchers have shown how people's political opinions or even sexual preferences can be accurately gleaned from seemingly innocuous online postings, such as movie reviews-even when they are published pseudonymously.

Finally, Big Data poses a challenge for accountability. Someone who feels that he or she has been treated unfairly by an algorithm's decision often has no way to appeal it, either because specific results cannot be interpreted, or because the people who have written the algorithm refuse to provide details about how it works. And while governments or corporations might intimidate anyone who objects by describing their algorithms as "mathematical" or "scientific," they, too, are often awed by their creations' behavior. The European Union recently adopted a measure guaranteeing people affected by algorithms a "right to an explanation"; but only time will tell how this will work in practice.

When people who are harmed by Big Data have no avenues for recourse, the results can be toxic and far-reaching, as data scientist Cathy O'Neil demonstrates in her recent book Weapons of Math Destruction.

The good news is that the hazards of Big Data can be largely avoided. But they won't be unless we zealously protect people's privacy, detect and correct unfairness, use algorithmic recommendations prudently, and maintain a rigorous understanding of algorithms' inner workings and the data that informs their decisions.

The author is a professor of computer science at the Courant Institute of Mathematical Sciences, New York University.

Project Syndicate

Most Viewed in 24 Hours
Copyright 1995 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
主站蜘蛛池模板: 国产伦精品一区二区三区免费 | 国产精品极品美女免费观看 | 亚洲午夜视频在线 | 亚洲第一免费播放区 | 色综合久久六月婷婷中文字幕 | 日本第一次处毛片 | 久久精品中文字幕有码日本 | 久久久国产高清 | 在线碰 | 欧美丝足 | 33333在线亚洲| 草草在线播放 | 日韩中文字幕在线看 | 欧美综合图区亚欧综合图区 | 国产96福利视频在线观看 | 国产成人8x视频一区二区 | 六月婷婷视频 | a4yy午夜 | 日韩亚洲一区中文字幕 | 亚洲精品国产一区二区 | a一级黄色 | 精品福利一区二区三区免费视频 | 日本一级毛片不卡免费 | 欧美一级特黄aaaaaa在线看首页 | 欧美在线视频二区 | 国产精品不卡无毒在线观看 | 久久无码精品一区二区三区 | 国产亚洲欧美在线播放网站 | 国产成人18黄网站麻豆 | 国产免费观看a大片的网站 国产免费观看网站黄页 | 国产大片免费天天看 | 看美女黄色片 | 你懂得国产 | 日本0930免费视频 | 日本免费乱人伦在线观看 | 一级国产在线观看高清 | 毛片在线网 | 黄色a级在线观看 | ak福利午夜在线观看 | 欧美国产视频 | 青青青青久久久久国产的 |