Researchers just released profile data on 70,000 OkCupid users without permission

OK Cupid Caught Allowing Rape Of OKC Users’ Private Sex Data

Updated by Brian Resnick

A group of researchers has released a data set on nearly 70,000 users of the online dating site OkCupid. The data dump breaks the cardinal rule of social science research ethics: It took identifiable personal data without permission.

The information — while publicly available to OkCupid users — was collected by Danish researchers who never contacted OkCupid or its clientele about using it.

The data, collected from November 2014 to March 2015, includes user names, ages, gender, religion, and personality traits, as well as answers to the personal questions the site asks to help match potential mates. The users hail from a few dozen countries around the world.

Why did the researchers want the data?

The researchers, Emil Kirkegaard, Oliver Nordbjerg, and Julius Daugbjerg Bjerrekær, ran software to “scrape” the information off OkCupid’s website and then uploaded the data onto the Open Science Framework, an online forum where researchers are encouraged to share raw data to increase transparency and collaboration across social science. Kirkegaard, the lead author, is a graduate student at Aarhus University in Denmark. (The university notes Kirkegaard was not working on behalf of the university, and that “his actions are entirely his own responsibility.”)

Kirkegaard and Bjerrekær write that OkCupid is a valuable source of survey information “because users often answer hundreds if not thousands of questions.”

But the data set reveals deeply personal information about many of the users. OkCupid uses a series of personal questions — on topics such as sexual habits, politics, fidelity, feelings on homosexuality, etc. — to help match people on the site.

The data dump did not reveal anyone’s real name. But it’s entirely possible to use clues from a user’s location, demographics, and OkCupid user name to determine their identity.

If your OkC username is one you’ve used anywhere else, I now know your sexual preferences & kinks, your answers to thousands of questions.

Scott B. Weingart (@scott_bot) May 11, 2016

This is a huge breach of social science research ethics

The American Psychological Association makes it very clear: Participants in studies have the right to informed consent. They have a right to know how their data will be used, and they have the right to withdraw their data from that research. (There are some exceptions to the informed consent rule, but those do not apply when there’s a chance a person’s identity can be linked to sensitive information.)

This data scrape, and potential future studies built on it, won’t provide any of those protections. And scientists who use this data set may be in breach of the standard ethical code.

“This is without a doubt one of the most grossly unprofessional, unethical and reprehensible data releases I have ever seen,” writes Oliver Keyes, a social computing researcher*, on his blog.

A separate paper by Kirkegaard and Bjerrekær describing the methods they used in the OkCupid data scrape (also published on the Open Science Framework) contains another big ethical red flag. The authors report that they didn’t scrape profile pictures because it “would have taken up a lot of hard drive space.”

And when researchers asked Kirkegaard about these concerns on Twitter, he shrugged them off.

Note: The IRB is the institutional review board, a university office that reviews the ethics of research projects.

Does open science need some gatekeeping?

“Some may object to the ethics of gathering and releasing this data,” Kirkegaard and his colleagues argue in the paper. “However, all the data found in the dataset are or were already publicly available, so releasing this dataset merely presents it [in] a more useful form.”

(The profiles might technically be public, but why would OkCupid users expect anyone other than fellow users to look at them?)

On his blog, Keyes points out that Kirkegaard published the methods paper in a journal called Open Differential Psychology. The editor of that journal? Kirkegaard.

“The thing [Open Differential Psychology] looks pretty much like a vanity press,” Keyes writes. “In fact, of the last 26 papers it ‘published’, he authored or co-authored 13.” The paper claims it was peer-reviewed, but the fact that Kirkegaard is the editor is a conflict of interest.

The Open Science Framework was created, in part, in response to the traditional scientific gatekeeping of academic publishing. Anyone can publish data to it, with the hope that the freely accessible information will spur innovation and keep scientists accountable for their analyses. And as with YouTube or GitHub, it’s up to the users to ensure the integrity of the information, and not the framework.

If Kirkegaard is found to have violated the site’s terms of use (i.e., if OkCupid files a legal complaint), the data will be removed, says Brian Nosek, the executive director of the Center for Open Science, which hosts the site.

This seems likely to happen. An OkCupid spokesperson tells me: “This is a clear violation of our terms of service — and the Computer Fraud and Abuse Act — and we’re exploring legal options.”

Overall, Nosek says the quality of the data is the responsibility of the Open Science Framework users. He says that personally he’d never post data with potential identifiers.

(For what it’s worth, Kirkegaard and his crew aren’t the first to scrape OkCupid user data. One user scraped the site to match with more women, but it’s a bit more controversial when data is posted on a site meant to help scientists find fodder for their projects.)

Nosek says the Center for Open Science is having internal discussions about whether it should intervene in these cases. “This is a tricky question, because we are not the moral truth of what is appropriate to share or not,” he says. “That’s going to require some follow-up.” Even transparent science may need some gatekeeping.

It might be too late for this episode. The data has been downloaded nearly 500 times so far, and some are already analyzing it.

*This post originally identified Keyes as an employee of the Wikimedia Foundation. He no longer works there.
