Opinion

How Cambridge Analytica’s Facebook targeting model really worked – according to the person who built it

Matthew Hindman speaks to Aleksandr Kogan, the man who built the infamous Cambridge Analytica-Facebook data analysis project, in this crossposting from The Conversation.

MATTHEW HINDMAN

April 4, 2018 12:08

The researcher whose work is at the center of the Facebook-Cambridge Analytica data analysis and political advertising uproar has revealed that his method worked much like the one Netflix uses to recommend movies.

In an email to me, Cambridge University scholar Aleksandr Kogan explained how his statistical model processed Facebook data for Cambridge Analytica. The accuracy he claims suggests it works about as well as established voter-targeting methods based on demographics like race, age and gender.

If confirmed, Kogan’s account would mean the digital modeling Cambridge Analytica used was hardly the virtual crystal ball a few have claimed. Yet the numbers Kogan provides also show what is – and isn’t – actually possible by combining personal data with machine learning for political ends.

Regarding one key public concern, though, Kogan’s numbers suggest that information on users’ personalities or “psychographics” was just a modest part of how the model targeted citizens. It was not a personality model strictly speaking, but rather one that boiled down demographics, social influences, personality and everything else into a big correlated lump. This soak-up-all-the-correlation-and-call-it-personality approach seems to have created a valuable campaign tool, even if the product being sold wasn’t quite as it was billed.

Be a member to keep reading

Join Mumbrella Pro to access the Mumbrella archive and read our premium analysis of everything under the media and marketing umbrella.

Become a member

Topics

Cambridge Analytica Facebook George Washington University Matthew Hindman The Conversation

1 COMMENTS

Antony G 4 Apr 18

As Stillwell and Koscinski (et al.) demonstrated, aligning a Big Five dimension (output) with a congruent message (input) resulted in greater efficacy. Adding demographic dimension to the input would allow for very effective ‘push-polling’ or ‘dogwhistle’ political messaging.

https://mumbrella.com.au/did-a-short-research-paper-just-unlock-facebooks-psychometric-blackbox-484800

The elephant in the room is this; if the Stillwell, Koscinski et al. method of gathering information to inform their psychometric modelling was effectively shut down in 2012 by FB, why did FB allow another entity to do the same in 2015 – as well as increase its scope by allowing them to harvest data from the friends and linked others of the consenting participants too?

User ID not verified.