No results found

All MRS websites use cookies to help us improve our services. Any data collected is anonymised. If you continue using this site without accepting cookies you may experience some performance issues. Read about our cookies here.

Accept and continue A white arrow

Reject all cookies A white arrow

Cookie Settings A white arrow

New Delphi report: Who owns understanding?

'Everything is interesting' - big challenges in big data
31 March 2014

In a keynote interview at the recent MRS annual conference, the renowned author, Will Self, commented that potentially everything is interesting; humans are passive entities that need to be stimulated.

We have the option of looking at the world from deep and narrow or broad and shallow perspectives. We can experience epiphanies, or insights, seeing the relationship between detail and the bigger picture.

When asked how he knew when to stop researching the background for his novels, Self replied that it is never possible to have all the available data, and time is not inexhaustible, whilst there is always the temptation to attempt completeness. Therefore, every research project needs a structure.

Finally, in creating a narrative, Self reminded us that this takes away as much as it gives – it is just one, our own, interpretation of data, events etc.

>>You can watch Will Self's full keynote interview here.

I mused on these points from Self’s interview during a recent fascinating one day conference organised by Cancer Research UK and Winton Capital Management at The British Library on April 1st on the theme of ‘Multidisciplinary Challenges of Big Data’.

As the speakers during the day reminded us, we cannot start from the perspective that everything might be interesting when analysing data – the challenge is detecting the real signals from that mass of noise, as underlined by presentations that covered the international project on cancer genomics, the Large Hadron Collider, cosmology, international financial markets, Google.

These were speakers who were also obviously highly stimulated by the challenges they face. The amount of data generated in some of these projects is beyond human comprehension, as were some of the figures from an article in National Geographic on neuroscience (‘Secrets of the Brain’, Vol. 225 No. 2, 2014) I happened to read on the train going home – a mouse brain has 70 million neurons and the project to complete that scan will take a further two years.

When asked to comment on scanning the entire human brain, Jeff Lightman (Ramón y Cajal Professor of Arts and Sciences at Harvard) replied: ‘I don’t dwell on that. It’s too painful’!

In fact, the work to date in re-creating just a tiny grain of salt sized part of the mouse’s brain contains 100 terabytes of data, the equivalent of 25,000 high definition movies. An image of the human brain would, it is estimated, contain 1.3 billion terabytes of data – just under half of total global digital storage in 2012.

In the conference, Harry Cliff (University of Cambridge) described the Hadron projects that involve 1 billion collisions per second, generating 10 zettabytes of raw data a year, 30 times greater than all current knowledge in the world. This data is communicated via a Worldwide LHC Computing Grid to 170 centres in 40 countries for analysis.

So, echoing Self, there is indeed a need to create an appropriate structure to projects when data is that big, and most of it is in fact not interesting and discarded, lost forever.

As speakers described, it is vital to identify the question before setting up the analysis – and these huge projects have protocols for doing this to ensure that the interesting data is robustly analysed against a defined aim.

The challenges facing the International Cancer Genome Consortium (ICGC) are equally complex with 10 billion items of data being generated in 2012. And their focus is only on the 20 most common cancers. As Jan Korbel (European Molecular Biology Laboratory) described, no two conditions are alike.

For example, Ruth Travis (Oxford University) described how prostate cancer in young men is different than the form in older men, and emphasised the importance of creating a holistic view of each individual patient – a data integration challenge.

The eventual aim from the ICGC programme is personalised medicine where a patient can be scanned and the data analysed at an affordable cost within an appropriate timeframe (could be hours, or days depending on the diagnosed condition), this data being also integrated with NHS and contextual data on the patient (e.g. lifestyle, demographics etc) to produce a tailored treatment programme.

Travis also described the issue of representativeness - who agrees to take part in the research and why the data still remains valuable when it isn’t, and therefore the need for replication studies.

The more complex the data, the more chance of bias, especially where data collected for one purpose is later used in other ways, and, as another panel member, John Copas (Warwick University), warned there is also the difference between what we can measure and what we’re interested in.

Don’t get fooled by the scale of available data (John Quackenbush, Dana-Farber Cancer Institute), and ensure you know the context.

Self’s advice about knowing when to draw the line and stop aiming for completeness seems another big challenge facing researchers in these fields. Whilst the immediate analytic need maybe met, this is simply one element in a complex, long-term programme of research.

When human health, or the building-blocks of the universe is the topic, drawing a line in the sand must be an extremely difficult decision to make.

What was particularly interesting about this conference was not just the detail that speakers provided in their presentations, but the interdisciplinary range of experiences covered in the programme, with plenty of opportunities to learn from those working in other fields.

This was underlined by the Rt Hon David Willetts MP (UK Minister for Universities and Science), who reminded us that George Osborne had announced in the recent Budget statement funding of £42m over 5 years for a new institute, the Alan Turing Centre (named after the UK WW2 codebreaker and computing pioneer) to ensure Britain leads the way in the use of big data.

Willetts also described how an adaptation of the real time telemetry system used in Formula 1 races by McLaren was being tested as a way to continuously monitor sick children in one UK hospital.

All of this makes the challenges faced in day-to-day market research projects seem rather small beer, but the same principles need to be applied, especially as the survey data is increasingly just one source of information in the pursuit of holistic knowledge about consumers and their behaviour.

Researchers need to collaborate across disciplines and borrow methods successful in other fields.

In ‘big data’, the challenges faced by researchers are the real ‘big’ – it is not simply about having lots of data. Research and curiosity go hand-in-hand; you don’t become a researcher unless you have an enquiring mind and a fascination for creating new knowledge.

However, not everything can be interesting, but as all researchers know, drawing the boundaries between what is, and what isn’t, can be the ultimate challenge.

P.S. Still on the big data theme, if you’ve not seen it, the UK National Statistician has recently recommended a predominantly online census in 2021 supplemented by the further use of administrative and survey data. It is now up to the government to make the final decision.

>>Read Keith Dugmore's blog on the future of the Census

How to access the International Journal of Market Research (IJMR)

Published by SAGE, MRS Certified Members can access the journal on the SAGE website via this link.

Go to the IJMR website A white arrow

More ijmr newsletters

IJMR Landmark Paper: ‘Role of the ESRC Data Archive in the dissemination of data for secondary analysis’

27 July 2018

‘Role of the ESRC Data Archive in the dissemination of data for secondary analysis’, Denise Lievesley, then Director, ESRC Data Archive, University of Essex, JMRS Vol 35 No 3, July 1993 I’ve written in an earlier Editorial about the excellent work being undertaken by the Archive of Market and Social Research…

The search for ‘truth’: measurement formats in research

27 March 2015

In the latest issue of IJMR, we are publishing three papers on the theme of measurement formats. The first is a comprehensive literature review, by Callegaro et al, that in addition to summarising ‘best practice’ in the search for ‘truth’ in data collection, also identifies gaps in current published knowledge…

‘Fit for purpose’ - the researcher’s mantra

04 March 2014

In 'We can do better' - the Viewpoint in our latest issue (Vol.56 Issue1) - Reg Baker addresses a question posed for a panel he served on at last September’s ESOMAR Congress: “Do we need to get over ourselves and stop worrying too much about representativeness, as opposed to delivering new insights?” To do…

Digitisation of MRS Journal - free archive of papers from 1959-1990 available

26 September 2019

I will be retiring as Editor in Chief of IJMR when the final issue for this year is published in late November, so this will be my final Editor’s blog. As you will see below, it is rather different than the others I’ve written over the years as it celebrates…

Good qual is like being in love - you’ll know it when it happens

19 January 2017

Discussing the latest landmark paper:‘Qualitative market research: a conceptual analysis and review of practitioner criteria’, John Colwell (Middlesex Polytechnic), JMRS Vol. 32. No. 1, January 1990 At this year’s MRS annual conference, Impact 2017, IJMR is hosting four sessions, three of which are debates based on papers recently published in the journal. One of these sessions…

Landmark Paper: How do you like your data: raw, al dente or stewed?

30 July 2014

The latest Landmark Paper is drawn from the two special issues of JMRS, celebrating the 50th anniversary of the MRS. It was originally published in the Proceedings of MRS Conference 1985, and presented at that event by the authors. The theme of the paper addresses what should surely be a fundamental…

0 comments

Get the latest MRS news

Our newsletters cover the latest MRS events, policy updates and research news.

'Everything is interesting' - big challenges in big data
31 March 2014

How to access the International Journal of Market Research (IJMR)

More ijmr newsletters

IJMR Landmark Paper: ‘Role of the ESRC Data Archive in the dissemination of data for secondary analysis’

The search for ‘truth’: measurement formats in research

‘Fit for purpose’ - the researcher’s mantra

Digitisation of MRS Journal - free archive of papers from 1959-1990 available

Good qual is like being in love - you’ll know it when it happens

Landmark Paper: How do you like your data: raw, al dente or stewed?

0 comments

Get the latest MRS news

The Research Buyers Guide

Find your next agency

Advanced Search.

Research Buyer's Guide (RBG)

Research Live

International Journal of Market Research (IJMR)

Recruiter Accreditation

Global Data Quality (GDQ)

Settings

Services

'Everything is interesting' - big challenges in big data31 March 2014

How to access the International Journal of Market Research (IJMR)

More ijmr newsletters

IJMR Landmark Paper: ‘Role of the ESRC Data Archive in the dissemination of data for secondary analysis’

The search for ‘truth’: measurement formats in research

‘Fit for purpose’ - the researcher’s mantra

Digitisation of MRS Journal - free archive of papers from 1959-1990 available

Good qual is like being in love - you’ll know it when it happens

Landmark Paper: How do you like your data: raw, al dente or stewed?

0 comments

Get the latest MRS news

The Research Buyers Guide

Find your next agency

Advanced Search.

'Everything is interesting' - big challenges in big data
31 March 2014