Paper: A preliminary review of influential works in data-driven discovery. Stalzer & Mentzel (2016)
Authors used the word "data driven discovery" and they wanted to know what exactly it was…or meant. They wanted to know what tools were being used by everyone. To do this, they open up a grant competition for $1.5M dollars to ask researchers about what they believed data driven discovery to be.
They completely missed the social sciences. Home department was not a good indication of what the researcher did. There were sets of domains within those who submit––particularly in astronomy & biology.
"The dataset" contained
Big data = any datasets you don't have the tools to handle
"New instruments are also showing that data-driven discovery is not just about the volume of data, but also the “velocity”."
The amount of sequencing data outnumbers the amount of available compute.
Domain Sciences: These disciplines, at the time, were the most challenged by big data.
Methodologies
You can't infer things that you haven't sampled.
Two questions:
Submit 3-4 sentences describing either a problem that you tackled with one of the techniques described in this paper, or a problem that you want to tackle.