# Open Data Day at Cambridge Brainhack ![](https://lh3.googleusercontent.com/KMM_p4kU3phgUTWMskXNsdyGziCHWPAXNIvJYHwkYtSDY9slKRQTvRmmV3a56iVpMze7zq7UF1U9jZ7WBzFuu_vmt60sVErjmMgXWF-rbgMExQlCJTbRKC2-b-8tBgrD89SSgZcAdrObwNwLB1e4AzKK5c6FsQOY8uIIX3BS0ghsvNTyB_-At9oskg60IUyZDedAAq6wx86_a-KoO766jnJ_X0fb-CNpnxCRYhc4F_wCOpePfpPL6i3j_NbMLBExYVQsGlFlc4OlMEXAeEIfOAFOd_bOf4LllkxjREKbd1JafrS4xCRk2IQ6qIWviS7lFxn3idCIa1BOy9UXjThGRWhvLmFIFjVaMejPcPkZCakd1d1IEKEsFnVGPH1F9iHCS1Dt6m9rmGQxAOJZiukfN8DeHSn9PIwgMryj4Enu4CocR3FoDtfSsZTusM_c01xXCgU8nslmjruq4W7LRoOLIpaIaaY_Y6YWRHWah3gLupbm4QhgiSwmJIB2m5x_VSoieChFZ6fUoDuTOVMMvL0XLnMNeq7a-5R2dS3IGsOfC8fJP2EMBZqMyZzh_4hFyS6c_6emcDhoy7xZgEH038oLGOnmOkSsvPZUhAY-CVuIfgyjUYVJwNilHgV6f-4dqROlFp82ra-KpAc_ehXnnpNtyQ3t-yf4n3Gke4cPgt40TA=w800-h400-no) [Brainhack Global 2017](http://events.brainhack.org/global2017) consisted of 40 satellite events around the world on March 2nd, 3rd and 4th 2017. [Brainhack](http://brainhack.org/) is a unique hackathon and unconference that brings together researchers with disparate backgrounds to collaborate on open science projects in neuroimaging. <iframe src="https://www.google.com/maps/d/embed?mid=1XpAJEj5dFbyOfBocQErzPPJCxpw" width="640" height="480"></iframe> *A map of all the [Brainhack Global 2017](http://events.brainhack.org/global2017) satellite sites.* We had more than 40 participants at the [Cambridge satellite event](https://cambrainhack.github.io/) of Brainhack Global representing early career researchers from multiple university departments and research institutes. Over three days we supported each other as we learned new skills and developed analyses to investigate neuroimaging data. The organising committee worked hard to foster a warm and friendly atmosphere. We know how hard it is to go outside their comfort zone and we wanted to make sure that everyone felt welcome. We had a strict [code of conduct](http://events.brainhack.org/global2017/codeofconduct.html) and made it very clear that everyone was welcome, no matter your race, gender, level of coding ability, or choice of programming language. The [talks](https://www.youtube.com/channel/UC6qjm8xadCIJfcQP4LdFDqQ) that kicked off our first two days together were a great opportunity to get excited about new areas of research. They inspired participants to consider how we can all play a role in the future of big data and open science in neuroimaging. ![](https://lh3.googleusercontent.com/MltfzFOUgdfyCeyLOfDsOGlYUq02DEZUj961Pr3NdHVZzjet12fcNO8YkI6kz0OkSR3mVSt1uLV7VNqqu8Xx99_ohiWLpf6LslES4ErPwI-40zT6FGMJN0qFuuwzOOAuDCdMI_46-fmYOdcWoO0s05Xz9nQ8VE4kjQ5C_j9RA8wMD7fBN7gYx2JpfA2td0RKF5dv2Lwd3ZdisrTdxSlXLVvg8LGnMJBE3wthSDyQbnRQHM-rM3o4Khyl276DanqB2PxpIzZLYj-bd1-caUkWsYiaCs44jc6ajUrA7wS24nMgHbO6WTRrqknLfcnQybD318mFoinLh9GXWKGWcK_ZEXt_LG3jK8K5Rs-f6hihJM2xXMXKzb16OQ7eq0Lc5gtXERJet31-U7YTyeKyw0d5elWLNTUx54ZitsMOWIsm6EyW5MRBPpYvAURv0ud6CZU484VqyaitTkDTpHhjdU95cN0hxcukUn8Ev4jbP005bI_2uAcBL4r0LO5aOGv6Q-Dr7sKGyoDo3CqVL7uII5HAbmZ3Ll5rDSm0cdLUap0j2YG2J_ZOTrbmEbQFb-_Oii-Xe1qfjoszN3U7aq81QPi8mlWzl5aClTGVRtB2DPug5-uuKNhQzqwxRnDA3eG3UXkSAQkKl3Sfj8fFi9RprQWPqrxmY3qv_tDm03CxgfFiDA=w1267-h950-no) *Jakob Seidlitz outlining some excellent sources of open brain imaging data.* [František Váša](http://www.neuroscience.cam.ac.uk/directory/profile.php?fv247) and [Jakob Seidlitz](http://www.neuroscience.cam.ac.uk/directory/profile.php?jms290) gave an excellent tour of the [fundamentals of network neuroscience](https://doi.org/10.6084/m9.figshare.4714414.v1) and introduced some of the freely available datasets that researchers could utilize to carry out this type of research. I gave a presentation on [how to make your results reproducible](https://doi.org/10.6084/m9.figshare.4720996.v1) and [Dr Niko Kriegeskorte](http://www.neuroscience.cam.ac.uk/directory/profile.php?nikokri) showed how his lab are using [deep neural networks](https://en.wikipedia.org/wiki/Deep_learning) to understand visual perception. Of particular note for open data advocates was [Dr Darren Price's](http://www.mrc-cbu.cam.ac.uk/people/darren-price/) presentation of the [Cambridge Centre for Ageing and Neuroscience](http://www.cam-can.org/) (Cam-CAN) dataset. You can [apply for access]() to the data from 700 adults, aged between 18 and 88 years old, who were scanned using structural and functional Magnetic Resonance Imaging (MRI) and magnetoencephalography (MEG). These participants also completed a large battery of behavioural tasks and questionaires outside of the scanner. There's more information about the study [here](http://www.sciencedirect.com/science/article/pii/S1053811915008150). ![](https://lh3.googleusercontent.com/NjmyGsIfn-yzdscoM1evAqig2nYgglurMGlHNaz0uvAMgSXp6WYxsKCi9oBineb1ZKmTJGM7YJYvhQCDNjiZTpaIXkXu7gc8hJuiO7CX_hovI4xf_brsThIztFE1UowfBZLM17r_GEz-oMXRTYeP4mp29D5RNvk4Inh4nD14XH3kyOqPM5yvWzN9MinynYLLYvSbzAcc4hCPHr4oLOXK6s-Rvr4eEbHvdBhOR5LZTpEYtpGaTmBXnZuuxMpnxvdi2VidNVJOTr18ga3pCtx31dA8ljXlrS9hlIQiuP1Iizbe1ul3XwVs27Nq3qgkcUs_8yyJ0KBBoA2Ay6Qk0u6X4nQb-1T88Sgp8c487FCqQsTRpKJc8CnLH1gjIiKRzMwS9LW7NgWRCG3riX42G0PldbUGImS59V2wLrbHR_42ATMZLdXGf0oo6zs-sWx9RqHXntzsglpYo3zIAOMsITuY0B5egKlpAdb0qEFGTfvXkugyzwyEphyvNmRpUENv8rw4zTSaJWvyo7_6WclizIw9ioz3FYzsAuw_oyUYuBD0IYwXjMKDLIYdLBV_MzhI8CqzRIhFSwZhGP5vGqKjZjDfx5cG-wX7O5mzm0knz8QUe2zVGpfNKjq01oB-YNNLpjJXePjxOyAEA9W0Z-VvHOSeisdeAFxT6idr4MKBx-87Ow=w1267-h950-no) *Darren Price describing the Cambridge Centre for Ageing and Neuroscience study and managed access database.* We also had an excellent discussion with members of the [Wellcome Trust’s open research team](https://wellcome.ac.uk/what-we-do/our-work/open-research) on incentives and roadblocks for early career researchers who want to open their academic workflow. We covered some of the key reasons that Open Data is important: * Open data allows us to meet one of the core prinicples of the scientific method, that someone else could reproduce your results. It's what separates science from magic. * Open data also means you can get more out of a dataset. Re-using the data for a different purpose is a more efficient use of all the money spent on collecting it. A return on investment to the funder, which for Brainhack Cambridge participants is often the UK tax payer, is especially important when the datasets are expensive/difficult to collect. * Open data leads to better innovation and collaboration. By bringing together ideas from many different diciplines to understand the data from many different points of view, a diverse group of people can analyse the data in ways that you might never have imagined. ![](https://lh3.googleusercontent.com/JDIkPCZmA_vjwPTWBtB1OwE_x355FgAEN7iCk4AeBEqlM6MpvNj0ATKA1jjQ8pC1ocsB-44NS0cF0BzMrGTeWjm3QMppgcidHA1bEX3JhzvhoAen3XFI_sd_SEh_CMUKcKNRYxj4xb58lI56HGKX8CBEBj6wntJtFlFWWSzreSEeWiyJIPXUfJV9roZZ2-VecpBWQqszvmvV-x7R18BQ7WuxBRIv2opa0pONUcvnsXjFW3BPcCpIG5O9ek9t5o8LEefeT-RTt3nFGVI5U0Rx30GnSArbethZbdVRKKdFJhu2zwlXdDRl6Rkj84cWzUCuKxSZsZVxddo8NOrkT01oxXwrDYzjGePL0xaRDYUm9w3vJ8NJ_kwevWEdHXMsm-VqRNi_oMVjuGxmtINugAnzKHE0sH758_Bm1V0VEHol2XCfu4WuT5ohpxAcU4mzauUQwj4Jz8iSh7BcxZfvHuCoviwdgR8flZpX9UomntXih-DhhzGQmxOsvYK8G-6sKddQM3YBq7wdoga_QAtpSicMKR5OUqjhR65oY1FxsxmKUyfqTcjevpFMs_WekmGN6C7oRW2M1ksQXhAtEgyzX2gIss3Ick_FWtCohUdqOfCTRFAZHoa2NnsHP02mv8nqegijgVVwmHVP2uIrj1_v88AfEl8otDO40m1mUmDxxgPYKg=w1267-h950-no) *Hannah and Aki from the Wellcome Trust Open Research Team came to hear about our experiences and opinions as early career researchers.* There are, however, major challenges associated with sharing human brain imaging data: * It may be possible to identify individual people from their data. It can be very difficult to annonymise some data sets and sensitive information such as their history of mental health difficulties or intelligence measures should be protected. It can be very hard to know how to best navigate the ethics of sharing data and there may even be different requirements in different countries. * Some human neuroimaging datasets are large (a few hundred gigabytes per acquisition) and therefore many existing repositories are not suitable. Although members of Brainhack Global were working on the [Brain Imaging Data Structure](http://bids.neuroimaging.io/) project it is not yet widely adopted. This means it is difficult to organise data in a way that it is interpretable to other researchers. * Not only are the datasets large, but as it is very expensive to collect brain imaging data, there are many stakeholders and collaborators. It is unlikely that an early career researcher will be able to make the decision to share the data from their study. * There is a steep learning curve associated with learning new skils, software or platforms. Adding an additional open data burden on PhD students and postdocs may require a lot of dedicated time that they simply do not have. ![](https://lh3.googleusercontent.com/Ie73Y7lNE6tcrFJfb8KvYcyuzFIDZCR-8aa3YS9oR1moZUN5_plIbCVkDjOH7WeNEZJNxkoeqlC9EDrgzK445IRNz5cqv_KjyLiUyoKamMOG0kOmdoXVYxxpF11X5eknWxxmiPRUqgH8CAZmtUfhbJ5BCFplUu6IQ6rmz4p7KkAzP1ROXZU1asRti2bn_7rUiXsEZWwwAnuFuszFQf25fStrjJy2o786PUrbhiKTHcHHU6Z5ubJoZCoQKxcoDOMlqGvw7DS3avpPwK-0ijeAU1PrkmNHK-bHraaesJTNUgxnGJ-eHV0oOyX4FdvUGLxqFgH6h5d8eaEHhnRB9bG2eO-ioQ5gfPe3Kqbo64HpgILqAKz1hG9I3lSCTu3CXqU7q5emENHFnkdcuvdakpTR3UJ7YfnsY1DvmNUkV4h8E_iJPyBludgVUNITIRc2jetm-y5agEAd6OdDDDKMUfy2w82O-2j-XKEntHixcvD0onrhC3WZzBX3LzubJ1w8zwUxnogG0wGOKdOsl9gfyRDCpzMTNLP2drR5BQ5Q9sfhUScsoAlePAF3oYIdYeeAoE3Zx02g5y8zgM-9hHHBYkIYrhxrlPFOpDRjLLNX40I_EWb5Sof5S99Ek8hFRWPco_bUMQwfBHSTXmnybv3tfb2bOr6fWFZLQTaoFE7-sMT1hA=w713-h951-no) *Some of the skills on offer.* We all worked hard to help each other address this last challenge -- that it takes too much time to figure out new techniques on your own -- during the open hacking time at Cambridge Brainhack. It was a great opportunity to share what we know and learn from each other. We got some fantastic feedback that our participants left feeling motivated to develop their open science skills, and confident that they weren't the only one struggling. We're excited to continue to build our community together, locally, and around the world. ![](https://lh3.googleusercontent.com/XdrglH59IHeGUByFajpiwff1fmskOGcadT2Uu0Tx6VnD1Ihg-GFuzj6-iget7K5vsoFzYwiI1nAHMwZHBJm1Nhp3C28jccmyOPyW2qJpUFyBAABAIcoWIKXteXf2ySq6B1hdJvW26s0sbfFLA6VYifLveMpkl4gps-3nj4t3sJBazNrZVuIeYwQBf43e47aCSqC-BkOesqqqobAzHaTVxF_KNM7fApCn_mRcN02Ix6z5gbyD7ugCJgmRFlL0YomynzHCLnfIoSghWDOjnjAl3ryQYG1qEH-FLQilBfB8FAMVlwS2MKhbgS6w1oibB4laSe8OKhl3dsD6s9Sq0u0onmUq6dTtXHNiWNiVqIAkPf5doa6bIPPuqLmDsv2ULA2TjZ1m7eZaMDalGySeV1WojFn0KZA6TkK4q3aM1BZYYFGoaMCqkW1fB9_mQYVZPr1Z1zwH8YAGPleo0VTHihLtStUY_zJdEmnyQYhXY7moOBsLyN9dkz7Qhi0QJ2i1qbSC40LXFZvYECdjtl1robq6xalmjq-BAW11QUcrn0PaLhIQLPLkX9DTCONh3dZ9OrMNWnKdRV0G35tadwDDvCDJLNGijlHSTxcchzEYtTXIgnygYkZWsAKvC4ECgNQd_lVjQWT2l2VkN_gkkCWv7HHXgKHvGwtOZ75znPkDNynBOQ=w1267-h950-no) *Open hacking at Cambridge Brainhack.* *We are incredibly grateful to [SPARC and Open Data Day](http://opendataday.org/) for their mini-grant that provided lunch on the last day for our hungry, hard working participants. We would not have been able to hold Cambridge Brainhack without their support, along with our other fantastic sponsors: [The Wellcome Trust](https://wellcome.ac.uk/), [Overleaf](https://www.overleaf.com/), [PLOS](https://www.plos.org/), [Mozilla Science Lab](https://science.mozilla.org/), [MRC Cognition and Brain Sciences Unit](http://www.mrc-cbu.cam.ac.uk/) and the [University of Cambridge Department of Psychiatry](http://www.psychiatry.cam.ac.uk/).*