Data-sharing is often much easier said than done. In the past, researchers created large and valuable databases which would often languish on the university’s server fading into oblivion after the particular post-doc or graduate student who created it had moved on. It has actually been shown that for the field of ecology, the likelihood of accessing data ever again decreases by 17% every year.
While that study is specific to a particular field, I can imagine some level of data loss in every field. Even if data was described in a publication, there is no easy way for an outside researcher to access it, or even know if that particular data would be useful in their new study. The times they are a-changing. Continue reading “Share the Data”
I just heard about a new “big data” project called Project Tycho. They chose the name Tycho in honor of Tycho Brahe who made tons of detailed observations of the stars and planets. After his death, his data was used by Kepler to formulate the laws of planetary motion. This project wants to connect the vast amounts of public health data to scientists and policy researchers to improve their understanding of contagious diseases and their spread. Their undertaking is incredible; they digitized weekly Nationally Notifiable Disease Surveillance System reports from 1888-2013. Now that all of the data is digitized they are working their way through standardizing it and making it amenable to analysis. This entire dataset is available for search online. Continue reading “Project Tycho: Vaccines prevent diseases!”
The ENCODE media fail was epic enough that it totally dominated the discussion when the results were released to the public. Now our collective fury has abated1, I’d like to talk about, not what ENCODE did, but what it might mean for how we conduct genomic research in the future.
ENCODE produced an unprecedented amount of data with unprecedented levels of reproducibility between labs. This data will be useful to researchers around the world for year to come. To do so, however, it commanded tremendous resources and marginalized the concerns of independent researchers. Can we harness the data collection power of these collective projects without destroying the creativity and risk-taking of individual scientists in the crucible of collaborative compromise? Continue reading “ENCODE, Astronomy, & the Future of Genomics”