My precious data Data management planning workshop Open Science Course 2016

What do we do?

- Start to think data management - Get familiar with DMPtuuli - Work with data management plan (real or fictional)

How do you think your data? Research oriented - it’s just data - It’s needed for research

Contribution oriented - It’s one output of my research - It is as important as my publications - It’s part of my contribution to science community

Learning to value your data You might have an unique data set that is not available anywhere else - Samples from environment that does not exist anymore - Collection of data that no one else has collected - Value of research data might last longer than the value of publication -

It can re-analysed/used differently in different context Future research: real-time analysis

Don’t be shy!

How to make a data management plan?

Questions for data management plan - Data -

How do you collect your data? (methods, formats, tools) Where is data saved during research project? (sensitive data handling, sharing data etc.) How do you analyse/use data (methods/tools)

- Ethics/Legal -

Commitment to ethical standards

- Publishing/Long-term preservation -

Where is data after research project? (repository) How can it be accessed? (licence) How can it be found? (metadata) How can it be used (format and documentation)

DMP: Academy of Finland -

Partners Type of data Technical documentation Ethics and legal Publishing and long-term preservation http://www.aka.fi/en/funding/how-to-apply/appendices-required/data-managemen t-plan/

DMP: Finnish Social Science Data Archive http://www.fsd.uta.fi/aineistonhallinta/en/data-management-planning.html

DMPTuuli Simple service for writing data management plans - Funder specific guidance - Additional organisation specific guidance - Exports several formats (pdf, docx, csv, text, json, html, xml) https://www.dmptuuli.fi

Publishing Data

Requirements from funders We require that principal investigators of Academy-funded research projects see to that the projects data are stored and made available through major national or international archives or storage services that are important in the fields concerned. Data may for justified reasons, however, come in varying degrees of openness, ranging from fully open to strictly confidential. http://www.aka.fi/en/funding/responsible-research/open-science/

Requirements from publishers An inherent principle of publication is that others should be able to replicate and build upon the authors' published claims. A condition of publication in a Nature journal is that authors are required to make materials, data, code, and associated protocols promptly available to readers without undue qualifications. http://www.nature.com/authors/policies/availability.html

Dropbox is NOT an repository!!!

Publishing data Let people know about your data - Publishing metadata - Publishing metadata + data Re-usability? - Data is available (data dump) - Data and documentation - Data available as service -

http://avaa.tdata.fi/web/avaa/etusivu

“So called” open data Opening data that is usable for others - Messy - No documentation Re-usability? - None

Where do I put my data? Repository lists: - http://www.nature.com/sdata/policies/repositories - http://www.re3data.org/ Some generic repositories: -

https://dataverse.harvard.edu/ http://datadryad.org/ IDA (starting 2017 for long-term preservation) JYU: dataverse, JYX

http://dataverse.org/blog/scientific-data-now-recommends-harvard-dataverse-all-areas-s cience

You can always ask help: [email protected]

Please inform university library when you publish data set

My precious data - GitHub

Open Science Course 2016 ... It's part of my contribution to science community ... Exports several formats (pdf, docx, csv, text, json, html, xml) ... http://dataverse.org/blog/scientific-data-now-recommends-harvard-dataverse-all-areas-s · cience ...

43KB Sizes 4 Downloads 101 Views

Recommend Documents

My assemblies - GitHub
Page 1. My assemblies. Note: This is an example. Assembly 1. Assembly 2. HA1 rc1 my promoter. RNA stability acs. PolyA. I1 my promoter gene with a very very long name. PolyA. I1.

Open Data Canvas - GitHub
Top need for accessing data online. What data is most needed? Solution. How would you solve this problem? ... How big is the universe of users? Format/Use.

Tabloid data set - GitHub
The Predictive Analytics team builds a model for the probability the customer responds given ... 3 Summary statistics .... Predictions are stored for later analysis.

data tables - GitHub
fwrite - parallel file writer. SOURCE: http://blog.h2o.ai/2016/04/fast-csv-writing-for-r/ ... SOURCE: https://www.r-project.org/dsc/2016/slides/ParallelSort.pdf length.

Data Science - GitHub
Exploratory Data Analysis ... The Data Science Specialization covers the concepts and tools for ... a degree or official status at the Johns Hopkins University.

RN-171 Data Sheet - GitHub
Jan 27, 2012 - 171 is perfect for mobile wireless applications such as asset monitoring ... development of your application. ... sensor data to a web server.

PGP, GPG, and Enigmail... Oh My! - GitHub
People use PGP to sign, encrypt, and decrypt emails, files, folders, and even whole disk partitions. PGP allows you to specify a recipient to encrypt a message for.

Precious Provisions.pdf
Page 3 of 77. Precious Provisions.pdf. Precious Provisions.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Precious Provisions.pdf.

time is precious
2% increase in form completion after form optimisation. CTA changes result in 4% increase in clicks. 3% increase in visitors to quote after site is secure.

Prosper Loan Data Analysis - GitHub
not visible in the HTML/PDF export for the simlicity but the codes can be reviewed from the RMD file. The dataset is ... Prosper rating for borrowers in numbers ..... Household. Expenses. Personal. Loan. Auto. Business. Home. Improvement. Other ... 1

Precious Savior, Dear Redeemer.pdf
Page 1 of 1. Precious Savior, Dear Redeemer. H R Palmer. 1. 2. bind. weak. mes. but. sage. 3. 1. Thou. We. Thy. 3. 3. wilt. are. sweet. 4. 3. 3. 3. brok. thou. now.

unstructured data and the enterprise - GitHub
make up the largest amount of unstructured data cura ... Most of these systems leverage metadata to provide an extra layer of .... Various media formats (images, audio, and video) and social media chatter are also .... Web sites that are primarily da