My precious data
Data management planning workshop
Open Science Course 2016

What do we do?

- Start to think about data management
- Get familiar with DMPTuuli
- Work on a data management plan (real or fictional)

How do you think about your data?

Research oriented
- It’s just data
- It’s needed for research

Contribution oriented
- It’s one output of my research
- It is as important as my publications
- It’s part of my contribution to the scientific community

Learning to value your data

You might have a unique data set that is not available anywhere else
- Samples from an environment that does not exist anymore
- A collection of data that no one else has collected

The value of research data might last longer than the value of a publication
- It can be re-analysed or used differently in a different context
- Future research: real-time analysis

Don’t be shy!

How to make a data management plan?

Questions for a data management plan

Data
- How do you collect your data? (methods, formats, tools)
- Where is data saved during the research project? (sensitive data handling, sharing data etc.)
- How do you analyse/use data? (methods/tools)

Ethics/Legal
- Commitment to ethical standards

Publishing/Long-term preservation
- Where is data after the research project? (repository)
- How can it be accessed? (licence)
- How can it be found? (metadata)
- How can it be used? (format and documentation)
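The publishing questions above (repository, licence, metadata, format and documentation) can be sketched as a small machine-readable record. This is only an illustration: the field names and the example values are hypothetical and do not follow any formal metadata schema or repository requirement.

```python
import json

# Hypothetical field names chosen for illustration; not a formal metadata schema.
REQUIRED_FIELDS = (
    "title",
    "creator",
    "repository",     # where is the data after the project?
    "licence",        # how can it be accessed?
    "keywords",       # how can it be found?
    "file_format",    # how can it be used?
    "documentation",  # how can it be used?
)


def build_metadata_record(**fields):
    """Return a JSON string describing a data set; raise if a field is missing."""
    missing = [name for name in REQUIRED_FIELDS if not fields.get(name)]
    if missing:
        raise ValueError("missing metadata fields: " + ", ".join(missing))
    record = {name: fields[name] for name in REQUIRED_FIELDS}
    return json.dumps(record, indent=2, sort_keys=True)


if __name__ == "__main__":
    # All values below are fictional.
    print(build_metadata_record(
        title="Example lake water samples",
        creator="A. Researcher",
        repository="(name of the chosen repository)",
        licence="CC BY 4.0",
        keywords=["water quality", "lakes"],
        file_format="csv",
        documentation="README.txt",
    ))
```

A record like this does not replace the repository's own metadata form, but filling it in early shows which of the questions still lack an answer.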

DMP: Academy of Finland
- Partners
- Type of data
- Technical documentation
- Ethics and legal
- Publishing and long-term preservation
http://www.aka.fi/en/funding/how-to-apply/appendices-required/data-management-plan/

DMP: Finnish Social Science Data Archive http://www.fsd.uta.fi/aineistonhallinta/en/data-management-planning.html

DMPTuuli
Simple service for writing data management plans
- Funder-specific guidance
- Additional organisation-specific guidance
- Exports to several formats (pdf, docx, csv, text, json, html, xml)
https://www.dmptuuli.fi

Publishing Data

Requirements from funders

We require that principal investigators of Academy-funded research projects see to that the projects data are stored and made available through major national or international archives or storage services that are important in the fields concerned. Data may for justified reasons, however, come in varying degrees of openness, ranging from fully open to strictly confidential.
http://www.aka.fi/en/funding/responsible-research/open-science/

Requirements from publishers

An inherent principle of publication is that others should be able to replicate and build upon the authors' published claims. A condition of publication in a Nature journal is that authors are required to make materials, data, code, and associated protocols promptly available to readers without undue qualifications.
http://www.nature.com/authors/policies/availability.html

Dropbox is NOT a repository!

Publishing data

Let people know about your data
- Publishing metadata
- Publishing metadata + data

Re-usability?
- Data is available (data dump)
- Data and documentation
- Data available as a service

http://avaa.tdata.fi/web/avaa/etusivu
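The difference between a bare data dump and "data and documentation" can be checked before upload. The sketch below is a hedged illustration, not a repository requirement: the function name `describe_data_dir` and the default documentation file name `README.txt` are assumptions made for this example. It lists every file in a folder with its SHA-256 checksum and reports whether a documentation file is present.

```python
import hashlib
from pathlib import Path


def describe_data_dir(data_dir, readme_name="README.txt"):
    """Return (has_readme, checksums) for all files under data_dir.

    checksums maps each file's relative path to its SHA-256 hex digest,
    so that others can verify they received the same files.
    """
    root = Path(data_dir)
    checksums = {}
    for path in sorted(root.rglob("*")):
        if path.is_file():
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            checksums[path.relative_to(root).as_posix()] = digest
    # readme_name is a hypothetical convention; use whatever your field expects.
    return (readme_name in checksums), checksums


if __name__ == "__main__":
    has_readme, checksums = describe_data_dir(".")
    for name, digest in checksums.items():
        print(digest, name)
    if not has_readme:
        print("No README.txt found: this is a data dump without documentation.")
```

Reading whole files into memory keeps the sketch short; very large files would be better hashed in chunks.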

“So called” open data

Opening data that is not usable for others
- Messy
- No documentation

Re-usability?
- None

Where do I put my data?

Repository lists:
- http://www.nature.com/sdata/policies/repositories
- http://www.re3data.org/

Some generic repositories:
- https://dataverse.harvard.edu/
- http://datadryad.org/
- IDA (starting 2017 for long-term preservation)
- JYU: dataverse, JYX

http://dataverse.org/blog/scientific-data-now-recommends-harvard-dataverse-all-areas-science

You can always ask for help: [email protected]

Please inform the university library when you publish a data set
