5 Takeaway points for Open Science Festival, Netherlands

The Open Science Festival at Vrij Universiteit Amsterdam inspired many ideas. Here are a few that struck me (mainly from the strategic rather than research practice perspective)

  1. To do open science properly, we need to break down the boundary between research and research support staff. The new roles required fit in between these old boundaries (Read https://www.nature.com/articles/d41586-022-01081-8).
Casper Albers (Uni Groningen) making his statement over software sustainability
  1. There’s a young, bubbling ecosystem of platforms and ideas that seek to fundamentally change the process of publication. At the core of this is updating the position of peer review (good slides at https://zenodo.org/record/7040997) But, many outside the open science system are unaware of this, assuming that the journey is complete when are 100% of journals are open access.
  1. How to achieve change? There are discussions over vision policies, strategies, seed funding, sustainability. The words used are different, but the notion of a fusion of top down and bottom up approaches is popular. (One good example at https://zenodo.org/record/7025049)
  1. Change is not just about the world of science. For the benefits of open science to blossom, much better dialogue is needed with the broader public. Without dialogue, the risk of misunderstanding of (open) science will grow. (See https://zenodo.org/record/7038872 for more)
Panel members discuss Open Science policy

5. And nothing will really change without changing the incentives – how do we reward and recognise all staff involved in (open) science?  (Dutch approach – https://recognitionrewards.nl/)

Survey report on long-term impact of Carpentry workshops at TU Delft

Written by: Meta Keijzer-de Ruijter

Image ‘ Survey’ by Nick Youngson CC BY-SA 3.0 Alpha Stock Images

Introduction

Digitalisation offers a lot of opportunities for researchers to boost their research: automation of repetitive tasks, collection of more (complex) data, increase in computing capacity, re-use of data and code plus new ways to collaborate with colleagues within and outside of the research group.

At TU Delft, Library Research Data Services and ICT/Innovation are working closely together to define a vision and strategy to improve the support for developing digital competencies amongst the research community. In this process, we are looking at workshops and activities that are already being organised to assess their impact and possible improvements. With that aim in mind, a survey was developed to be sent to former participants of Data and Software Carpentry workshops. Our survey was based on the standard questions of the long-term impact survey that The Carpentries use, but we added questions on the actual usage of the tools taught and additional learning needs on the topics that were addressed in the workshops.

The survey was sent to 315 former participants of Data and Software Carpentry workshops held by TU Delft between 2018 and the summer of 2021. We received 45 responses. In addition, the feedback from both instructors/helpers and participants over the years was considered in this report.

Survey Outcomes

Demographics and Scope

Most of the respondents were PhD candidates (37 out 45) that attended the workshops during their first or second year (21 out of 37). Respondents came from all faculties, but the majority from Applied Sciences (8), Civil Engineering and Geosciences (6), Architecture and the Built Environment (6) and Mechanical, Maritime and Materials Engineering (6).

Most respondents provided feedback on the software carpentry workshops (30 of 45).

Main Takeaways from Carpentries

The top three takeaways from the carpentry workshops, according to the respondents, were:

  • Using programming languages including R, Python, or the Command Line to automate repetitive tasks (16)
  • Making code reusable (12)
  • Using scripts and queries to manage large data sets (10)

The workshop(s) helped them to improve their efficiency, data analysis and data management skills.

It should be noted that eight respondents wrote that they are not using any of the tools. The main reason for that was that they use alternative tools that better match their practice or that are easier to use on an ad hoc basis.

Relevance of the content

Software Carpentry

Most of the respondents found programming with Python (partly or very) relevant. But there is a lot of debate on the level of Python being taught. Although it was communicated that the level is very basic beginner, a recurrent note (also in the surveys right after the Software Carpentry Workshops) was being made that the topics addressed are too basic and the pace of the course too slow for those who want to refresh. The reason they still attended was that in order to get the full Graduate School credits for this course, everyone should attend all parts of the carpentry. In an online setting this seemed to result in less engagement in the breakout rooms.

Figure 1 Response to the question ‘How relevant was the content of the Software Carpentry for your research’? (n=22).

Data Carpentry Genomics

In the Genomics carpentry the command line part was found most relevant to the participants, closely followed by data wrangling.

Figure 2 Response to the question ‘How relevant was the content of Data Carpentry Genomics for your research?'(n=3)

Data Carpentry Social Sciences

In this data carpentry the data analysis and visualization with R is most valued. The relevance (and eventual) use of OpenRefine was questioned. None of the respondents (n=5) in the long-term survey reported using OpenRefine in their current work.

Figure 3 Response to the question ‘How relevant was the content of Data Carpentry Social Sciences for your research?'(n=5)

Use of tools that are taught in the carpentries

In the survey, respondents were asked about their usage of the tools that were taught in the workshops. Did they start using the tools and to what extent, or did they quit using the tools or never even started using them. On each tool they were also asked if and how they would like to advance their skills.

Unix

The use of Unix was picked up by a good number of respondents, those that didn’t use Unix mentioned that it did not serve their purpose. Two respondents stated that they did not feel confident enough to use it. Additional materials for self-paced learning or help with how to apply it in their work might help them get started.

Python

It appeared that the respondents that already used Python increased their usage from occasional to more frequent. Those who reported not using Python stated that they use an alternative tool.
Both frequent and occasional Python-users would like to improve their skills either by attending more advanced workshops, self-paced materials or talking with peers. Examples of topics they are interested in include data analysis, (big) data management, packaging, Jupyter Notebooks, libraries (including scikit and numba), handling large databases and automating tasks.

R

Only three people responded to the usage of R question, one of them started using R from that moment on, the other two report that they were using a different tool to do their data management. The respondent that used R said that additional training, learning materials and talking to peers would be appreciated to improve the skills.

Git

Amongst the respondents the use of Git increased. Those who didn’t use the tool (7 respondents) said either that it didn’t fit their purpose (4 respondents) or that they did not feel confident enough to start using it (3 respondents). This last group would like access to additional learning materials, guided practice or consultation on how to apply Git in their own work.
Most of the Git users would love to improve their skills in various ways. Topics that are mentioned are dealing with complex files, comparing different versions of code, collaborative use of Git and building more confidence using Git.

OpenRefine

None of the respondents (n=5) used OpenRefine after the workshop. Three of them stated that the tool doesn’t fit their purpose, while two respondents would like to have additional materials or consultation on how to apply it in their work.

Spreadsheets

The use of spreadsheets remained about the same after the workshop. And the respondents did not feel the need to improve their skills.

Cloud Computing

None of the respondents (n=3) had been using this prior to the workshop. After the workshop two of them started using it and they would like to learn more. No specific topics were mentioned.
The respondent that did not start using cloud computing stated that it did not fit his/her purpose.

Additional training on research data management or software development

Respondents could tick multiple boxes in this question about additional training needs. The top five training topics were:

  • Basic programming – 14 votes
  • Modular code development – 13 votes
  • RDM workflows for specific data types – 13 votes
  • Software versioning, documentation and sharing – 11 votes
  • Software testing – 10 votes

Impact of attending the carpentry workshops

Most respondents agreed that attending the carpentries had an impact on their work. They gained confidence in working with data, made their analysis more reproducible, were motivated to seek more knowledge about the tools, advanced their career, improved their research productivity and improved their coding practices.
The only topic where there was a wide spread of (dis)agreement was on the impact of the carpentry workshops on the professional recognition of their work.

Recommendation of the Carpentry workshops

Most respondents recommended or would recommend the carpentry workshops to their colleagues.

Our Conclusions

From the survey we learned that the carpentries fulfill an important role in the introduction of tools that help members of the research community to carry out their work more efficiently, but additional means and support are necessary. The translation of the carpentry materials to the daily practice can be challenging, causing some to quit or not even start using the tools. In the discussions with observers, helpers, and instructors of the software carpentry workshop we identified the need to link the carpentry topics more clearly to the research workflow, in order to increase the understanding of the ‘why’ we teach these topics and tools.

And of course..

We (TU Delft) would like to thank all those who participated in the survey….

Open Science is like a buffet*: take what you can and what benefits you now – come back for more!

Author: Esther Plomp

This overview highlights resources that are available for TU Delft researchers in their Open Science journey. Please see the poster for the full visual representation with the links embedded.

Open Data

Data Steward & Data Champions

Research Support TU Delft Library

Digital Competence Center (Data Managers)

Managing and Sharing Data in 2021

Policy: Research Data at TU Delft & Faculties

Open Software

GitHub/GitLab (& 4TU.ResearchData/Zenodo integration)

Support from the Digital Competence Center (Research Software Engineers)

Training: Carpentries/Code Refinery

TU Delft Research Software Policy facilitates sharing of Research Software

Open Engagement

Engagement is part of the TU Delft Core values (DIRECT)

‘Outreach and public engagement are core elements’ (TU Delft strategic priorities 2022)

Online discussion and presentation platforms such as the Virtual Science Forum

Open Education

A new policy on Open Educational Resources will be ‘a starting to point to make Open Education the default approach for teaching at TU Delft’ – TU Delft Strategic Priorities 2022

MOOCs / Open Course Ware

Open Publishing

82% Open Access

TU Delft Open Publishing

TU Delft Open Access Fund (up to €2000 for gold open access)

Publishing deals (check the Journal Browser)

Support for sustainable publishing (SciPost)

How to publish Plan S compliant?

You share, we take care (Taverne)

Open Participation

‘There is no open science if science is not open to all’ (Whitaker and Guest 2020)

Open Science Community TU Delft (@OSCDelft)

Diversity and Inclusion at TU Delft

Citizen Science

Open Methods

Electronic lab Notebooks (RSpace/eLABjournal)

protocols.io (PLOS)

Open Evaluation

‘Open science and education can play an important role in improving the quality of our work and stimulating the use of our knowledge and findings by others’ – TU Delft Recognition & Rewards Perspective 2021 – 2024

The Dutch position paper Room for everyone’s talent aims to recognise a wider range of academic contributions

Strategy Evaluation Protocol (SEP) – 2021-2027 has Open Science as a main assessment criteria

Open Hardware

Open Hardware Community Delft (@DelftOpenHW)

More information:

TU Delft Open Science Programme 2020-2024

tudelft.nl/open-science

the-turing-way.netlify.app

openlifesci.org

*The term Open Science Buffet has been coined by Christina Bergmann in 2019.

Data Stewardship at TU Delft – 2021 Report

Authors (listed in alphabetical order by the first name): Arthur Newton, Diana Popa, Esther Plomp, Heather Andrews, Jeff Love, Kees den Heijer, Lora Armstrong, Nicolas Dintzner, Santosh Ilamparuthi, Yan Wang, Yasemin Turkyilmaz-van der Velden

Advancing data stewardship

The Data Stewards team at TU Delft has finished another busy year. As changes and remote working became the new norm, the team carried on the success with great adaptability and maturity in 2021. In this report, we review the activities done in the past year and acknowledge the achievements as a team.

More transitions and finally a complete & bigger team!

The team transition mentioned in the 2020 report has continued in 2021:

  • In February, we made the team complete by welcoming Diana Popa, the new Data Steward of the Faculty Architecture and the Built Environment.
  • Sadly in April, we had to start saying farewell to Kees den Heijer, the Data Steward of the Faculty of Civil Engineering and Geosciences. Kees was one of the first Data Stewards at the start of the Data Stewardship program. Thankfully that Kees’s new position is within the Research Data Services (RDS) team at the library and he could still offer essential RDM support to the faculty before the new Data Steward is in place. Nicolas Dintzner, the Data Steward of the Faculty Technology, Policy and Management, also kindly helped with some requests with ethical approval required.
  • From the end of May till the end of September, the coordinator Yan Wang was on maternity leave. The coordination was done in a joint effort by the whole team and colleagues from the RDS team. The team’s internal coordination was shared by the Data Stewards. Some Data Stewards were taking lead in coordinating with other research support teams according to their engagement and interests in relevant topics.
  • In October, the team was finally complete. We are proud to have Armstrong and Newton on board. Lora Armstrong is the new Data Steward of the Faculty of Civil Engineering and Geosciences. Arthur Newton is the first Data Steward of QuTech. Now the sky is not even the limit anymore!

Due to COVID restrictions, the team has continued working virtually during the year, and we did not have a chance to have a proper farewell and welcome.

Strengthening connections with other research support teams

The RDM support provided by the Data Stewards team is a joint effort with valuable input from numerous research support teams. We have established good connections with many of them since the start of the data stewardship. In 2021 we further strengthened the link with those who are already working with us and reached out to others to build up more connections.

  • The privacy team, Human Research Ethics Committee (HREC), and ICT innovation have been our close partners in the university-level Personal Research Data (PRD) workflow. The communication between the Data Stewards and these teams became further regulated on a bi-weekly or monthly basis during last year.
  • The Data Stewards team has also established closer contact with the Innovation and Impact Centre (IIC). The IIC has teams of grant officers and project managers who help researchers with grant proposals and project coordination. In collaboration with the former Open Science community engagement manager Emmy Tsang, a few data stewards contributed to a series of collaborative activities:
    • A mini-workshop on Data and IP for project managers
  • The Data Stewards have been in close collaboration with the DCC team on various activities since the start of the DCC at TU Delft. Since October 2021, the coordinators of both teams were in place and started to keep each other updated.

Team achievements across all faculties

Despite different disciplinary demands among faculties, there are a few common activities performed by DSs across all faculties. As a team, we continued to provide the following types of support which form the foundation of the RDM support from the team.

Data management consultation

  • Consultation is still the main channel to deliver the RDM support. From all faculties, more than 1200 requests from researchers were received in 2021. This shows another significant increase (approximately 50%) compared to the requests received in 2020. While Data Management Plans (DMPs) were still the majority of the requests, there was a broad range of questions on data storage, sharing, licensing, privacy, tooling, and others.  

Training & Education

  • Data Stewards continue to get involved in RDM training at both the faculty and university level. Many Data Stewards contributed to the software carpentry and data carpentry workshops as instructors, helpers,s or coordinators. RDM training at the faculty level is provided in various formats, such as informative sessions at the individual, group, or department level, RDM courses for PhDs, or discipline-specific training workshops.

Policy & Strategy

  • The team has been working on publishing and implementing faculty-level data management policies since 2019. In early 2021, all faculties (except QuTech whose Data Steward only started in October 2021 but already started working on the policy draft since then) have published their policies. The team also provided valuable input and facilitated the consultation of the TU Delft Research Software Policy and the Guidelines on Research Software which were published in March 2021.

In addition to the above-mentioned common achievement of the whole team, each Data Steward further expanded their disciplinary support according to faculty needs.

Faculty of Aerospace Engineering

Disciplinary RDM support

  • Co-developed the ASCM Code Initiative which has consisted of developing an instructing wiki and giving training sessions on proper coding practices for ASCM researchers. This is expected to continue this year and probably expand to more faculty sections.
  • Performed specific training sessions for different groups to improve their coding management skills (version control).
  • 1 Astronomy Data Carpentry workshop (at the national level in collaboration with Netherlands Institute for Space Research SRON, Leiden Sterrewacht, and Netherlands eScience Center), and 4 code management workshops (at the faculty level).
  • Became the Secretary of the Data Carpentry Astronomy curriculum (https://carpentries.org/curriculum-advisors/)

Event and community engagement

  • Expanded the faculty Data Champions from 13 (one left TU Delft) in 2020 to 19 in 2021
  • Instructor in Train the Trainer FAIR and Reproducible Code workshop for 4TU Data Stewards.
  • Led sessions on Data Curation for the Consorci de Serveis Universitaris de Catalunya
  • Invited speaker at 8 (online) events showcasing Data Stewardship at TU Delft and recommended research data management practices (for Helis Academy, Graz University of Technology, Universitat Oberta de Catalunya, Universidad Nacional de Costa Rica, LA Referencia, Universidad Internacional de Ciencia y Tecnología de Panamá and Agencia Nacional de Investigación y Desarrollo de Chile).
  • Hosted a Data Steward Interest Group meeting (https://www.dtls.nl/about/community/interest-groups/data-stewards-interest-group/)

Data Stewardship coordination

  • Performed an Informative Session on Open Science and Research Data Management requirements for research funding officers of the TU Delft Innovation & Impact Center.

Faculty of Applied Science

A full overview of Data Steward activities is summarized nicely in this blog: https://openworking.wordpress.com/2021/12/24/gridding-through-2021/

Disciplinary RDM support

  • Implemented the Open Life Science programme into the Faculty Graduate School so that Ph.D. candidates from Applied Science can follow the top training on Open Science practices for credits!
  • Part of the ELN project group

Event and community engagement

  • Invited to provide Research Data Management training to the SeaChanges project, Helis Academy, DCC Spring Course
  • Mentor/expert for the Open Life Science Programme
  • Project Member of The Turing Way
  • Co-chair of RDA group: Physical Samples and Collections in the Research Data Ecosystem IG. This year the group organized a webinar series and created a 23 Things on Physical Samples overview.
  • Participant in the CSCCE Community Champions meetings
  • Was an (invited) speaker at 9 events, programme committee member or (co)organizer of 11 events, workshops, or sessions, and presented 1 poster

Self-development

  • Attended courses on Leadership and online community management

Research and publication

  • (Co)authored 10 blog posts on Open Working, The Open Archaeobotanist, Data Horror Stories, Software Sustainability Institute, and Code for Society
  • Published a data paper:

Faculty of Architecture and the Built Environment

Event and community engagement

  • Participated in Working groups:
    • WG with the IIC on project support for the new Horizon Europe program:  Next steps Open Science/Open Data/Ethics Horizon Europe;
    • Privacy and GDPR Working group;
    • IAM soundboard group
    • Measuring the Adoption of Fair data practice Working group
  • Helped organize the Delft Digital Humanities community meetings.
  • Participated in DCC events and training.
  • Participated in Online events (selection):
    • Organized by the IIC:
      • Demystifying data management and ethical considerations for grant applicants;
      • Business partnership Support for research collaborations;
      • GPS: get your bearings in funding land –  Get Prepared for Subsidies.
    • CESAER events:
      • First experiences with EU funding programmes from 2021 to 2027;
      • do no significant harm’ principle and its proposed implementation by the European Commission.
    • NWO Webinar FAIR: It Takes a Village.
    • SURF Research Bootcamp.
    • DANS Data Trail Workshop – FAIR data assessment tools: an evaluation.
    • LCRDM: Network Afternoon LUSTRUM Edition of the data support collective.
    • “FAIRsFAIR roadshow.
    • Research and Innovation days.
    • UN Open Science Conference.
    • Open Science Coffee Delft: Diversity & Inclusion in Research, Technology, and Design.

Self-development

  • Completed the Scopus Certification Program for Librarians.

Faculty of Civil Engineering and Geosciences

Due to the change of the Data Steward, there was a 5-month gap in providing the faculty support. Nevertheless, there was still significant additional work done by (both) the data stewards.

Disciplinary RDM support

  • Internal presentation to the Transport & Planning department
  • Contribution to Open Science paragraph for Midterm Civil Engineering

Event and community engagement

  • Participated in FAIR and Reproducible Code Working Group
  • Provided input for the Information Literacy masters course being designed by the library
  • Supervising data managers for DCC
  • Involvement in EPOS-NL/4TU.ResearchData/UU collaboration
  • Involvement in the Digishape initiative

Faculty of Electrical Engineering, Computer Science, and Mathematics

Disciplinary RDM support

  • Developed and delivered training sessions on the handling of personal data in Research for Masters Students
  • In conjunction with researchers at IEPG helped develop research group-specific data management structure and workflow.
  • Contributed to the development of “Beyond Essentials” – RDNL course focused on privacy and GDPR (development led by Marlon Domingus from Erasmus University)
  • Project lead Open Hardware project within the Open Science Programme, 40,000 Euro budget for developing Open Hardware activities
    • Organized multiple workshops (with a team)
      • A hands-on Online Open Hardware workshop at the Open Science Festival
      • In conjunction with the DCSE a beginner and an intermediate-level workshop on cluster computing using Raspberry Pis
    • Created the position and hired a Research Hardware Engineer (RHE) – first of a kind role for supporting open hardware activities (to the best of my knowledge)

Event and community engagement

  • Invited speaker at Helis Academy, Graz University of Technology
  • Active in three working groups as part of the 4TU community
    • Led the Privacy and GDPR Working group
    • FAIR and Reproducible code WG
    • Engagement and Education WG
  • Co-chair of RDA working group: Discipline-specific Guidance for Data Management Plans
  • Session organizer at the RDA’s 17th & 18th Plenary Meetings
  • Invited as a guest on the R2OS (Road to Open Science) podcast from Utrecht University
  • Part of the team that created and published the FAIRly Open After Dark podcast series, which dealt with Open Science, FAIR Data, and Academia in general from the perspectives of different stakeholders.
  • Participated in the Health RI conference 

Data Stewardship coordination

  • Collaborated with the privacy team and HREC on aligning workflow for research projects that handle personal data.

Faculty of Industrial Design and Engineering

Disciplinary RDM support

  • Co-taught two sessions of ‘Ethics and Research Data Management’ module for the IDE Ph.D. research school
  • Co-developed and taught a BSc course on ‘Data as a Design Material’
  • Developed and taught a module on ‘Responsible IoT Design’ for the BSc course on ‘Software-Enabled Products’

Research and publication

Event and community engagement

  • Attended multiple Design & Digital/Computational Humanities conferences and events, most notably those from DH Benelux featuring practices and practitioners from the Benelux Regions and Dutch Design Week in Eindhoven

Faculty of Mechanical, Maritime, and Materials Engineering

Disciplinary RDM support

  • Further developed and regularly delivered Data Management Plan Training for 3mE Ph.D. students 
  • Contributed to the development of and taught at the MSc course ‘Introduction to Engineering Research’
  • Taught at the BSc programme ‘Clinical Technology’
  • Co-developed and co-delivered the Workshop on FAIR for Material Design
  • Member of a cross-TU Delft working group (involving the Library and ICT) about Electronic Lab Notebooks (ELN)s which:
    • Offers ELN licenses to interested researchers
    • Organization of regular community events

Event and community engagement

  • Organized and presented (departmental) information sessions about Research Data Management, Research Software Policy, and the Open Science Program
  • Co-organized and presented an Open Science Session for 3mE Ph.D. students
  • Member of the 4TU.ResearchData FAIR and Reproducible Code, Privacy and GDPR, Engagement and Education working groups
  • Co-chair of RDA working group: Discipline-specific Guidance for Data Management Plans
  • Was an invited speaker or a session organizer at seven (inter)national conferences / training sessions / webinars (RDA’s 17th & 18th Plenary Meetings, NWO Life Conference, Material Pioneers Webinar, National Turkish and Hacettepe University Research Data Symposiums, Turkish Open Access Week) and acted as a Programme Committee member of two conferences (National Turkish and Hacettepe University Research Data Symposiums)
  • Was invited to serve on the jury of the Eurac Open Research Award

Research and publication

  • Co-authored the Whitepaper: The Future of FAIR.
    • Khodiyar, Varsha; Laine, Heidi; O’Brien, David; Rodriguez-Esteban, Raul; Turkyilmaz-van der Velden, Yasemin; Baynes, Grace; et al. (2021): Research Data: The Future of FAIR White paper. figshare. Journal contribution. https://doi.org/10.6084/m9.figshare.14393552.v1  

Data Stewardship coordination

  • Initiated set up a guidance document about data management and open science sections in Horizon Europe proposals in collaboration with the Open Science Community Manager and Data Stewards
  • Took part in a collaboration with the TU Delft Innovation & Impact Center to create awareness about the Open Science and Research Data Management requirements in Horizon Europe and presented/contributed to two information sessions 

Faculty of Technology, Policy, and Management

Disciplinary RDM support

Event and community engagement

  • Launch of the “ReproJuice” video game with the Library
  • RDM training instructor at the Summer School in KTU (Lithuania)
  • Active in 3 working groups of the 4TU community:
    • FAIR Code (GitHub pages training, organized session on container tech for Open Science).
    • Education and Engagement
    • Privacy

Research and Publication

Data Stewardship coordination

  • Collaborate with the privacy team and HREC on aligning workflow for research projects that handle personal data.

QuTech

The QuTech Data Steward started in October 2021. Within the short period left in the year, he already contributed to the training at the university level, actively provided RDM consultation, and reached out to most research teams in the faculty. Furthermore, he has drafted the QuTech Data Management policy and planned to have the faculty consultation (this policy has been published in April 2022).

Looking forward

A few lessons learned from this year can help guide us for the coming year(s). We should do better in documenting our Data Stewards’ knowledge base on RDM-related information within the organization as a guide for both researchers and Data Stewards. This would especially be helpful for new DS on boarding.

We are still in the process of further shaping the Data Steward work scope to handle increasing and more complex RDM demands. Besides all the RDM support activities, the team has also been actively engaged in discussions about the Data Stewardship model, the profile, and the career paths for Data Stewards.  Every Data Steward is encouraged to explore their own ‘Data Stewardship’ within the faculty. This does not just include working on disciplinary RDM support, but also exploring organizational solutions to sustain and expand the Data Stewardship support within the faculty. Faculty level Data Stewardship implies that a senior role of the current faculty Data Steward should play. This requires a corresponding recognition regarding the Data Steward profile and progress paths. We are all motivated to do more, meanwhile appreciate the rewards and recognition of the work we do. There is definitely more room for attention and efforts on the professional growth of Data Steward and Data Stewardship.


The theme of Artificial Intelligence at Surf Research Week

There was considerable focus on AI, and its implications for research and research support, at the Surf Research Week in Utrecht this week (10 May 2022). Here’s a brief set of bullet points on the discussions I noted:

Surf Research Week – Image from main conference hall

There’s too much focus on the negative examples of AI. How can we do more to demonstrate the positive benefits? And ally that to greater transparency in how AI functions.

Also, how can we intelligently reflect on the applied uses of AI? AI should not be viewed as a panacea for all social and research challenges; we need to gain the critical insight to understand when it is (and is not) the right methodology to be applied.

AI changes the nature of how research projects are organised. Also: if AI permits researchers to make discoveries about things they were not even aware of, should we even think of AI itself as a collaborator in research projects, rather than a methodology that we use?

In any case the need for collaboration between people with different knowledge becomes even stronger with AI. There are wide-ranging skills that are needed to deploy AI within a project – not just in technical terms, but in terms of data science and management, and embedding the ethical context. In particular, ensuring an ethical footing for AI projects with potentially profound social implications should involve the right philosophical expertise. The related need for management and archival skills in curating and documenting the datasets that underpin AI.

And also efficiencies and expertise provided by high-quality software engineering can drastically time and money when it comes to deploying the computer power needed for AI. A familiar discussion arose – should AI expertise be situated locally or nationally? Should it be generic or focussed on specific subject areas?

The tension within the AI community between acceleration and regulation. On the one hand, global challenges desperately required the novel ideas that AI can provide – let’s move quickly! On the other hand, we need standardisation to be able to deploy AI in a sustainable and regulation to provide the necessary ethical context. Let’s get this sorted out.

Are they distinctions between AI algorithms and datasets that area developed within the research community, and buying AI as a service from third parties? What does the latter mean for issues such as reproducibility, ethics? What are the implications for university procurement departments if a whole university wants to use AI as a platform.

(Thanks to SURF for organsing the event, and to the panellists and workhsop speakers who contributed so many ideas Emily Sullivan (Tu/E), Nanda Piersma (HvA), Maarten de Rijke (University of Amsterdam), Damian Podareanu (SURF), Antal van den Bosch (Meertens Institute) Sascha Caron (Nikhef) Matthieu Laneuville (SURF)

A Data Steward journey 

Author: Esther Plomp 

When I started as a Data Steward at the Faculty of Applied Sciences I attended the Essentials 4 Data Support course to learn more about research data management support. I was therefore happy to accept Eirini Zormpa’s invitation to discuss my Data Steward journey with the participants of the Essentials 4 Data Support course. Together with Zafer Öztürk from Twente University we shared our experiences during the data supporter panel on the 14th of April. This blog post is a summary of what I highlighted during the panel. 

The Essentials 4 Data Support course is an introductory course about how to support researchers with the management and sharing of research data. The course materials are helpful to gain an overview of what research data management support entails. The course also provided an opportunity to link up with peers (such as Lena Karvovskaya) and meet experts (such as Petra Overveld).

The role of a Data Steward visualised by The Turing Way. A Data Steward can facilitate the exchange of data, identify gaps in services, provide insights in best practices and point researchers to existing tools that they can use. This image was created by Scriberia for The Turing Way community and is used under a CC-BY licence.

In December 2018 I started as the Data Steward at the Faculty of Applied Sciences. In my first couple of months I had the privilege to be peer-mentored by Yasemin Türkyilmaz-van der Velden, who showed me the ropes of data management support. Initially, I had to get to know the position, the workings of the faculty, my new colleagues and the researchers I was now supporting. 

In this first year I worked together with Yasemin on our Faculties Research Data Management Policies, based on the TU Delft Research Data Framework Policy. This was an arduous process, as we visited all departments of our faculties. The policy was discussed with many stakeholders, including PhD candidates. In the beginning of 2020 the Applied Sciences Policy on Research Data Management was officially released! Yasemin and I also worked together in the Electronic Lab Notebook pilot that took place at TU Delft resulting in TU Delft licences for RSpace and eLABjournal

In 2019 I followed a Software Carpentry Workshop to learn basic programming skills so I could better support researchers with any software support questions. I later took the train-the-trainer course and became a Carpentries Instructor myself. By being a Carpentries instructor I can teach basic programming lessons set up by the Carpentries community. With the pandemic we had to shift these trainings online, and I coordinated these workshops for a year (2020-2021). 

Over the years, I also increasingly supported researchers with Open Science questions. This is an aspect of the role that I very much enjoy and currently try to expand upon. My role differs somewhat from the other Data Stewards at TU Delft: we each have our own preferences and areas of expertise next to data support (such as software, ethics, or personal data). Another difference is my involvement in a side project focused on PhD duration. At TU Delft and at my faculty we try to reduce the amount of time that PhD candidates take to finish their PhD project. While the official duration for a Dutch PhD is four years, the majority of PhD candidates take much longer. This often means that they have to finish the project in their unpaid free time. As someone who has spent seven years on a PhD project I can say that finishing your PhD next to a full time job is no joke. 

As a Data Steward I’m also a connection point in the university network. This allows me to address researcher’s questions myself or to connect them with the expert that they need. 

  • My position at the Faculty itself allows for close contact with researchers. Before the pandemic I regularly hopped between their offices to help them with any questions. At the Faculty I’m embedded in the support team where I work together with the Faculty Graduate School and the Information Coordinator. I’m in regular contact with project officers, managers and researchers from all levels at the faculty. 
  • As part of the Data Stewards team I meet the other Data Stewards once a week (virtually) and we communicate through Slack/Teams. 
  • I’m also in contact with colleagues from the Library and the Digital Competence Center, either through collaborative work or because they are the experts that can address questions from researchers. 
  • Sometimes I reach out to central experts from the Human Research Ethics Committee, the Privacy Team and ICT Security when needed. 

Next to my activities as a Data Steward at TU Delft, I’m also involved in several other initiatives that are revolving around data and open research:

Visualisation of mentoring, where you help each other in taking a step up the ladder. Image by Esther Plomp, created for an Open Life Science Programme blogpost on mentoring.

Over the years I very much enjoyed writing blogs like this one, summarising my experiences of conferences, activities and learnings. 

I very much enjoy the Data Steward role, for various reasons: 

  • I support researchers in making their research more transparent.
  • I work with amazing colleagues and collaborators 
  • I meet new people interested in similar topics.
  • I can continuously develop and learn new skills.
  • I have a lot of autonomy over my working activities and schedule.

A lot of this is made possible by a supportive manager, and many individuals that I learned from along the way. 

“Create the world you want, and fill it with the opportunities that matter to you.”

– Alicia Keys

My tips for people just starting in a data support role:

  • Accept that things can take more time than you originally anticipated. Starting in a new role will take some time to adjust and achieving cultural change in university processes will not happen overnight. 
  • The downside of being able to create your own opportunities is that there might be a lot of things that you want to do. Even if everything seems important or fun to do, it could mean that you will end up with too much on your plate. Sometimes it is good to say no to shiny opportunities. 
  • In whatever you do I would recommend you to not take the road alone and seek out others to collaborate with, or ask feedback from. Exchanging expertise and experience will not only be more efficient, it will make the road more worthwhile to walk.

Time to re-think the divide between academic and support staff

Credit: geralt, Pixabay, CC0

By Marta Teperek, Maria Cruz and Danny Kingsley

We are pleased to announce that our article “Time to re-think the divide between academic and support staff” has been just published: https://www.nature.com/articles/d41586-022-01081-8. The article speaks about the negative consequences of the divide between academic and professional support staff, and argues that this divide no longer makes sense as it is not conducive to a successful and effective research process.

By publishing this article, we hope to raise awareness about these problems, start discussions within the community and start identifying the steps which have to be taken to stop the divide. We would welcome your comments and reflections on the topic.

We also wanted to use this opportunity to express our gratitude to Jeff Love, Melanie Imming, Alastair Dunning and Shalini Kurapati for their crucial input and support throughout the process of conceiving this article. Their comments and reflections on the early drafts of the article, as well as the numerous constructive discussions we have had with them, were invaluable to us.

Finally, we also wanted to thank Connie Clare, Manuel Garcia, Hans de Jonge, Lena Karvovskaya, Esther Plomp, Diana Popa, Mark Schenk, Jeroen Sondervan, Emmy Tsang, Yasemin Turkyilmaz-van der Velden and Jose Urra for their comments and suggestions on an early draft of the manuscript.

Publishing a Data Article

Author: Esther Plomp. Contributions from Vicky Hellon, Achintya Rao, Yanina Bellini Saibene, and Lora Armstrong. 

This blog post has been adapted from the Turing Way (The Turing Way Community, The Turing Way: A Handbook for Reproducible Data Science. 10.5281/zenodo.3233853) under a CC-BY 4.0 licence.

A Data Article (also known as a Data Paper/Note/Release, or Database article) is a publication that is focused on the description of a dataset. It uses the traditional journal article structure, but focuses on the data-collection and methodological aspects and generally not on the interpretation or discussion of the results. Data articles are in line with the FAIR principles, especially since most publishers will encourage you to share the data through a data repository. The benefit of a Data Article is that your output will be peer reviewed, something which is generally not the case for datasets that are archived on data repositories. It also facilitates recognition for datasets through research assessment procedures that are more traditionally focused on publication output. Publishing a data paper will therefore increase the visibility, credibility and usability of the data, as well as giving you credit as a data producer (The Turing Way Community 2022).

Options to publish a Data Article

Below you can find some journals that publish data articles. The costs information was collected in February 2022.

DisciplinePublisher/JournalCost estimateDeals for TU Delft
AllExperimental Results£775 / €928100% APC discount for TUD authors
AllScientific Data€1790 No, but eligible for the TU Delft OA Fund
AllData in BriefUSD 500 / €440100% APC discount for TUD authors
AllChina Scientific DataRMB 3000 / €416No
AllData Science Journal£650 / €778No, but eligible for the TU Delft OA Fund
AllDataCHF 1400100% APC discount for TUD authors
AllGigaScience€1089No, but eligible for the TU Delft OA Fund
AllGigabyteUSD 350 / €308No, but eligible for the TU Delft OA Fund
AllF1000ResearchUSD 800 / €704No, but eligible for the TU Delft OA Fund
ArchaeologyJournal of Open Archaeology Data£100 / €120No, but eligible for the TU Delft OA Fund
ArchaeologyOpen Quaternary£300 / €359No, but eligible for the TU Delft OA Fund
ChemistryJournal of Cheminformatics0NA
Computer ScienceJaiiovariableNo
Earth SciencesGeoscience Data Journal€1450No, but eligible for the TU Delft OA Fund
Earth SciencesEarth System Science Data0NA
Earth SciencesBig Earth Data€910No, but eligible for the TU Delft OA Fund

For more journals that offer the Data Article format you can see the ‘Data Article’ section in The Turing Way.

Charting library use of open vs closed infrastructures

A few weeks ago, our one-time colleague Emmy Tsang, blogged on Rethinking the Scaffolds of Open.

Emmy argued that we (universities, libraries, the research and educational communities) need to make much more values-informed choices about the type of infrastructure we build and invest in; not just blindly reverting to commercial infrastructures because it seems the most convenient.

A first step of this process is actually realising what infrastructures are being funded and supported. Without a clear map of what a library or a university pays for, it’s difficult to make concrete actions. But even this tricky. Responsibility for contracts, services, tools tend to spread over many different people within an organisation

I was delighted to see that Jeroen Bosman from the University of Utrecht had made a rapid attempt to sketch the different areas where a university might be investing in infrastructure to support higher education and research.

This helped us at TU Delft Library make our own first attempts to sketch our use of different infrastructures.

The embedded Google sheet (see full version here) is written from the perspective of the library. It is a first attempt – there are other applications that are missing from this list.

It does not relate to the whole university. My colleague and university archivist Esther Maes reckons there are close to 700 applications used across the university.

This spreadsheet is therefore shared as a ‘conversation piece’. It shows

a) the huge number of infrastructures (in the broad sense of the word) a library engages with

b) that finding areas to prioritise is vital – where does one start with such a long list ?

c) in some areas, both open and closed solutions are used

So, where next?