Data Management at The Neuro

Provided here is a brief overview of the main steps in research data management, followed by a list of services available to Neuro researchers for each step, along with technical details for each service. The emphasis is on general tools applicable across various neuroscience sub-fields and data modalities. This document will develop as research data management practices become more common at The Neuro and further sections will reflect this by adding modality-specific standards, platforms, tools, and practical case studies on data management and sharing at The Neuro.

This section is complementary to the documentation and support provided by 91 Digital Research Services (DRS) for Research Data Management and the generation of Data Management Plans. While the 91 DRS offers guidelines for data management and sharing for the whole 91 community across research disciplines, this document focuses on tools and services available to researchers at The Neuro and of particular relevance for neuroscience.

open science best practices for data sharing - planning, storage, processing, archive, sharing

1. Planning

When planning a research project, researchers should anticipate how the data will be managed and shared in the course of the project and after its completion. The Data Management and Sharing Plan (DMP) is a document that helps structure this process. 91 Digital Research Services offer information, training and consultations to support researchers in planning for data management and producing their DMP.Their resources include tools to determine whether a DMP is required by funding agencies, as well as templates and guidance to streamline the writing process.

2. Active Storage (Hot Storage)

Organize and securely store data during the active phases of the project, such as data collection and analysis. Choose services that provide sufficient storage capacity for the project's needs and can easily integrate with the computing infrastructure you plan on using for data analysis.

3. Processing

Clean, analyze, and prepare data, on your own or with close collaborators. Use services with advanced data processing capabilities, and which can easily communicate with where your data is stored.

4. Archive (Cold Storage)

Preserve data for the long-term, after the active phase of the project. Opt for platforms that ensure long-term accessibility and secure backup. Because data processing and frequent access is no longer needed, prioritize platforms or storage options that do not necessarily enable fast access to the data or direct link with compute power, also referred to as “cold storage”, which generally reduces costs.

5. Sharing

Make data available to others, through platforms that facilitate data discovery and support open or controlled access with persistent identifiers and proper licensing features. Data sharing best practices and platforms are covered in more detail in the next section “Data Sharing”.

Data storage and archiving

The following storage services are readily accessible to members of The Neuro. While all these services provide storage, it is important to keep in mind two questions to help with decision-making:

  • Will the data be processed using High Performance Computing (HPC)? In which case it might be easier to start with the next section (processing) and decide then on the storage solution that best fits with the chosen processing service.

  • Is the storage required for data that is still being worked on (active storage), or for long-term storage post-project completion (cold storage)? The former comes with faster access and often additional processing capabilities, while the latter will reduce cost. The majority of services listed here provide both options.

The McConnell Brain Imaging Center (BIC) is The Neuro neuroimaging facility. They provide on-site storage solutions and support custom implementations.

Digital research alliance of canada

The (DRAC) is a non-profit organization funded by the government of Canada. Theyprovide all PIs with a default allocation which can be extended for approved projects.

next cloud logo

Researchers at The Neuro have access to a DRAC-based institute-wide storage allocation managed by the Bioinformatics group and accessed through a .

sharepoint

is a Microsoft cloud file management platform. Through 91 IT, researchers can request creation of a site for their group or project.

The following table presents each service in further detail, including the fee structure, size of the storage space and main strengths.

Duration

Cost

Data Size

Additional Information

BIC [request]

Unlimited

Paid

Tens to hundreds of Tb.

Robust backup options; user-friendly Nextcloud interface; linked to compute.Ideal for custom, on-site solutions integrated with the local ecosystem

DRAC []

Unlimited for default allocation

Free

Tens to hundreds of Tb

Globus1 endpoint; linked to compute power.Great for technical users to access large pools of compute power

DRAC-Neuro

[]

Unlimited

Free

Tens of Tb

User-friendly Nextcloud interface; easy data transfer to other Neuro teams.Excellent for datasets that need to be accessed by multiple Neuro teams

SharePoint

[]

Unlimited

Free

24Tb per site

Office integration; Team collaboration; user-friendly interface.Optimal as a lab file management and collaboration system

  1. is a software that allows non-technical users to perform easy and optimized data transfer between supported entities such as DRAC.

In addition to this list, each staff member has access to 1Tb of storage on OneDrive. While not designed for effective data management, it can still be preferable to storage on external drives that can be lost or damaged. Access to the data could however prove difficult would the staff member leave the 91 community, as such this should not be a long-term solution and, other services listed in this section should be prioritized.

Data processing

While efficient and robust storage is the foundation of any good data management plan, it also enables the processing of data through powerful remote clusters of machines or High-Performance Computing (HPC). The following section presents easily accessible data processing options for users at The Neuro and how to request it.

The McConnell Brain Imaging Center (BIC) is The Neuro neuroimaging facility. They provide local access to clusters of computingpower.

Digital research alliance of canada

The (DRAC) is a non-profit organization funded by the government of Canada. The provide non-priority access to clusters of computingpower, and priority access for approved projects.

cbrain logo

is a web-based platform developed at the MCNI to facilitate use of HPC resources for large projects.

neuro bioinformatic core facility

The supports members of the neuro on matters of data workflows and analysis with expert advice and implementation.

The following table presents each service in further detail, including technical specifications and additional usage information.

Cost

Technical spec.

Additional information

BIC [request]

Paid

Here

Local clusters available, contact for more information.Ideal if custom hardware or software setup integrated within the local ecosystem is required.

DRAC []

Free

200 core-years*default. For more, apply by October 31st for the following spring.Great for technical users to access large pools of compute power

C-BRAIN []

Free

N/A

Available through Neurohub, simplifies the use of HPC platforms.Excellent to greatly simplify job submission to a DRAC allocation

Bioinformatics []

Paid

N/A

Contact for supporton data management and analysispipelines. Optimal for support regarding the management of large datasets and creation or customization of analysis pipelines.

* Core-Year: The equivalent of using one core of a CPU continuously for a full year.

Project management

Platforms have been developed to support the management of data-centric projects regardless of storage and processing services used. These provide additional administrative functionalities such as custom graphical interfaces, access control or analytics.

Logo pour LORIS, un logiciel de gestion de données et de projets basé sur le web pour les études de recherche en neuroimagerie.

(LORIS) is a web platform for the management of longitudinal studies.

Neuro hub

is a web-platform financed by HBHL to provide turn-key data management solutions.

Open science framework logo

(OSF) is a web-platform developed by the for the management of open-science projects

The following table presents each service in further detail, including the fee structure, size of the storage space and main strengths.

Cost

Additional Information

LORIS []

Free for default setup

Ideal for longitudinal studies especially with a clinical component.

Neurohub []

Free

Excellent for studies that depend on the use of HPC

OSF[]

Free

Great for studies that follow an open-science framework (pre-registration, pre-print, etc.).

About this document

Unless otherwise indicated, all content on these pages is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). Please attribute it to TOSI (the Tanenbaum Open Science Institute), this web page, and the contributors listed below.

Creative Commons

OPEN SCIENCE EVENTS

There are currently no events available.

The Neuro logo91 logo

The Neuro (Montreal Neurological Institute-Hospital)is a bilingual academic healthcare institution. We are a91 research and teaching institute; delivering high-quality patient care, as part of the Neuroscience Mission of the 91 Health Centre.We areproud to be a Killam Institution, supported by the Killam Trusts.

Back to top