Overview FilesFoldersOrganization - ULJ-Yale/qunex GitHub Wiki
This document provides an overview of the QuNex data hierarchy structure for an exemplar generic dataset analyzed using the QuNex toolset. The logic behind the data hierarchy specifications is to provide a clear and predictable structure so that both users and tools can reference data in consistent locations with a unified naming grammar, limiting the need for referential databases and metadata management. The data hierarchy is defined primarily for individual sessions. Group-level data hierarchy specification (e.g. analyses folder) is not fixed, thus enabling flexible internal organization according to specific analysis needs and goals.
- QuNex expects that each study contains a 'master' folder that contains all the data and results.
- Such a study would by default contain the folder hierarchy defined in
$TOOLS/python/qx_utilities/templates/study_folders_default.txt
:
studyfolder # -- Overall study base folder
βββ analysis # -- The base folder for group data analysis and results that follow basic (pre)processing
β βββ scripts # -- Code for analyses that use preprocessed data would go here
βββ processing # -- The base folder for content related to data (pre)processing
β βββ logs # -- Folder storing all study processing logs
β β βββ batchlogs # -- Default log location for output captured from runs scheduled on cluster nodes
β β βββ comlogs # -- Default log location for detailed output generated by the executed command(s)
β β βββ runchecks # -- Reports on completion of individual steps / presence of resulting files
β β βββ runlogs # -- Default log location for the executed QuNex command(s) with supplied parameters
β βββ lists # -- List files used for processing and analyses
β βββ scripts # -- Code for data (pre)processing would go here
β βββ scenes # -- Location for misc scenes used in analyses and QC
β β βββ QC # -- Location for additional custom QC scenes and associated files
β β βββ T1w # -- T1w QC scenes and associated files
β β βββ T2w # -- T2w QC scenes and associated files
β β βββ myelin # -- myelin QC scenes and associated files
β β βββ BOLD # -- BOLD QC scenes and associated files
β β βββ DWI # -- DWI QC scenes and associated files
β βββ batch.txt # -- Batch files describing processing and analysis parameters in a header followed by sessions' data information
βββ info # -- The base folder that stores various information and materials
β βββ demographics # -- Participant demographic information
β βββ tasks # -- Folder containing task-related information
β βββ stimuli # -- Folder containing task-related stimuli if used
β βββ bids # -- Study-level information in BIDS format
β βββ hcpls # -- HCPLS dataset related information
βββ sessions # -- The base folder that stores individual sessions' data
βββ inbox # -- Group .fidl and .conc files for processing and analyses
β βββ MR # -- Incoming MR data from the scanner
β βββ EEG # -- Incoming EEG data
β βββ BIDS # -- Incoming BIDS dataset
β βββ HCPLS # -- Incoming HCPLS dataset
β βββ behavior # -- Incoming behavioral data
β βββ concs # -- .conc files for the sessions
β βββ events # -- .fidl files for the sessions
βββ archive # -- folder with raw zipped data from the scanner for backup
β βββ MR # -- Archive of raw data MR data imported from the scanner
β βββ EEG # -- Archive of the raw EEG data
β βββ BIDS # -- Archive of the processed BIDS dataset(s)
β βββ HCPLS # -- Archive of the processed HCPLS dataset(s)
β βββ behavior # -- Archive of the raw behavioral data
βββ specs # -- specifications files to be used on the sessions in the study that include mapping files that provide MR and/or EEG data mapping info
β βββ <pipeline>_mapping.txt # -- These mapping files are used for mapping to pipeline specific structure
β βββ parameters.txt # -- batch acquisition parameter headers that are used to specify preprocessing parameters and
β # to generate processing `batch.txt` files which are ultimately stored in <study_name>/processing
βββ QC # -- folder with group-level quality control data for all sessions
βββ <session_id> # -- session specific folder
Note here that one can specify a custom study folder structure. To do this prepare your own folder structure specification and provide its path in the --folders
parameter of the create_study
command.
Each session will have a subfolder inside the main QuNex study structure which contains everything related to the particular session, below is its folder structure.
studyfolder # -- Overall study base folder
...
βββ sessions # -- The base folder that stores individual sessions' data
...
βββ <session_id> # -- session specific processing and analysis folder (naming formula is "<subject id>[_<session name>]", e.g "s12" or "s12_pre", "s12_post")
βββ session.txt # -- an information file describing the session's data generated following sorting of DICOMs
βββ session<pipeline>.txt # -- an information file describing the sessionsβs data with mapping for further pipeline processing
βββ inbox # -- folder with incoming raw data from the scanner
βββ nii # -- folder with original data in NIfTI format after initial import from DICOMs, BIDS or HCPLS input
βββ physio # -- folder with physiological recordings files
βββ behavior # -- folder with behavioral data
βββ QC # -- folder with session-specific quality control data
βββ bids # -- folder with subject and session specific data in BIDS format
βββ dicom # -- folder with sorted dicom images along with a log txt file documenting what was acquired
β βββ 1 # -- Hypothetical T1w scan DICOM folder
β βββ 2 # -- Hypothetical T2w scan DICOM folder
β βββ 3 # -- Hypothetical SpinEcho Field Map Phase Encoding Direction 1 (original)
β βββ 4 # -- Hypothetical SpinEcho Field Map Phase Encoding Direction 2 (reversed)
β βββ 5 # -- Hypothetical BOLD_1 acquisition
β βββ 6 # -- Hypothetical BOLD_1 acquisition
β βββ 7 # -- Hypothetical DWI acquisition
β βββ 8 # -- Hypothetical DWI acquisition
β βββ 9 # -- Hypothetical DWI acquisition
β βββ 10 # -- Hypothetical DWI acquisition
βββ eyetracking # -- Eye tracking data
βββ EEG # -- EEG data
β βββ raw # -- converted unprocessed EEG data
β βββ preproc # -- EEG data in different preprocessing stages
β βββ results # -- results of EEG data ready for further group based analyses
βββ images # -- folder with processed neuroimaging data mapped for further analyses
β βββ functional # -- holds BOLD and related files
β β βββ concs # -- conc files specifying fMRI files that constitute a series
β β βββ events # -- event (.fidl) files used for de-noising or task analyses
β β βββ glm # -- general linear model descriptions (e.g. for task analyses)
β β βββ movement # -- motion regressors
β βββ ROI # -- holds any information on ROI used in preprocessing (e.g. nuisance ROI)
β β βββ nuisance # -- maps used for definition of ROI used to extract nuisance regressors used for BOLD de-noising
β βββ segmentation # -- holds any files related to segmentation (e.g. bold brain masks, freesurfer segmentations, hcp surface files, etc.)
β β βββ boldmasks # -- whole-brain masks for the BOLD data specifying actual coverage of the brain
β β βββ freesurfer # -- mapping of select freesurfer segmentation data as part of the HCP pipelines (detailed below)
β β β βββ mri # -- freesurfer mapping following HCP pipelines
β β β β βββ orig # -- freesurfer mapping following HCP pipelines
β β β βββ surf # -- freesurfer mapping following HCP pipelines
β β βββ hcp # -- mapping of surface files from the HCP pipelines (detailed below)
β β βββ fsaverage_LR32k # -- HCP mapping of surfaces
β βββ structural # -- holds structural images (T1w image and BOLD template image)
β βββ diffusion # -- holds processed DWI images
βββ hcp # -- folder with data for and from HCP pipeline
βββ <session_id> # -- session id is repeated here for hcp folder mapping
βββ BOLD_1 # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β βββ BOLD_1_nonlin_norm.wdir # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β βββ DistortionCorrectionAndEPIToT1wReg_FLIRTBBRAndFreeSurferBBRbased # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β β βββ FieldMap # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β βββ MotionCorrection_FLIRTbased # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β β βββ BOLD_1_mc.mat # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β βββ MotionMatrices # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β βββ OneStepResampling # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β βββ postvols # -- HCP pipeline outputs (Initial processing for BOLD series 1)
β βββ prevols # -- HCP pipeline outputs (Initial processing for BOLD series 1)
βββ BOLD_2 # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β βββ BOLD_2_nonlin_norm.wdir # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β βββ DistortionCorrectionAndEPIToT1wReg_FLIRTBBRAndFreeSurferBBRbased # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β β βββ FieldMap # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β βββ MotionCorrection_FLIRTbased # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β β βββ BOLD_2_mc.mat # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β βββ MotionMatrices # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β βββ OneStepResampling # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β βββ postvols # -- HCP pipeline outputs (Initial processing for BOLD series 2)
β βββ prevols # -- HCP pipeline outputs (Initial processing for BOLD series 2)
βββ Diffusion # -- HCP pipeline outputs (Raw DWI images mapped for input)
β βββ data # -- HCP pipeline outputs (DWI processing output)
β βββ eddy # -- HCP pipeline outputs (DWI processing output)
β βββ rawdata # -- HCP pipeline outputs (DWI processing output)
β βββ reg # -- HCP pipeline outputs (DWI processing output)
β βββ topup # -- HCP pipeline outputs (DWI processing output)
βββ MNINonLinear # -- HCP pipeline outputs (Final HCP processing results)
β βββ fsaverage # -- HCP pipeline outputs (Low-res surfaces used for CIFTI format generation in atlas space)
β βββ fsaverage_LR32k # -- HCP pipeline outputs (Low-res surfaces used for CIFTI format generation in atlas space)
β βββ Native # -- HCP pipeline outputs (???)
β βββ Results # -- HCP pipeline outputs (All images processed and mapped in MNI space)
β β βββ BOLD_1 # -- HCP pipeline outputs (BOLD 1 images results in MNI space)
β β β βββ BOLD_1_hp2000.ica # -- HCP pipeline outputs (BOLD 1 images FIXICA denoising results)
β β β β βββ filtered_func_data.ica # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β β βββ report # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β β βββ stats # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β βββ fix # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β βββ mc # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β βββ reg # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β βββ RibbonVolumeToSurfaceMapping # -- HCP pipeline outputs (BOLD 1 images surface mapping ouputs)
β β βββ BOLD_2 # -- HCP pipeline outputs (BOLD 2 images results in MNI space)
β β β βββ BOLD_2_hp2000.ica # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β βββ filtered_func_data.ica # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β β βββ report # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β β βββ stats # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β βββ fix # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β βββ mc # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β β βββ reg # -- HCP pipeline outputs (BOLD 2 images FIXICA denoising results)
β β β βββ RibbonVolumeToSurfaceMapping # -- HCP pipeline outputs (BOLD 2 images surface mapping ouputs)
β β βββ Tractography # -- HCP pipeline outputs (DWI tractography results in MNI space)
β β βββ Mat1_logs # -- HCP pipeline outputs (DWI tractography results in MNI space)
β β βββ Mat3_logs # -- HCP pipeline outputs (DWI tractography results in MNI space)
β βββ ROIs # -- HCP pipeline outputs (Structural ROIs for CIFTI file generation in MNI space)
β βββ xfms # -- HCP pipeline outputs (Transformation matrix files)
βββ T1w # -- HCP pipeline outputs (T1w processing inputs mapped here and all structural and DWI native space results)
β βββ ACPCAlignment # -- HCP pipeline outputs (T1w processing results)
β βββ BiasFieldCorrection_sqrtT1wXT1w # -- HCP pipeline outputs (T1w processing results)
β βββ BrainExtraction_FNIRTbased # -- HCP pipeline outputs (T1w processing results)
β βββ Diffusion # -- HCP pipeline outputs (DWI processing results for dtifit)
β βββ Diffusion.bedpostX # -- HCP pipeline outputs (DWI processing results for bedpostX)
β β βββ logs # -- HCP pipeline outputs (DWI processing results for bedpostX)
β β β βββ logs_gpu # -- HCP pipeline outputs (DWI processing results for bedpostX)
β β β βββ monitor # -- HCP pipeline outputs (DWI processing results for bedpostX)
β β βββ xfms # -- HCP pipeline outputs (Transformation matrix files)
β βββ fsaverage # -- HCP pipeline outputs (Hi-res surfaces used for CIFTI format generation in native space)
β βββ fsaverage_LR32k # -- HCP pipeline outputs (Low-res surfaces used for CIFTI format generation native space)
β βββ Native # -- HCP pipeline outputs (???)
β βββ <session_id> # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ bem # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ label # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ mri # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β β βββ orig # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β β βββ transforms # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β β βββ bak # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ scripts # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ src # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ stats # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ surf # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ tmp # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ touch # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β β βββ trash # -- HCP pipeline outputs (FreeSurfer processing results in session native space)
β βββ Results # -- HCP pipeline outputs (DWI tractography results in session native space)
β β βββ log_pretractographydense # -- HCP pipeline outputs
β βββ ROIs # -- HCP pipeline outputs (Structural ROIs for CIFTI file generation in native space)
β βββ T1w1_GradientDistortionUnwarp # -- HCP pipeline outputs
β βββ xfms # -- HCP pipeline outputs (Transformation matrix files)
βββ T2w # -- HCP pipeline outputs (T2w processing inputs mapped here)
β βββ ACPCAlignment # -- HCP pipeline outputs
β βββ BrainExtraction_FNIRTbased # -- HCP pipeline outputs
β βββ T2w1_GradientDistortionUnwarp # -- HCP pipeline outputs
β βββ T2wToT1wDistortionCorrectAndReg # -- HCP pipeline outputs
β β βββ FieldMap # -- HCP pipeline outputs
β β βββ T2w2T1w # -- HCP pipeline outputs
β βββ xfms # -- HCP pipeline outputs (Transformation matrix files)
βββ unprocessed # -- The folder holding unprocessed data
βββ T1w # -- HCP pipeline unprocessed input folder: high-resolution T1w image(s)
βββ SpinEchoFieldMap1 # -- HCP pipeline unprocessed input folder: high-resolution T2w image(s)
βββ SpinEchoFieldMap1 # -- HCP pipeline unprocessed input folder: spin echo field map images
βββ BOLD_1 # -- HCP pipeline unprocessed input folder: bold 1 image
βββ BOLD_1_SBRef # -- HCP pipeline unprocessed input folder: bold 1 single-band reference image
βββ BOLD_2 # -- HCP pipeline unprocessed input folder: bold 2 image
βββ BOLD_2_SBRef # -- HCP pipeline unprocessed input folder: bold 2 single-band reference image
βββ Diffusion # -- HCP pipeline unprocessed input folder: diffusion weighted images
-
Please note that the specific names of folders in the hcp hierarchy may vary based on provided batch parameters and structure options.
-
If
hcp_filename
is set touserdefined
instead of the defaultautomated
, the image and folder names will be named based on the filename specification insession_<pipeline>.txt
. This is a stylistic decision whether you want images to be named numerically (automated
, for exampleBOLD_1
,BOLD_2
...) or in a more verbose way (userdefined
, for examplerfMRI_REST_AP
,rfMRI_REST_PA
, ...). -
For instance:
rfMRI_REST2_PA
andrfMRI_REST2_PA_SBRef
instead ofBOLD_1
andBOLD_1_SBRef
. -
Additionally, if
automated
hcp_filename
is used, the prefix for the resulting bold file names and folders can be defined using--hcp_bold_prefix
parameter. The default value isBOLD_
. -
For full details on how
hcp processing
naming convention settings affect the file hierarchy see HCP File Naming Wiki Section