# Raw Data Archives

## Dataset reference

This README documents the split raw-data archives of the dataset:

**Dataset title:** PhD Results Repository
**Dataset DOI:** https://doi.org/10.15480/882.16414
**Creator:** Ryan Rautenbach
**Affiliation:** Institute of Multiphase Flows, Hamburg University of Technology (TUHH), Hamburg, Germany
**ORCID iD:** 0000-0002-6943-2077
**Version:** 1.0
**Date:** 09.06.2026
**License:** CC BY 4.0 for raw data, processed data, results, figures, and supplementary material; MIT License for code and scripts

For the full dataset-level documentation, recommended citation, related publication, software environment, and reuse information, see the main `README.md`.

## Purpose of this README

This file documents the raw numerical simulation archives contained in the repository.

The original raw-data folder was split into several ZIP archives for storage and upload to TORE. Therefore, the raw data are not provided as one single `00_RAW_data/` directory. Instead, the individual archives listed below together represent the raw numerical simulation data used as the basis for later post-processing, analysis, and figure generation.

The split into several archives is organisational and storage-related. It does not indicate separate studies.

## Raw data archives

### `00_RAW_data-03_cell_lifeline.zip`

Raw cell-lifeline data used for particle-tracking, axial probability-of-presence analysis, axial residence-time analysis, and related numerical reactor characterisation.

### `00_RAW_data-work_on_multiphase_FTLE-FTLE_simulations-0.00vvm.zip`

Raw FTLE simulation data for the non-aerated operating case.

### `00_RAW_data-work_on_multiphase_FTLE-FTLE_simulations-0.05vvm.zip`

Raw FTLE simulation data for the aerated operating case at 0.05 vvm.

### `00_RAW_data-work_on_multiphase_FTLE-FTLE_simulations-0.10vvm.zip`

Raw FTLE simulation data for the aerated operating case at 0.10 vvm.

### `00_RAW_data-work_on_multiphase_FTLE-FTLE_simulations-0.15vvm.zip`

Raw FTLE simulation data for the aerated operating case at 0.15 vvm.

### `00_raw_simulations_no_vol_results.zip`

Raw simulation results excluding large volumetric result files. This archive contains retained numerical simulation outputs without the full volume-result datasets in order to reduce storage size.

## Typical contents of the raw archives

The raw data archives contain numerical simulation cases and directly exported solver outputs. Depending on the archive, this may include:

* numerical simulation case setups
* geometry files
* `input.xml` files
* solver logs
* output folders
* probe statistics
* scalar fields
* particle data
* FTLE-related simulation data
* archived or copied result sets

Many simulation folders contain repeated solver-output structures such as:

* `Input`
* `Output`
* `Stats`
* `BoundaryConditions`
* `ParticleFreq...`
* `Slice...`
* `Volume...`
* probe-specific folders

These repeated structures are documented at archive or directory level rather than by listing every timestep, probe file, slice file, volume file, or repeated solver-output file individually.

## File formats and data structures

Depending on the archive, the raw data may include the following file formats:

* `.zip` compressed archives
* `.xml` simulation input files
* `.txt` tabular or solver-output files
* `.csv` exported or processed tabular files, where included
* `.vti` volumetric simulation output files, where retained
* image or figure files such as `.png`, `.svg`, or `.pdf`, where included
* Python scripts or notebooks, where included as part of the raw or semi-raw processing context

The exact file structure differs between simulation cases. Repeated solver-output directories are therefore described by archive and directory type rather than by listing every individual output file.

## Relation to processed data

Derived analysis files, post-processing scripts, extracted results, plots, and comparison datasets are stored separately in:

`01_processed_data_related_RAW_data.zip`

Raw solver files should generally not be edited directly. Any corrections, derived datasets, or analysis outputs should be documented and stored in the processed-data archive instead.

## Software and system environment

The raw simulation data were generated using M-Star CFD / M-Star LBM-LES and analysed using Python-based post-processing tools.

For the consolidated software and system-environment information, including software versions and general hardware requirements, see the main `README.md`.

Reprocessing the full raw simulation data may require substantial local storage and high-performance computing resources. Some archives may require several hundred GB of free local storage after extraction.

## License and reuse

The raw data archives are licensed under the **Creative Commons Attribution 4.0 International License (CC BY 4.0)**, unless stated otherwise.

Code and scripts contained elsewhere in the repository are licensed under the **MIT License**, unless stated otherwise.

Users reusing the raw data should cite the dataset DOI:

https://doi.org/10.15480/882.16414

For the recommended citation and related dissertation citation, see the main `README.md`.

## Handling notes

The ZIP archives are large and should be extracted only when needed.

Raw solver files should not be manually edited unless a correction is explicitly documented. Derived analysis should be performed using the processed-data archive and associated scripts where available.

The individual raw-data archives together form the raw-data basis for the dissertation-related analyses.
