Published June 4, 2024 | Version v1
Dataset Open

A Cosmic-Scale Benchmark for Symmetry-Preserving Data Processing

  • 1. ROR icon Massachusetts Institute of Technology
  • 2. ROR icon The NSF AI Institute for Artificial Intelligence and Fundamental Interactions

Description

Overview

This dataset is derived from the Big Sobol Sequence (BSQ) of the Quijote simulations, a collection of N-body simulations designed for machine learning applications. Each simulation consists of a point cloud (points in space, with 3D coordinates attached to them) generated under a varying set of cosmological parameters. Each point represents a simulated galaxy and is accompanied by associated properties such as velocity and mass. The cardinality of each point cloud is 5000 points. The dataset is split into 11,200 simulations in the training set, 608 in the validation set, and 576 in the test set. 

File structure

The dataset is provided in TFRecord format. The training simulations are split across 50 TFRecord files following the naming convention halos_train_<i>.tfrecord. The validation and test sets are provided in halos_val_1.tfrecord and halos_test_1.tfrecord, respectively.

Each dataset can be loaded using TensorFlow as shown in the code example below:

import tensorflow as tf

files = tf.io.gfile.glob(f"halos*train*.tfrecord") # replace 'train' with 'val' or 'test'
dataset = tf.data.TFRecordDataset(files)

TFRecord structure

Each record (corresponding to a point cloud) in a TFRecord file contains the following feature fields:

  • "x": (tensor of shape [5000], dtype=float) - Position along x axis
  • "y": (tensor of shape [5000], dtype=float) - Position along y axis
  • "z": (tensor of shape [5000], dtype=float) - Position along z axis
  • "v_x": (tensor of shape [5000], dtype=float) - Velocity along x axis
  • "v_y": (tensor of shape [5000], dtype=float) - Velocity along y axis
  • "v_z": (tensor of shape [5000], dtype=float) - Velocity along z axis
  • "J_x": (tensor of shape [5000], dtype=float) - Angular momentum along x axis
  • "J_y": (tensor of shape [5000], dtype=float) - Angular momentum along y axis
  • "J_z": (tensor of shape [5000], dtype=float) - Angular momentum along z axis
  • "M200c": (tensor of shape [5000], dtype=float) - Virial mass
  • "Omega_m": (float) - Matter density
  • "Omega_b": (float) - Baryon density
  • "h": (float) - Hubble parameter
  • "n_s": (float) - Density perturbation spectral index
  • "sigma_8": (float) - RMS matter fluctuation amplitude on a scale of 8 Mpc/h
  • "tpcf": (tensor of shape [24], dtype=float) - Two-point correlation function

 

Files

Files (7.2 GB)

Name Size Download all
md5:8f027a6ad0d2351b65ee9b7bad1c9332
129.5 MB Download
md5:e836aeb2404c2247d7abfe4c7c903900
139.2 MB Download
md5:70b41fdc24f3e2a9065939805ddabfd5
139.2 MB Download
md5:a9d63355258009c1611d38c312e33fa3
139.2 MB Download
md5:115edb138abdb0672d67b1e674e102e4
139.2 MB Download
md5:1e643bfa8da0588dafcb82ed8d0da7c7
139.2 MB Download
md5:489f6af684a5227abbc522ed29a2ecc7
139.0 MB Download
md5:9af4464390540ca10264b7038cfe46fe
139.2 MB Download
md5:fab410cfed75515d306a00e2f58b6ce8
139.2 MB Download
md5:29050f70f7c59f134aa51488688dc3ed
139.2 MB Download
md5:4d09474babdc21dd67c6ef630c9ce254
139.2 MB Download
md5:c890c74aade93ccb4c483d2ebe6e8470
139.2 MB Download
md5:0f778c063f8408c173c4319025499669
139.2 MB Download
md5:82f400dfdfce58eab7b4bbb6f984ccb0
139.2 MB Download
md5:1a0f8a2d16f692343917837fc77807d8
139.2 MB Download
md5:c2678b0ae8fa0fe985cebc03c31700a1
139.2 MB Download
md5:2cf4a4b8315cbc8740c4e06b62588382
139.2 MB Download
md5:cf01cd63502f69831bd81839b356e3d3
139.2 MB Download
md5:ca3215bfe74c308559d09a004f14733c
139.2 MB Download
md5:ce52f2299e8282094bf2ac517c662ded
139.2 MB Download
md5:341465ebba29e1d268c98356967709f8
139.2 MB Download
md5:dd843fcbecc97e435aa0ae88f260188c
139.2 MB Download
md5:a6184040c1d9828811d1efd81e4d2e1f
139.2 MB Download
md5:3f6bee4858230b67de84c6cdac8ed00e
139.2 MB Download
md5:1699630523621550632c5d413cff2ed6
139.2 MB Download
md5:35f4d1e26637bcdad8b1f1bd6de889db
139.2 MB Download
md5:e3d8802d946a204f49780ceeb4461563
139.2 MB Download
md5:af10bf1b962b59b65bfc6902aed042df
139.2 MB Download
md5:354a6d7beec6c56eb1b4efb11ca3b2b4
137.9 MB Download
md5:142c07b72b1e96a5f6a8358e6e6abd29
139.2 MB Download
md5:43a45928bcff049f5f7f1bd26d654ea1
139.2 MB Download
md5:0c3d6dc0f4a4e86f269dae21c2986707
139.0 MB Download
md5:3ab30f8ccdd9b59571eafb985a8a4657
139.2 MB Download
md5:f7e846fc2d66a9f30bb385dff102abb3
139.2 MB Download
md5:7aafa521d48f506d358d6303b7f10b9a
139.2 MB Download
md5:9f9782e6282ac3d594d08f4c2b702cd0
139.2 MB Download
md5:10de64a4fc404ed68bf2236a4a987590
139.2 MB Download
md5:386fbb62c09050bb118abf53f9c675bc
139.2 MB Download
md5:466468d5a2cc5bc35949841b4d2474b1
139.2 MB Download
md5:6db9e8ac986d2b7e98e90e5f91d0a7aa
139.2 MB Download
md5:946adb064c2d41d142c544edf2caa597
139.2 MB Download
md5:63127809c53ba2796c7a553f54ac1427
139.2 MB Download
md5:5f57163f7b0018f8a2d2e3a19ed7f26c
139.2 MB Download
md5:d551c2cb66b87c9f2c6d276a73d707fc
139.2 MB Download
md5:10108c3ce4af82aa2877345b0940a610
139.2 MB Download
md5:b021a68c7c9b0044d954ba09cb0a3fd8
139.2 MB Download
md5:f7e4f1e0cae5111aece317f3c6fdae14
139.2 MB Download
md5:fd0a3347c3cf98653e42346f7aaf6b74
139.2 MB Download
md5:37fc6b7f4ea201d31ba9f9fb3a6a228c
139.2 MB Download
md5:081d61adcdc581725f3d318304bb540b
139.2 MB Download
md5:757ad06e7aff58397936513d6284ab0c
139.2 MB Download
md5:273698c9aed6b794b6d31390c63ad839
139.2 MB Download

Additional details

Software