Skip to content

NYU-CDS-Capstone-Project/HydroGAN

Repository files navigation

HydroGAN

TODO

  • Data

    • Change the subcube sampling procedure to let non-128 multiple cubes to be selected as well
      • Current way = (2048/128 - 2)^3 = 2744 different subcubes only
      • New way = (2048 - 128*2)^3 = 5.7 billion different subcube combinations
    • Change the subcube sampling procedure
      • Currently, the subsampling subsamples all the training samples and stores them.
        • in the self.samples under HydrogenDataset2
      • Change this to quickly sample when the get_samples() function is called.
        • incorporate get_samples into "__getitem__"
        • get_samples should open f itself without reading the whole 2048 cube wholly, sample using the coordinates and f.close()
    • Calculate the total number of different subcubes we can sample from
  • Cube transformations/Scaling

  • VAE

    • Investigate why VAE is not producing any hydrogen masses? (Probably an issue with the decoder part of the VAE). In the decode() of the VAE class, create multiple checkpoints: sum all the values in the out variable and plot the evolution of the sum. Compare with the sum of the input subcubes.

      • No Transformation Output
        • First Convolution Output alt text
        • First ReLU Output alt text
        • First MaxUnpool Output alt text
        • Second Convolution Output alt text
        • Second ReLU Output alt text
        • Second MaxUnpool Output alt text
        • Third Convolution Output alt text
        • Third ReLU Output alt text
  • GAN

  • Hybrid Models

  • Validation

    • 1 PDF
    • Log Histograms (compare the real distribution vs. the generated ones)
      • VAE: the output of the decoder
      • GAN: The output of Generator(noise)
    • Power spectrum
    • 3D Plot comparisons

Data

sample data is in data folder where a .h5 file is put. sample_32.h5 is 32 of randomly sampled cubes with dimensions ?x?x?